NVIDIA

NVIDIA Nemotron Nano 9B V2 (Reasoning)

AI model by NVIDIA. Real-time pricing and benchmark data.

Pricing (per 1M tokens)

Input$0.04
Output$0.16
Blended (3:1)$0.07

Source: Artificial Analysis

Performance

Output Speed127 tok/s
Time to First Token246ms

Median values from Artificial Analysis

Compare with similar models

ModelInputOutputSpeed
NVIDIA Nemotron Nano 9B V2 (Reasoning)Current
$0.04$0.16127 tok/s
Llama 3.2 Instruct 11B (Vision)
$0.16$0.1681 tok/s
Ministral 3 8B
$0.15$0.15192 tok/s
Qwen3.5 9B (Reasoning)
$0.10$0.1561 tok/s
Solar Mini
$0.15$0.1592 tok/s
Llama 3 Instruct 8B
$0.04$0.1484 tok/s

Example Costs

Single Request
$0.0001
1.0K in / 500 out
1K Requests/day
$0.1200
1.0M in / 500.0K out
10K Requests/day
$1.20
10.0M in / 5.0M out