NVIDIA

NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning)

AI model by NVIDIA. Real-time pricing and benchmark data.

Pricing (per 1M tokens)

Input$0.05
Output$0.20
Blended (3:1)$0.09

Source: Artificial Analysis

Performance

Output Speed80 tok/s
Time to First Token292ms

Median values from Artificial Analysis

Compare with similar models

ModelInputOutputSpeed
NVIDIA Nemotron 3 Nano 30B A3B (Non-reasoning)Current
$0.05$0.2080 tok/s
gpt-oss-20B (high)
$0.06$0.20316 tok/s
gpt-oss-20B (low)
$0.06$0.20321 tok/s
Ministral 3 14B
$0.20$0.20125 tok/s
Olmo 3 7B Instruct
$0.10$0.200 tok/s
Apertus 8B Instruct
$0.10$0.20140 tok/s

Example Costs

Single Request
$0.0001
1.0K in / 500 out
1K Requests/day
$0.1500
1.0M in / 500.0K out
10K Requests/day
$1.50
10.0M in / 5.0M out