NVIDIA

NVIDIA Nemotron 3 Nano 30B A3B (Reasoning)

AI model by NVIDIA. Real-time pricing and benchmark data.

Pricing (per 1M tokens)

Input$0.06
Output$0.24
Blended (3:1)$0.10

Source: Artificial Analysis

Performance

Output Speed138 tok/s
Time to First Token1001ms

Median values from Artificial Analysis

Compare with similar models

ModelInputOutputSpeed
NVIDIA Nemotron 3 Nano 30B A3B (Reasoning)Current
$0.06$0.24138 tok/s
Nova Lite
$0.06$0.24190 tok/s
Granite 4.0 H Small
$0.06$0.25524 tok/s
Llama 2 Chat 7B
$0.05$0.25120 tok/s
Mistral 7B Instruct
$0.25$0.25188 tok/s
Granite 3.3 8B (Non-reasoning)
$0.03$0.25386 tok/s

Example Costs

Single Request
$0.0002
1.0K in / 500 out
1K Requests/day
$0.1800
1.0M in / 500.0K out
10K Requests/day
$1.80
10.0M in / 5.0M out