NVIDIA

Nemotron 3 Nano Omni 30B A3B Reasoning

AI model by NVIDIA. Real-time pricing and benchmark data.

Pricing (per 1M tokens)

Input: $0.07
Output: $0.30
Blended (3:1): $0.13

Source: Artificial Analysis
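The blended figure can be reproduced from the input and output prices. The sketch below assumes that "3:1" means three input tokens are weighted for every one output token; the page does not spell out the formula, and the function name `blended_price` is illustrative only.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: int = 3, output_weight: int = 1) -> float:
    """Weighted average price per 1M tokens.

    Assumption: a 3:1 blend weights input tokens three times as
    heavily as output tokens.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Listed prices in USD per 1M tokens.
blended = blended_price(0.07, 0.30)
print(f"Blended (3:1): ${blended:.4f} per 1M tokens")
```

With the listed prices this gives roughly $0.1275 per 1M tokens, in line with the $0.13 shown.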

Performance

Output Speed: 295 tok/s
Time to First Token: 582 ms

Median values from Artificial Analysis
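For a rough sense of end-to-end latency, the two figures can be combined. This is a back-of-the-envelope sketch that assumes response time is approximately time to first token plus output tokens divided by output speed; real latency varies with load, prompt length, and how many reasoning tokens the model emits. The function name is hypothetical.

```python
def estimated_response_seconds(output_tokens: int,
                               ttft_ms: float = 582.0,
                               tokens_per_second: float = 295.0) -> float:
    """Approximate seconds until the full response has been generated.

    Assumption: latency = time to first token + generation time at the
    median output speed. Defaults are the median values listed above.
    """
    return ttft_ms / 1000.0 + output_tokens / tokens_per_second


# A 500-token response at the listed medians.
seconds = estimated_response_seconds(500)
print(f"Estimated time for 500 output tokens: {seconds:.2f} s")
```

At the listed medians, a 500-token response works out to a little under 2.3 seconds.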

Compare with similar models

Model | Input | Output | Speed
Nemotron 3 Nano Omni 30B A3B Reasoning (current) | $0.07 | $0.30 | 295 tok/s
Gemma 4 12B (Reasoning) | $0.10 | $0.30 | 159 tok/s
Step 3.5 Flash 2603 | $0.10 | $0.30 | 214 tok/s
MiMo-V2-Flash (Feb 2026) | $0.10 | $0.30 | 95 tok/s
MiMo-V2-Flash (Non-reasoning) | $0.10 | $0.30 | 94 tok/s
Ling 2.6 Flash | $0.10 | $0.30 | 0 tok/s

Example Costs

Single Request: $0.0002 (1.0K input / 500 output tokens)
1K Requests/day: $0.2250 (1.0M input / 500.0K output tokens)
10K Requests/day: $2.25 (10.0M input / 5.0M output tokens)
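These estimates follow from the per-million-token prices. The sketch below uses the prices as displayed ($0.07 input, $0.30 output) and assumes every request uses 1,000 input and 500 output tokens; the function names are illustrative.

```python
# Listed prices in USD per 1M tokens (as displayed, possibly rounded).
INPUT_PRICE_PER_M = 0.07
OUTPUT_PRICE_PER_M = 0.30


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of one request at the listed per-million-token prices."""
    return (input_tokens * INPUT_PRICE_PER_M
            + output_tokens * OUTPUT_PRICE_PER_M) / 1_000_000


def daily_cost(requests_per_day: int,
               input_tokens: int = 1_000,
               output_tokens: int = 500) -> float:
    """Daily cost in USD, assuming every request has the same token counts."""
    return requests_per_day * request_cost(input_tokens, output_tokens)


for n in (1, 1_000, 10_000):
    print(f"{n:>6} requests/day: ${daily_cost(n):.4f}")
```

With the displayed prices this yields about $0.22 and $2.20 for the two daily scenarios, slightly below the $0.2250 and $2.25 listed. One possible explanation is that the listed costs are computed from an unrounded input price, but that is an assumption rather than something the page states.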