NVIDIA

Nemotron 3 Ultra 550B A55B (Reasoning)

AI model by NVIDIA. Real-time pricing and benchmark data.

Pricing (per 1M tokens)

Input$0.37
Output$1.08
Blended (3:1)$0.55

Source: Artificial Analysis

Performance

Output Speed390 tok/s
Time to First Token502ms

Median values from Artificial Analysis

Compare with similar models

ModelInputOutputSpeed
Nemotron 3 Ultra 550B A55B (Reasoning)Current
$0.37$1.08390 tok/s
ERNIE 4.5 300B A47B
$0.28$1.1024 tok/s
DeepSeek R1 Distill Llama 70B
$0.70$1.0545 tok/s
Step 3.7 Flash
$0.20$1.15392 tok/s
Qwen3 8B (Reasoning)
$0.11$1.1566 tok/s
Qwen3.7 Plus
$0.40$1.1654 tok/s

Example Costs

Single Request
$0.0009
1.0K in / 500 out
1K Requests/day
$0.9100
1.0M in / 500.0K out
10K Requests/day
$9.10
10.0M in / 5.0M out