NVIDIA
Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)
AI model by NVIDIA. Real-time pricing and benchmark data.
Benchmarks
Coding Index: 13.1
Math Index: 63.7
GPQA Diamond: 72.8%
MMLU-Pro: 82.5%
LiveCodeBench: 64.1%
AIME 2025: 63.7%
MATH-500: 95.2%
SciCode: 34.7%
IFBench: 38.2%
TerminalBench: 2.3%
Compare with similar models
| Model | Input ($/1M tokens) | Output ($/1M tokens) | Speed |
|---|---|---|---|
| Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) (current) | $0.60 | $1.80 | 42 tok/s |
| GLM-4.5V (Non-reasoning) | $0.60 | $1.80 | 69 tok/s |
| GLM-4.5V (Reasoning) | $0.60 | $1.80 | 56 tok/s |
| Llama 3 Instruct 70B | $0.58 | $1.75 | 40 tok/s |
| GLM-4.5 (Reasoning) | $0.49 | $1.90 | 48 tok/s |
| DeepSeek V3.1 (Reasoning) | $0.60 | $1.70 | 0 tok/s |
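The rows above can be compared on a fixed workload with a short script. This is a minimal sketch, not the site's own calculator. It assumes the prices are US dollars per 1M tokens (consistent with the Example Costs figures on this page), that Speed is output tokens per second, and that a speed of 0 tok/s means no measurement is available. The names `MODELS`, `cost_usd`, `generation_seconds`, and `compare` are hypothetical.

```python
# Rows copied from the comparison table:
# (model, input price, output price, speed in tok/s).
# Assumption: prices are USD per 1M tokens.
MODELS = [
    ("Llama 3.1 Nemotron Ultra 253B v1 (Reasoning)", 0.60, 1.80, 42),
    ("GLM-4.5V (Non-reasoning)", 0.60, 1.80, 69),
    ("GLM-4.5V (Reasoning)", 0.60, 1.80, 56),
    ("Llama 3 Instruct 70B", 0.58, 1.75, 40),
    ("GLM-4.5 (Reasoning)", 0.49, 1.90, 48),
    ("DeepSeek V3.1 (Reasoning)", 0.60, 1.70, 0),
]


def cost_usd(price_in, price_out, tokens_in, tokens_out):
    """Cost of one request, given per-1M-token prices."""
    return (tokens_in * price_in + tokens_out * price_out) / 1_000_000


def generation_seconds(speed, tokens_out):
    """Rough time to stream the output; None when no speed is listed.

    Assumption: a speed of 0 tok/s is treated as missing data.
    Time to first token is ignored.
    """
    if speed <= 0:
        return None
    return tokens_out / speed


def compare(tokens_in=1000, tokens_out=500):
    """Return (model, cost, seconds) tuples sorted by cost, cheapest first."""
    rows = [
        (name,
         cost_usd(p_in, p_out, tokens_in, tokens_out),
         generation_seconds(speed, tokens_out))
        for name, p_in, p_out, speed in MODELS
    ]
    return sorted(rows, key=lambda row: row[1])


if __name__ == "__main__":
    for name, cost, seconds in compare():
        timing = "n/a" if seconds is None else f"{seconds:.1f} s"
        print(f"{name}: ${cost:.5f} per request, about {timing}")
```

Sorting by cost is only one possible view; for latency-sensitive workloads the same tuples could be sorted by the seconds column instead.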
Example Costs
| Scenario | Input tokens | Output tokens | Cost |
|---|---|---|---|
| Single request | 1.0K | 500 | $0.0015 |
| 1K requests/day | 1.0M | 500.0K | $1.50 |
| 10K requests/day | 10.0M | 5.0M | $15.00 |
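The example costs follow from the listed prices if those prices are read as US dollars per 1M tokens: 1,000 input tokens at $0.60 plus 500 output tokens at $1.80 comes to $0.0006 + $0.0009 = $0.0015. The sketch below reproduces that arithmetic under the same per-1M-token assumption; `request_cost` and `daily_cost` are hypothetical helper names, not part of any published API.

```python
# Prices for Llama 3.1 Nemotron Ultra 253B v1 (Reasoning) as listed on this page.
# Assumption: both figures are USD per 1M tokens.
INPUT_PRICE_PER_M = 0.60
OUTPUT_PRICE_PER_M = 1.80


def request_cost(input_tokens, output_tokens,
                 price_in=INPUT_PRICE_PER_M, price_out=OUTPUT_PRICE_PER_M):
    """Cost in USD of a single request."""
    return (input_tokens * price_in + output_tokens * price_out) / 1_000_000


def daily_cost(requests_per_day, input_tokens=1000, output_tokens=500):
    """Cost in USD of a day of identical requests."""
    return requests_per_day * request_cost(input_tokens, output_tokens)


if __name__ == "__main__":
    for label, requests in [("Single request", 1),
                            ("1K requests/day", 1_000),
                            ("10K requests/day", 10_000)]:
        print(f"{label}: ${daily_cost(requests):.4f}")
```

Changing `input_tokens` and `output_tokens` gives an estimate for other request shapes; reasoning models often emit more output tokens than a fixed 500, so real costs may differ.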