NVIDIA
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning)
AI model by NVIDIA. Real-time pricing and benchmark data.
Benchmarks
Math Index50.0
GPQA Diamond40.8%
MMLU-Pro55.6%
LiveCodeBench49.3%
AIME 202550.0%
MATH-50094.7%
SciCode10.1%
IFBench25.5%
Compare with similar models
| Model | Input | Output | Speed |
|---|---|---|---|
Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning)Current | $0.00 | $0.00 | 0 tok/s |
| GPT-5.4 nano (Non-Reasoning) | $0.20 | $1.25 | 177 tok/s |
| gpt-oss-120B (low) | $0.15 | $0.60 | 255 tok/s |
| GPT-5.4 nano (medium) | $0.20 | $1.25 | 171 tok/s |
| GPT-5.4 mini (xhigh) | $0.75 | $4.50 | 219 tok/s |
| GPT-5.4 mini (Non-Reasoning) | $0.75 | $4.50 | 202 tok/s |
Compare Llama 3.1 Nemotron Nano 4B v1.1 (Reasoning) with
Example Costs
Single Request
<$0.0001
1.0K in / 500 out
1K Requests/day
<$0.0001
1.0M in / 500.0K out
10K Requests/day
<$0.0001
10.0M in / 5.0M out