NVIDIA
Llama 3.3 Nemotron Super 49B v1 (Non-reasoning)
AI model by NVIDIA. Real-time pricing and benchmark data.
Benchmarks
Coding Index7.6
Math Index7.7
GPQA Diamond51.7%
MMLU-Pro69.8%
LiveCodeBench28.0%
AIME 20257.7%
MATH-50077.5%
SciCode22.9%
IFBench39.5%
TerminalBench0.0%
Compare with similar models
| Model | Input | Output | Speed |
|---|---|---|---|
Llama 3.3 Nemotron Super 49B v1 (Non-reasoning)Current | $0.00 | $0.00 | 0 tok/s |
| GPT-5.4 nano (Non-Reasoning) | $0.20 | $1.25 | 177 tok/s |
| gpt-oss-120B (low) | $0.15 | $0.60 | 255 tok/s |
| GPT-5.4 nano (medium) | $0.20 | $1.25 | 171 tok/s |
| GPT-5.4 mini (xhigh) | $0.75 | $4.50 | 219 tok/s |
| GPT-5.4 mini (Non-Reasoning) | $0.75 | $4.50 | 202 tok/s |
Compare Llama 3.3 Nemotron Super 49B v1 (Non-reasoning) with
Example Costs
Single Request
<$0.0001
1.0K in / 500 out
1K Requests/day
<$0.0001
1.0M in / 500.0K out
10K Requests/day
<$0.0001
10.0M in / 5.0M out