NVIDIA
Llama Nemotron Super 49B v1.5 (Reasoning)
AI model by NVIDIA. Real-time pricing and benchmark data.
Benchmarks
Coding Index15.2
Math Index76.7
GPQA Diamond74.8%
MMLU-Pro81.4%
LiveCodeBench73.7%
AIME 202576.7%
MATH-50098.3%
SciCode34.8%
IFBench37.0%
TerminalBench5.3%
Compare with similar models
| Model | Input | Output | Speed |
|---|---|---|---|
Llama Nemotron Super 49B v1.5 (Reasoning)Current | $0.10 | $0.40 | 51 tok/s |
| Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning) | $0.10 | $0.40 | 344 tok/s |
| Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning) | $0.10 | $0.40 | 365 tok/s |
| Hermes 4 - Llama-3.1 70B (Reasoning) | $0.13 | $0.40 | 81 tok/s |
| Hermes 4 - Llama-3.1 70B (Non-reasoning) | $0.13 | $0.40 | 81 tok/s |
| GPT-5 nano (minimal) | $0.05 | $0.40 | 163 tok/s |
Compare Llama Nemotron Super 49B v1.5 (Reasoning) with
Example Costs
Single Request
$0.0003
1.0K in / 500 out
1K Requests/day
$0.3000
1.0M in / 500.0K out
10K Requests/day
$3.00
10.0M in / 5.0M out