NVIDIA
Llama Nemotron Super 49B v1.5 (Non-reasoning)
AI model by NVIDIA. Real-time pricing and benchmark data.
Benchmarks
Coding Index10.5
Math Index8.0
GPQA Diamond48.1%
MMLU-Pro69.2%
LiveCodeBench29.0%
AIME 20258.0%
MATH-50077.0%
SciCode23.8%
IFBench32.9%
TerminalBench3.8%
Compare with similar models
| Model | Input | Output | Speed |
|---|---|---|---|
Llama Nemotron Super 49B v1.5 (Non-reasoning)Current | $0.10 | $0.40 | 51 tok/s |
| Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning) | $0.10 | $0.40 | 344 tok/s |
| Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning) | $0.10 | $0.40 | 365 tok/s |
| Hermes 4 - Llama-3.1 70B (Reasoning) | $0.13 | $0.40 | 81 tok/s |
| Hermes 4 - Llama-3.1 70B (Non-reasoning) | $0.13 | $0.40 | 81 tok/s |
| GPT-5 nano (minimal) | $0.05 | $0.40 | 163 tok/s |
Compare Llama Nemotron Super 49B v1.5 (Non-reasoning) with
Example Costs
Single Request
$0.0003
1.0K in / 500 out
1K Requests/day
$0.3000
1.0M in / 500.0K out
10K Requests/day
$3.00
10.0M in / 5.0M out