NVIDIA
Llama 3.1 Nemotron Instruct 70B
AI model by NVIDIA. Real-time pricing and benchmark data.
Benchmarks
Coding Index10.8
Math Index11.0
GPQA Diamond46.5%
MMLU-Pro69.0%
LiveCodeBench16.9%
AIME 202511.0%
MATH-50073.3%
SciCode23.3%
IFBench30.7%
TerminalBench4.5%
Compare with similar models
| Model | Input | Output | Speed |
|---|---|---|---|
Llama 3.1 Nemotron Instruct 70BCurrent | $1.20 | $1.20 | 33 tok/s |
| MiniMax-M2.7 | $0.30 | $1.20 | 44 tok/s |
| KAT-Coder-Pro V1 | $0.30 | $1.20 | 38 tok/s |
| Qwen3 Coder Next | $0.35 | $1.20 | 166 tok/s |
| MiniMax-M2 | $0.30 | $1.20 | 49 tok/s |
| MiniMax-M2.5 | $0.30 | $1.20 | 47 tok/s |
Compare Llama 3.1 Nemotron Instruct 70B with
Example Costs
Single Request
$0.0018
1.0K in / 500 out
1K Requests/day
$1.80
1.0M in / 500.0K out
10K Requests/day
$18.00
10.0M in / 5.0M out