Nous Research
Hermes 4 - Llama-3.1 70B (Reasoning)
AI model by Nous Research. Real-time pricing and benchmark data.
Benchmarks
Coding Index14.4
Math Index68.7
GPQA Diamond69.9%
MMLU-Pro81.1%
LiveCodeBench65.3%
AIME 202568.7%
SciCode34.1%
IFBench31.3%
TerminalBench4.5%
Compare with similar models
| Model | Input | Output | Speed |
|---|---|---|---|
Hermes 4 - Llama-3.1 70B (Reasoning)Current | $0.13 | $0.40 | 81 tok/s |
| Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning) | $0.10 | $0.40 | 344 tok/s |
| Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning) | $0.10 | $0.40 | 365 tok/s |
| Llama Nemotron Super 49B v1.5 (Non-reasoning) | $0.10 | $0.40 | 51 tok/s |
| Llama Nemotron Super 49B v1.5 (Reasoning) | $0.10 | $0.40 | 51 tok/s |
| GPT-5 nano (minimal) | $0.05 | $0.40 | 163 tok/s |
Compare Hermes 4 - Llama-3.1 70B (Reasoning) with
Example Costs
Single Request
$0.0003
1.0K in / 500 out
1K Requests/day
$0.3300
1.0M in / 500.0K out
10K Requests/day
$3.30
10.0M in / 5.0M out