Nous Research
Hermes 4 - Llama-3.1 70B (Non-reasoning)
AI model by Nous Research. Real-time pricing and benchmark data.
Benchmarks
Coding Index9.2
Math Index11.3
GPQA Diamond49.1%
MMLU-Pro66.4%
LiveCodeBench26.9%
AIME 202511.3%
SciCode27.7%
IFBench29.0%
TerminalBench0.0%
Compare with similar models
| Model | Input | Output | Speed |
|---|---|---|---|
Hermes 4 - Llama-3.1 70B (Non-reasoning)Current | $0.13 | $0.40 | 81 tok/s |
| Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning) | $0.10 | $0.40 | 344 tok/s |
| Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning) | $0.10 | $0.40 | 365 tok/s |
| Llama Nemotron Super 49B v1.5 (Non-reasoning) | $0.10 | $0.40 | 51 tok/s |
| Llama Nemotron Super 49B v1.5 (Reasoning) | $0.10 | $0.40 | 51 tok/s |
| GPT-5 nano (minimal) | $0.05 | $0.40 | 163 tok/s |
Compare Hermes 4 - Llama-3.1 70B (Non-reasoning) with
Example Costs
Single Request
$0.0003
1.0K in / 500 out
1K Requests/day
$0.3300
1.0M in / 500.0K out
10K Requests/day
$3.30
10.0M in / 5.0M out