Nous Research
Hermes 4 - Llama-3.1 405B (Non-reasoning)
AI model by Nous Research. Real-time pricing and benchmark data.
Benchmarks
Coding Index: 18.1
Math Index: 15.3
GPQA Diamond: 53.6%
MMLU-Pro: 72.9%
LiveCodeBench: 54.6%
AIME 2025: 15.3%
SciCode: 34.6%
IFBench: 34.8%
TerminalBench: 9.8%
Compare with similar models
| Model | Input ($ per 1M tokens) | Output ($ per 1M tokens) | Speed |
|---|---|---|---|
| Hermes 4 - Llama-3.1 405B (Non-reasoning) (current) | $1.00 | $3.00 | 33 tok/s |
| Gemini 3 Flash Preview (Non-reasoning) | $0.50 | $3.00 | 190 tok/s |
| Gemini 3 Flash Preview (Reasoning) | $0.50 | $3.00 | 180 tok/s |
| Kimi K2.5 (Reasoning) | $0.60 | $3.00 | 36 tok/s |
| Kimi K2.5 (Non-reasoning) | $0.60 | $3.00 | 33 tok/s |
| MiMo-V2-Pro | $1.00 | $3.00 | 83 tok/s |
Example Costs
| Scenario | Tokens (input / output) | Cost |
|---|---|---|
| Single request | 1.0K / 500 | $0.0025 |
| 1K requests/day | 1.0M / 500.0K | $2.50 |
| 10K requests/day | 10.0M / 5.0M | $25.00 |
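
The example costs above follow from the listed input and output prices. The sketch below shows the arithmetic, assuming the $1.00 and $3.00 figures are USD per 1M tokens (an inference from the example costs, not something the page states explicitly) and that every request uses 1,000 input and 500 output tokens. The function and constant names are hypothetical and exist only for illustration.

```python
from decimal import Decimal

# Assumption: prices are USD per 1M tokens, inferred from the example costs.
INPUT_PRICE_PER_M = Decimal("1.00")
OUTPUT_PRICE_PER_M = Decimal("3.00")
TOKENS_PER_M = Decimal(1_000_000)


def request_cost(input_tokens: int, output_tokens: int, requests: int = 1) -> Decimal:
    """Return the total USD cost for `requests` identical requests."""
    per_request = (
        Decimal(input_tokens) * INPUT_PRICE_PER_M
        + Decimal(output_tokens) * OUTPUT_PRICE_PER_M
    ) / TOKENS_PER_M
    return per_request * requests


if __name__ == "__main__":
    # Hypothetical workload: 1,000 input tokens and 500 output tokens per request.
    for requests in (1, 1_000, 10_000):
        cost = request_cost(1_000, 500, requests)
        print(f"{requests:>6} request(s): ${cost:.4f}")
```

Under these assumptions the three scenarios come out to $0.0025, $2.50, and $25.00, matching the listed figures. `Decimal` is used rather than floats so that small per-request amounts do not pick up binary rounding error.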