Nous Research

Hermes 4 - Llama-3.1 405B (Reasoning)

AI model by Nous Research. Real-time pricing and benchmark data.

Pricing (per 1M tokens)

Input: $1.00
Output: $3.00
Blended (3:1): $1.50

Source: Artificial Analysis
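
The blended figure is consistent with a weighted average of the input and output prices at a 3:1 input-to-output token ratio. A minimal sketch of that arithmetic, assuming this is what "Blended (3:1)" means (the function name `blended_price` and its parameters are hypothetical, not part of any published API):

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: float = 3.0, output_weight: float = 1.0) -> float:
    """Weighted average of per-1M-token prices.

    Assumption: "Blended (3:1)" means 3 input tokens for every 1 output token.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Listed prices for this model: $1.00 input, $3.00 output per 1M tokens.
print(f"${blended_price(1.00, 3.00):.2f}")  # prints "$1.50"
```

Under this assumption, (3 x $1.00 + 1 x $3.00) / 4 = $1.50, which matches the listed blended price.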

Performance

Output Speed: 31 tok/s
Time to First Token: 710 ms

Median values from Artificial Analysis

Compare with similar models

Model                                            Input   Output  Speed
Hermes 4 - Llama-3.1 405B (Reasoning) (current)  $1.00   $3.00   31 tok/s
Gemini 3 Flash Preview (Non-reasoning)           $0.50   $3.00   190 tok/s
Gemini 3 Flash Preview (Reasoning)               $0.50   $3.00   180 tok/s
Kimi K2.5 (Reasoning)                            $0.60   $3.00   36 tok/s
Kimi K2.5 (Non-reasoning)                        $0.60   $3.00   33 tok/s
MiMo-V2-Pro                                      $1.00   $3.00   83 tok/s

Example Costs

Single Request: $0.0025 (1.0K in / 500 out)
1K Requests/day: $2.50 (1.0M in / 500.0K out)
10K Requests/day: $25.00 (10.0M in / 5.0M out)
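
The example costs above follow directly from the per-1M-token prices. A minimal sketch of the calculation, assuming each request uses 1,000 input tokens and 500 output tokens as in the listing (the function name `request_cost` and the constant are hypothetical, chosen for illustration):

```python
TOKENS_PER_MILLION = 1_000_000


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float = 1.00, output_price: float = 3.00) -> float:
    """Cost in USD, given token counts and prices quoted per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / TOKENS_PER_MILLION


# Assumed workload per request: 1,000 input tokens and 500 output tokens.
scenarios = [("Single request", 1), ("1K requests/day", 1_000), ("10K requests/day", 10_000)]
for label, requests in scenarios:
    cost = request_cost(requests * 1_000, requests * 500)
    print(f"{label}: ${cost:,.4f}")
```

With the listed prices this reproduces the $0.0025, $2.50, and $25.00 figures. Passing different `input_price` and `output_price` values gives the same estimate for the other models in the comparison.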