Nous Research

Hermes 3 - Llama-3.1 70B

AI model by Nous Research. Real-time pricing and benchmark data.

Pricing (per 1M tokens)

Input$0.30
Output$0.30
Blended (3:1)$0.30

Source: Artificial Analysis

Performance

Output Speed40 tok/s
Time to First Token336ms

Median values from Artificial Analysis

Compare with similar models

ModelInputOutputSpeed
Hermes 3 - Llama-3.1 70BCurrent
$0.30$0.3040 tok/s
Step 3.5 Flash
$0.10$0.3093 tok/s
MiMo-V2-Flash (Feb 2026)
$0.10$0.30134 tok/s
MiMo-V2-Flash (Non-reasoning)
$0.10$0.30130 tok/s
Mistral Small 3
$0.10$0.30162 tok/s
Mistral Small 3.1
$0.10$0.30170 tok/s

Example Costs

Single Request
$0.0004
1.0K in / 500 out
1K Requests/day
$0.4500
1.0M in / 500.0K out
10K Requests/day
$4.50
10.0M in / 5.0M out