Meta

Llama 3.1 Instruct 405B

AI model by Meta. Real-time pricing and benchmark data.

Pricing (per 1M tokens)

Input: $2.75
Output: $6.50
Blended (3:1): $3.69

Source: Artificial Analysis
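
The blended figure appears to be a weighted average of the input and output prices at a 3:1 input-to-output token ratio. The sketch below reproduces the listed value under that assumption; the function name `blended_price` and its `ratio` parameter are illustrative and do not come from the source.

```python
def blended_price(input_price: float, output_price: float, ratio: float = 3.0) -> float:
    """Weighted average price per 1M tokens.

    Assumes `ratio` input tokens for every one output token
    (3:1 here, matching the label on the blended figure).
    """
    return (ratio * input_price + output_price) / (ratio + 1)


# Listed prices for Llama 3.1 Instruct 405B, in USD per 1M tokens.
blended = blended_price(2.75, 6.50)
print(f"${blended:.2f}")
```

With the listed prices this works out to (3 x 2.75 + 6.50) / 4 = 3.6875, which rounds to the $3.69 shown above.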

Performance

Output Speed: 31 tok/s
Time to First Token: 469 ms

Median values from Artificial Analysis

Compare with similar models

Model                                  Input   Output   Speed
Llama 3.1 Instruct 405B (current)      $2.75   $6.50    31 tok/s
Qwen2.5 Max                            $1.60   $6.40    48 tok/s
Grok 4.20 Beta 0309 (Reasoning)        $2.00   $6.00    238 tok/s
Grok 4.20 Beta 0309 (Non-reasoning)    $2.00   $6.00    189 tok/s
Qwen3 Next 80B A3B (Reasoning)         $0.50   $6.00    151 tok/s
Qwen3 Max Thinking                     $1.20   $6.00    34 tok/s

Example Costs

Single Request: $0.0060 (1.0K in / 500 out)
1K Requests/day: $6.00 (1.0M in / 500.0K out)
10K Requests/day: $60.00 (10.0M in / 5.0M out)
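
The example costs follow from the per-1M-token prices listed under Pricing. The sketch below shows the arithmetic, assuming every request uses 1,000 input tokens and 500 output tokens; `request_cost` and the constant names are hypothetical helpers, not part of any provider API.

```python
# Listed prices for Llama 3.1 Instruct 405B, in USD per 1M tokens.
INPUT_PRICE_PER_M = 2.75
OUTPUT_PRICE_PER_M = 6.50


def request_cost(input_tokens: int, output_tokens: int, requests: int = 1) -> float:
    """Estimated cost in USD for `requests` identical requests."""
    per_request = (
        input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    ) / 1_000_000
    return per_request * requests


# Same token counts as the three examples above.
for n in (1, 1_000, 10_000):
    print(f"{n:>6} request(s): ${request_cost(1_000, 500, n):.4f}")
```

One request costs 1,000 x $2.75 / 1M + 500 x $6.50 / 1M = $0.00275 + $0.00325 = $0.0060, and the daily figures scale linearly from there.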