Meta

Llama 3.3 Instruct 70B

AI model by Meta. Real-time pricing and benchmark data.

Pricing (per 1M tokens)

Input$0.58
Output$0.71
Blended (3:1)$0.62

Source: Artificial Analysis

Performance

Output Speed91 tok/s
Time to First Token598ms

Median values from Artificial Analysis

Compare with similar models

ModelInputOutputSpeed
Llama 3.3 Instruct 70BCurrent
$0.58$0.7191 tok/s
Mixtral 8x7B Instruct
$0.45$0.700 tok/s
Qwen3 VL 8B Instruct
$0.18$0.70142 tok/s
NVIDIA Nemotron 3 Super 120B A12B (Reasoning)
$0.30$0.75164 tok/s
Mercury 2
$0.25$0.75743 tok/s
Qwen3 VL 30B A3B (Reasoning)
$0.20$0.75126 tok/s

Example Costs

Single Request
$0.0009
1.0K in / 500 out
1K Requests/day
$0.9400
1.0M in / 500.0K out
10K Requests/day
$9.40
10.0M in / 5.0M out