Meta

Llama 3.1 Instruct 70B

AI model by Meta. Real-time pricing and benchmark data.

Pricing (per 1M tokens)

Input$0.56
Output$0.56
Blended (3:1)$0.56

Source: Artificial Analysis

Performance

Output Speed31 tok/s
Time to First Token554ms

Median values from Artificial Analysis

Compare with similar models

ModelInputOutputSpeed
Llama 3.1 Instruct 70BCurrent
$0.56$0.5631 tok/s
Ring-flash-2.0
$0.14$0.5781 tok/s
Ling-flash-2.0
$0.14$0.5764 tok/s
Seed-OSS-36B-Instruct
$0.21$0.5740 tok/s
gpt-oss-120B (low)
$0.15$0.60255 tok/s
gpt-oss-120B (high)
$0.15$0.60253 tok/s

Example Costs

Single Request
$0.0008
1.0K in / 500 out
1K Requests/day
$0.8400
1.0M in / 500.0K out
10K Requests/day
$8.40
10.0M in / 5.0M out