AI2

Olmo 3.1 32B Instruct

AI model by AI2. Real-time pricing and benchmark data.

Pricing (per 1M tokens)

Input: $0.20
Output: $0.60
Blended (3:1): $0.30

Source: Artificial Analysis
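
The blended figure can be reproduced from the input and output prices. The sketch below assumes that "3:1" means three input tokens for every output token; the page does not spell this out, but that weighting matches the listed $0.30. The function and parameter names are illustrative only.

```python
def blended_price(input_price: float, output_price: float,
                  input_weight: int = 3, output_weight: int = 1) -> float:
    """Weighted average price per 1M tokens.

    Assumption: the 3:1 ratio weights input tokens three times as
    heavily as output tokens.
    """
    total_weight = input_weight + output_weight
    return (input_weight * input_price + output_weight * output_price) / total_weight


# Prices per 1M tokens as listed for Olmo 3.1 32B Instruct.
print(f"${blended_price(0.20, 0.60):.2f}")  # prints $0.30
```

A workload with a different input-to-output mix can pass its own weights, for example `blended_price(0.20, 0.60, input_weight=1, output_weight=1)`.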

Performance

Output Speed: 55 tok/s
Time to First Token: 230 ms

Median values from Artificial Analysis
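
The two median figures can be combined into a rough estimate of how long a full response takes. This is a back-of-the-envelope sketch, assuming that generation proceeds at a constant output speed once the first token arrives; real latency varies with provider, load, and prompt length. The function name is hypothetical.

```python
def estimated_response_seconds(output_tokens: int,
                               ttft_ms: float = 230.0,
                               tokens_per_second: float = 55.0) -> float:
    """Approximate end-to-end time: time to first token plus generation time.

    Assumption: output speed stays constant at the listed median.
    """
    if tokens_per_second <= 0:
        raise ValueError("tokens_per_second must be positive")
    return ttft_ms / 1000.0 + output_tokens / tokens_per_second


# A 500-token response at the listed medians.
print(f"{estimated_response_seconds(500):.1f} s")  # prints 9.3 s
```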

Compare with similar models

Model                                             Input   Output  Speed
Olmo 3.1 32B Instruct (Current)                   $0.20   $0.60   55 tok/s
gpt-oss-120B (low)                                $0.15   $0.60   255 tok/s
gpt-oss-120B (high)                               $0.15   $0.60   253 tok/s
Mistral Small 4 (Non-reasoning)                   $0.15   $0.60   159 tok/s
Mistral Small 4 (Reasoning)                       $0.15   $0.60   0 tok/s
NVIDIA Nemotron Nano 12B v2 VL (Non-reasoning)    $0.20   $0.60   138 tok/s

Example Costs

Single Request: $0.0005 (1.0K in / 500 out)
1K Requests/day: $0.50 (1.0M in / 500.0K out)
10K Requests/day: $5.00 (10.0M in / 5.0M out)
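
The example costs follow directly from the per-1M-token prices. The sketch below reproduces them, assuming every request uses 1,000 input tokens and 500 output tokens as in the examples; the constant and function names are illustrative.

```python
# Prices per 1M tokens as listed for Olmo 3.1 32B Instruct.
INPUT_PRICE_PER_M = 0.20
OUTPUT_PRICE_PER_M = 0.60


def request_cost(input_tokens: int, output_tokens: int) -> float:
    """Cost in USD of a single request."""
    return (input_tokens * INPUT_PRICE_PER_M
            + output_tokens * OUTPUT_PRICE_PER_M) / 1_000_000


def daily_cost(requests_per_day: int,
               input_tokens: int = 1_000,
               output_tokens: int = 500) -> float:
    """Cost in USD per day, assuming every request has the same token counts."""
    return requests_per_day * request_cost(input_tokens, output_tokens)


print(f"${request_cost(1_000, 500):.4f}")  # prints $0.0005
print(f"${daily_cost(1_000):.2f}")         # prints $0.50
print(f"${daily_cost(10_000):.2f}")        # prints $5.00
```

Swapping in another model's prices from the comparison table gives the equivalent figures for that model.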