Alibaba

Qwen3 32B (Non-reasoning)

AI model by Alibaba. Real-time pricing and benchmark data.

Pricing (per 1M tokens)

Input$0.15
Output$0.59
Blended (3:1)$0.26

Source: Artificial Analysis

Performance

Output Speed103 tok/s
Time to First Token1091ms

Median values from Artificial Analysis

Compare with similar models

ModelInputOutputSpeed
Qwen3 32B (Non-reasoning)Current
$0.15$0.59103 tok/s
gpt-oss-120B (low)
$0.15$0.60232 tok/s
gpt-oss-120B (high)
$0.15$0.60208 tok/s
Mistral Small 4 (Reasoning)
$0.15$0.60163 tok/s
Mistral Small 4 (Non-reasoning)
$0.15$0.60159 tok/s
NVIDIA Nemotron Nano 12B v2 VL (Reasoning)
$0.20$0.600 tok/s

Example Costs

Single Request
$0.0004
1.0K in / 500 out
1K Requests/day
$0.4450
1.0M in / 500.0K out
10K Requests/day
$4.45
10.0M in / 5.0M out