Alibaba

QwQ 32B-Preview

AI model by Alibaba. Real-time pricing and benchmark data.

Pricing (per 1M tokens)

Input$0.12
Output$0.18
Blended (3:1)$0.14

Source: Artificial Analysis

Performance

Output Speed59 tok/s
Time to First Token491ms

Median values from Artificial Analysis

Compare with similar models

ModelInputOutputSpeed
QwQ 32B-PreviewCurrent
$0.12$0.1859 tok/s
NVIDIA Nemotron Nano 9B V2 (Non-reasoning)
$0.05$0.20157 tok/s
Llama 3.2 Instruct 11B (Vision)
$0.16$0.1681 tok/s
NVIDIA Nemotron Nano 9B V2 (Reasoning)
$0.04$0.16127 tok/s
gpt-oss-20B (high)
$0.06$0.20316 tok/s
gpt-oss-20B (low)
$0.06$0.20321 tok/s

Example Costs

Single Request
$0.0002
1.0K in / 500 out
1K Requests/day
$0.2100
1.0M in / 500.0K out
10K Requests/day
$2.10
10.0M in / 5.0M out