Alibaba

Qwen3 8B (Non-reasoning)

AI model by Alibaba. Real-time pricing and benchmark data.

Pricing (per 1M tokens)

Input$0.18
Output$0.70
Blended (3:1)$0.31

Source: Artificial Analysis

Performance

Output Speed81 tok/s
Time to First Token953ms

Median values from Artificial Analysis

Compare with similar models

ModelInputOutputSpeed
Qwen3 8B (Non-reasoning)Current
$0.18$0.7081 tok/s
Llama 3.3 Instruct 70B
$0.58$0.7199 tok/s
Llama 3.2 Instruct 90B (Vision)
$0.72$0.7242 tok/s
Llama 4 Scout
$0.17$0.66129 tok/s
NVIDIA Nemotron 3 Super 120B A12B (Reasoning)
$0.30$0.75370 tok/s
Mercury 2
$0.25$0.75907 tok/s

Example Costs

Single Request
$0.0005
1.0K in / 500 out
1K Requests/day
$0.5300
1.0M in / 500.0K out
10K Requests/day
$5.30
10.0M in / 5.0M out