o3 vs Qwen3.6 Max Preview

Side-by-side comparison of pricing, 12 benchmarks, and generation speed.

o3 (OpenAI)

Input: $2/M
Output: $8/M
Speed: 111 tok/s
TTFT: 6.00s

Qwen3.6 Max Preview (Alibaba)

Input: $1.3/M
Output: $7.8/M
Speed: 38 tok/s
TTFT: 2.16s

Winner by Category

Cheaper: Qwen3.6 Max Preview
Faster (tok/s): o3
Lower Latency: Qwen3.6 Max Preview
Benchmarks (5-7): Qwen3.6 Max Preview

Pricing Comparison

| Metric | o3 | Qwen3.6 Max Preview |
| --- | --- | --- |
| Input ($/M tokens) | $2 | $1.3 |
| Output ($/M tokens) | $8 | $7.8 |

Cost for 1M input + 100K output tokens:
o3: $2.80
Qwen3.6 Max Preview: $2.08
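The workload cost above is straightforward arithmetic on the per-million-token prices. A minimal sketch follows, assuming the listed prices apply uniformly to every token. The function name `request_cost` and the `PRICES` dictionary are illustrative and not part of any provider SDK.

```python
def request_cost(input_tokens: int, output_tokens: int,
                 input_price_per_m: float, output_price_per_m: float) -> float:
    """Return the USD cost of a workload given per-million-token prices."""
    total = input_tokens * input_price_per_m + output_tokens * output_price_per_m
    return total / 1_000_000


# (input $/M, output $/M) as listed in the pricing comparison.
PRICES = {
    "o3": (2.0, 8.0),
    "Qwen3.6 Max Preview": (1.3, 7.8),
}

# Same workload as the page: 1M input tokens plus 100K output tokens.
for model, (input_price, output_price) in PRICES.items():
    cost = request_cost(1_000_000, 100_000, input_price, output_price)
    print(f"{model}: ${cost:.2f}")
```

Changing the two token counts shows how the gap moves with the workload mix: the input prices differ by $0.70/M but the output prices by only $0.20/M, so output-heavy workloads narrow the difference.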

Speed Comparison

Output Speed (tokens/s), higher is better
o3: 111 tok/s
Qwen3.6 Max Preview: 38 tok/s

Time to First Token (seconds), lower is better
o3: 6.00s
Qwen3.6 Max Preview: 2.16s

Benchmark Comparison

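Data from Artificial Analysis API (12 benchmarks). A dash marks a benchmark with no listed score.

| Benchmark | o3 | Qwen3.6 Max Preview |
| --- | --- | --- |
| Intelligence Index | 38.4 | 51.8 |
| Coding Index | 38.4 | 44.9 |
| Math Index | 88.3 | - |
| GPQA Diamond | 82.7% | 88.8% |
| MMLU-Pro | 85.3% | - |
| LiveCodeBench | 80.8% | - |
| AIME 2025 | 88.3% | - |
| MATH-500 | 99.2% | - |
| Humanity's Last Exam | 20.0% | 28.9% |
| SciCode | 41.0% | 46.9% |
| IFBench | 71.4% | 76.6% |
| TerminalBench | 37.1% | 43.9% |

o3: 5 wins
Qwen3.6 Max Preview: 7 wins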

Frequently Asked Questions

Which is cheaper, o3 or Qwen3.6 Max Preview?

Qwen3.6 Max Preview is cheaper overall. Its blended price (3:1 input/output ratio) is $2.92/M tokens vs $3.50/M for o3.
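The blended figure is a weighted average of the input and output prices. The sketch below assumes the 3:1 ratio means three input tokens for every output token. The function name `blended_price` is illustrative.

```python
def blended_price(input_price: float, output_price: float,
                  input_parts: int = 3, output_parts: int = 1) -> float:
    """Weighted average price per million tokens for a given input:output mix."""
    weighted = input_parts * input_price + output_parts * output_price
    return weighted / (input_parts + output_parts)


# Prices in $/M tokens as listed on this page.
o3_blended = blended_price(2.0, 8.0)
qwen_blended = blended_price(1.3, 7.8)

print(f"o3: {o3_blended:.3f} $/M")
print(f"Qwen3.6 Max Preview: {qwen_blended:.3f} $/M")
```

Under this assumption the o3 result is exactly 3.50, and the Qwen3.6 Max Preview result is 2.925, which the page reports as $2.92.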

Which model performs better on benchmarks?

Qwen3.6 Max Preview wins 7 out of 12 benchmarks compared to 5 for o3. See the detailed benchmark chart above for per-category results.

Which is faster for real-time applications?

o3 generates tokens faster at 111 tok/s vs 38 tok/s. However, Qwen3.6 Max Preview has lower time-to-first-token (2.16s vs 6.00s).
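Which model finishes a response first depends on how long the response is. As a rough sketch, assume total response time is TTFT plus output tokens divided by output speed, with both figures constant. That is a simplification: real latency varies with load, prompt length, and reasoning effort. The function names are illustrative.

```python
def response_time(output_tokens: int, ttft_s: float, tokens_per_s: float) -> float:
    """Estimated seconds until the last token arrives (TTFT + streaming time)."""
    return ttft_s + output_tokens / tokens_per_s


def break_even_tokens(ttft_a: float, speed_a: float,
                      ttft_b: float, speed_b: float) -> float:
    """Output length at which model A and model B finish at the same time.

    Model A is assumed to start sooner but stream slower than model B.
    """
    return (ttft_b - ttft_a) / (1 / speed_a - 1 / speed_b)


# Figures from this page: (TTFT in seconds, output speed in tok/s).
QWEN = (2.16, 38.0)
O3 = (6.00, 111.0)

crossover = break_even_tokens(*QWEN, *O3)
print(f"Break-even output length: {crossover:.0f} tokens")

for n in (100, 1000):
    print(f"{n} tokens: Qwen3.6 Max Preview {response_time(n, *QWEN):.2f}s, "
          f"o3 {response_time(n, *O3):.2f}s")
```

Under these assumptions the crossover sits at roughly 222 output tokens: shorter responses complete sooner on Qwen3.6 Max Preview, while longer ones complete sooner on o3.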

When should I use o3 vs Qwen3.6 Max Preview?

Choose based on your priorities: Qwen3.6 Max Preview for lower cost, stronger benchmark performance, and lower time to first token (2.16s vs 6.00s); o3 for faster generation (111 tok/s vs 38 tok/s). For latency-sensitive apps, check the TTFT comparison above.