Compare/Qwen3 VL 235B A22B (Reasoning) vs o3

Qwen3 VL 235B A22B (Reasoning)vso3

Side-by-side comparison of pricing, 12 benchmarks, and generation speed.

Alibaba

Qwen3 VL 235B A22B (Reasoning)

Input
$0.7/M
Output
$8.4/M
Speed
58 tok/s
TTFT
1.11s
OpenAI

o3

Input
$2/M
Output
$8/M
Speed
94 tok/s
TTFT
7.87s

Winner by Category

Cheaper
Qwen3 VL 235B A22B (Reasoning)
Faster (tok/s)
o3
Lower Latency
Qwen3 VL 235B A22B (Reasoning)
Benchmarks (0-10)
o3

Pricing Comparison

MetricQwen3 VL 235B A22B (Reasoning)o3
Input ($/M tokens)$0.7$2
Output ($/M tokens)$8.4$8
Cost for 1M input + 100K output tokens:
Qwen3 VL 235B A22B (Reasoning)$1.54
o3$2.80

Speed Comparison

Output Speed (tokens/s) — higher is better
Qwen3 VL 235B A22B (Reasoning)
58 tok/s
o3
94 tok/s
Time to First Token (seconds) — lower is better
Qwen3 VL 235B A22B (Reasoning)
1.11s
o3
7.87s

Benchmark Comparison

Data from Artificial Analysis API — 12 benchmarks

Intelligence Index
27.638.4
Coding Index
20.938.4
Math Index
88.388.3
GPQA Diamond
77.2%82.7%
MMLU-Pro
83.6%85.3%
LiveCodeBench
64.6%80.8%
AIME 2025
88.3%88.3%
MATH-500
99.2%
Humanity's Last Exam
10.1%20.0%
SciCode
39.9%41.0%
IFBench
56.5%71.4%
TerminalBench
11.4%37.1%
Qwen3 VL 235B A22B (Reasoning)0 wins
10 winso3

Frequently Asked Questions

Which is cheaper, Qwen3 VL 235B A22B (Reasoning) or o3?

Qwen3 VL 235B A22B (Reasoning) is cheaper overall. Its blended price (3:1 input/output ratio) is $2.63/M tokens vs $3.50/M for o3.

Which model performs better on benchmarks?

o3 wins 10 out of 12 benchmarks compared to 0 for Qwen3 VL 235B A22B (Reasoning). See the detailed benchmark chart above for per-category results.

Which is faster for real-time applications?

o3 generates tokens faster at 94 tok/s vs 58 tok/s. Qwen3 VL 235B A22B (Reasoning) also has lower time-to-first-token (1.11s vs 7.87s).

When should I use Qwen3 VL 235B A22B (Reasoning) vs o3?

Choose based on your priorities: Qwen3 VL 235B A22B (Reasoning) for lower cost, o3 for stronger benchmark performance, and o3 for faster generation. For latency-sensitive apps, check the TTFT comparison above.