Compare/Qwen3 235B A22B 2507 Instruct vs DeepSeek V3 (Dec '24)

Qwen3 235B A22B 2507 InstructvsDeepSeek V3 (Dec '24)

Side-by-side comparison of pricing, 12 benchmarks, and generation speed.

Alibaba

Qwen3 235B A22B 2507 Instruct

Input
$0.2/M
Output
$0.825/M
Speed
69 tok/s
TTFT
1.23s
DeepSeek

DeepSeek V3 (Dec '24)

Input
$0.4/M
Output
$0.89/M
Speed
TTFT

Winner by Category

Cheaper
Qwen3 235B A22B 2507 Instruct
Faster (tok/s)
Qwen3 235B A22B 2507 Instruct
Lower Latency
DeepSeek V3 (Dec '24)
Benchmarks (12-0)
Qwen3 235B A22B 2507 Instruct

Pricing Comparison

MetricQwen3 235B A22B 2507 InstructDeepSeek V3 (Dec '24)
Input ($/M tokens)$0.2$0.4
Output ($/M tokens)$0.825$0.89
Cost for 1M input + 100K output tokens:
Qwen3 235B A22B 2507 Instruct$0.28
DeepSeek V3 (Dec '24)$0.49

Speed Comparison

Output Speed (tokens/s) — higher is better
Qwen3 235B A22B 2507 Instruct
69 tok/s
DeepSeek V3 (Dec '24)
Time to First Token (seconds) — lower is better
Qwen3 235B A22B 2507 Instruct
1.23s
DeepSeek V3 (Dec '24)

Benchmark Comparison

Data from Artificial Analysis API — 12 benchmarks

Intelligence Index
25.016.5
Coding Index
22.116.4
Math Index
71.726.0
GPQA Diamond
75.3%55.7%
MMLU-Pro
82.8%75.2%
LiveCodeBench
52.4%35.9%
AIME 2025
71.7%26.0%
MATH-500
98.0%88.7%
Humanity's Last Exam
10.6%3.6%
SciCode
36.0%35.4%
IFBench
46.1%34.8%
TerminalBench
15.2%6.8%
Qwen3 235B A22B 2507 Instruct12 wins
0 winsDeepSeek V3 (Dec '24)

Frequently Asked Questions

Which is cheaper, Qwen3 235B A22B 2507 Instruct or DeepSeek V3 (Dec '24)?

Qwen3 235B A22B 2507 Instruct is cheaper overall. Its blended price (3:1 input/output ratio) is $0.36/M tokens vs $0.52/M for DeepSeek V3 (Dec '24).

Which model performs better on benchmarks?

Qwen3 235B A22B 2507 Instruct wins 12 out of 12 benchmarks compared to 0 for DeepSeek V3 (Dec '24). See the detailed benchmark chart above for per-category results.

Which is faster for real-time applications?

Qwen3 235B A22B 2507 Instruct generates tokens faster at 69 tok/s vs 0 tok/s. However, DeepSeek V3 (Dec '24) has lower time-to-first-token (0.00s vs 1.23s).

When should I use Qwen3 235B A22B 2507 Instruct vs DeepSeek V3 (Dec '24)?

Choose based on your priorities: Qwen3 235B A22B 2507 Instruct for lower cost, Qwen3 235B A22B 2507 Instruct for stronger benchmark performance, and Qwen3 235B A22B 2507 Instruct for faster generation. For latency-sensitive apps, check the TTFT comparison above.