GPT-5.1 Codex mini (high) vs Qwen3.5 35B A3B (Reasoning)

Side-by-side comparison of pricing, 12 benchmarks, and generation speed.

OpenAI

GPT-5.1 Codex mini (high)

Input: $0.25/M
Output: $2/M
Speed: 186 tok/s
TTFT: 6.58s

Alibaba

Qwen3.5 35B A3B (Reasoning)

Input: $0.25/M
Output: $2/M
Speed: 111 tok/s
TTFT: 1.10s

Winner by Category

Cheaper: Tie
Faster (tok/s): GPT-5.1 Codex mini (high)
Lower Latency: Qwen3.5 35B A3B (Reasoning)
Benchmarks (8-3): GPT-5.1 Codex mini (high)

Pricing Comparison

Metric                 GPT-5.1 Codex mini (high)    Qwen3.5 35B A3B (Reasoning)
Input ($/M tokens)     $0.25                        $0.25
Output ($/M tokens)    $2                           $2

Cost for 1M input + 100K output tokens:
GPT-5.1 Codex mini (high): $0.45
Qwen3.5 35B A3B (Reasoning): $0.45
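
The $0.45 figure follows directly from the per-million-token prices listed above. A minimal sketch of that arithmetic, where the function name `request_cost` and the `PRICES` dictionary are illustrative names rather than part of any API:

```python
def request_cost(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Dollar cost of a workload, given prices in USD per million tokens."""
    input_cost = (input_tokens / 1_000_000) * input_price_per_m
    output_cost = (output_tokens / 1_000_000) * output_price_per_m
    return input_cost + output_cost

# (input price, output price) in USD per million tokens, as listed on this page.
PRICES = {
    "GPT-5.1 Codex mini (high)": (0.25, 2.00),
    "Qwen3.5 35B A3B (Reasoning)": (0.25, 2.00),
}

# The page's reference workload: 1M input tokens plus 100K output tokens.
costs = {name: request_cost(1_000_000, 100_000, *price) for name, price in PRICES.items()}

for name, cost in costs.items():
    print(f"{name}: ${cost:.2f}")
```

Because the prices are identical, any difference in real spend would come from how many output tokens each model emits for the same task, which this page does not report.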

Speed Comparison

Output Speed (tokens/s; higher is better)
GPT-5.1 Codex mini (high): 186 tok/s
Qwen3.5 35B A3B (Reasoning): 111 tok/s

Time to First Token (seconds; lower is better)
GPT-5.1 Codex mini (high): 6.58s
Qwen3.5 35B A3B (Reasoning): 1.10s
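
Since one model starts sooner and the other streams faster, which one finishes first depends on response length. The sketch below assumes a simple model that is not stated on this page: total time equals TTFT plus output tokens divided by output speed, with both models emitting the same number of tokens. The function names are illustrative.

```python
def response_time(output_tokens, ttft_s, tokens_per_s):
    """Estimated seconds to receive a full response (assumed model: TTFT + n / speed)."""
    return ttft_s + output_tokens / tokens_per_s

def break_even_tokens(ttft_a, speed_a, ttft_b, speed_b):
    """Output length n at which both models finish at the same time.

    Solves ttft_a + n / speed_a == ttft_b + n / speed_b for n.
    """
    return (ttft_a - ttft_b) / (1 / speed_b - 1 / speed_a)

# (TTFT in seconds, output speed in tok/s), as listed on this page.
GPT = (6.58, 186.0)
QWEN = (1.10, 111.0)

n = break_even_tokens(*GPT, *QWEN)
print(f"Break-even at roughly {n:.0f} output tokens")

for tokens in (200, 5000):
    print(tokens, round(response_time(tokens, *GPT), 2), round(response_time(tokens, *QWEN), 2))
```

Under these assumptions the crossover is at roughly 1,500 output tokens: shorter responses complete sooner on Qwen3.5 35B A3B (Reasoning), longer ones on GPT-5.1 Codex mini (high). Real latency also varies with load, network conditions, and how many tokens each model actually produces.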

Benchmark Comparison

Data from Artificial Analysis API — 12 benchmarks

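Benchmark               GPT-5.1 Codex mini (high)    Qwen3.5 35B A3B (Reasoning)
Intelligence Index      38.6                         37.1
Coding Index            36.4                         30.3
Math Index              91.7                         n/a
GPQA Diamond            81.3%                        84.5%
MMLU-Pro                82.0%                        n/a
LiveCodeBench           83.6%                        n/a
AIME 2025               91.7%                        n/a
MATH-500                n/a                          n/a
Humanity's Last Exam    16.9%                        19.7%
SciCode                 42.6%                        37.7%
IFBench                 67.9%                        72.5%
TerminalBench           33.3%                        26.5%

n/a = no score listed. Rows with a single score are attributed to GPT-5.1 Codex mini (high), which is the assignment consistent with the 8-3 win count.

Wins: GPT-5.1 Codex mini (high) 8, Qwen3.5 35B A3B (Reasoning) 3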
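
The 8-3 tally can be reproduced from the scores listed above. This sketch rests on two assumptions that the page does not state outright: a benchmark with a score for only one model counts as a win for that model, and the single-score rows belong to GPT-5.1 Codex mini (high). `SCORES` and `tally` are illustrative names.

```python
# (GPT-5.1 Codex mini (high), Qwen3.5 35B A3B (Reasoning)); None means no score listed.
# Assumption: rows with a single score belong to GPT-5.1 Codex mini (high).
SCORES = {
    "Intelligence Index": (38.6, 37.1),
    "Coding Index": (36.4, 30.3),
    "Math Index": (91.7, None),
    "GPQA Diamond": (81.3, 84.5),
    "MMLU-Pro": (82.0, None),
    "LiveCodeBench": (83.6, None),
    "AIME 2025": (91.7, None),
    "MATH-500": (None, None),
    "Humanity's Last Exam": (16.9, 19.7),
    "SciCode": (42.6, 37.7),
    "IFBench": (67.9, 72.5),
    "TerminalBench": (33.3, 26.5),
}

def tally(scores):
    """Count wins per model; a lone score wins its row, and empty rows are skipped."""
    wins_a = wins_b = 0
    for a, b in scores.values():
        if a is None and b is None:
            continue  # no data for either model
        if b is None:
            wins_a += 1
        elif a is None:
            wins_b += 1
        elif a > b:
            wins_a += 1
        elif b > a:
            wins_b += 1
    return wins_a, wins_b

gpt_wins, qwen_wins = tally(SCORES)
print(f"GPT-5.1 Codex mini (high): {gpt_wins} wins, Qwen3.5 35B A3B (Reasoning): {qwen_wins} wins")
```

On the seven benchmarks where both models have a score, the split is 4-3, so half of the 8 wins come from rows where only one model was measured.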

Frequently Asked Questions

Which is cheaper, GPT-5.1 Codex mini (high) or Qwen3.5 35B A3B (Reasoning)?

Neither. Both models are priced identically at $0.25 per million input tokens and $2 per million output tokens, so a workload of 1M input + 100K output tokens costs $0.45 on either.

Which model performs better on benchmarks?

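GPT-5.1 Codex mini (high) wins 8 of the 12 listed benchmarks, compared to 3 for Qwen3.5 35B A3B (Reasoning); the remaining benchmark, MATH-500, has no score listed. Qwen3.5 35B A3B (Reasoning) leads on GPQA Diamond (84.5% vs 81.3%), Humanity's Last Exam (19.7% vs 16.9%), and IFBench (72.5% vs 67.9%). See the benchmark comparison above for per-category results.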

Which is faster for real-time applications?

GPT-5.1 Codex mini (high) generates tokens faster at 186 tok/s vs 111 tok/s. However, Qwen3.5 35B A3B (Reasoning) has lower time-to-first-token (1.10s vs 6.58s).

When should I use GPT-5.1 Codex mini (high) vs Qwen3.5 35B A3B (Reasoning)?

Pricing is identical, so choose on performance. Pick GPT-5.1 Codex mini (high) for stronger overall benchmark results, particularly on coding (Coding Index 36.4 vs 30.3), and for faster generation (186 vs 111 tok/s). Pick Qwen3.5 35B A3B (Reasoning) for latency-sensitive apps, where its lower time to first token (1.10s vs 6.58s) matters most.