Compare/GPT-5 mini (minimal) vs Qwen3.5 35B A3B (Reasoning)

GPT-5 mini (minimal)vsQwen3.5 35B A3B (Reasoning)

Side-by-side comparison of pricing, 12 benchmarks, and generation speed.

OpenAI

GPT-5 mini (minimal)

Input
$0.25/M
Output
$2/M
Speed
98 tok/s
TTFT
0.86s
Alibaba

Qwen3.5 35B A3B (Reasoning)

Input
$0.25/M
Output
$2/M
Speed
111 tok/s
TTFT
1.10s

Winner by Category

Cheaper
Tie
Faster (tok/s)
Qwen3.5 35B A3B (Reasoning)
Lower Latency
GPT-5 mini (minimal)
Benchmarks (4-7)
Qwen3.5 35B A3B (Reasoning)

Pricing Comparison

MetricGPT-5 mini (minimal)Qwen3.5 35B A3B (Reasoning)
Input ($/M tokens)$0.25$0.25
Output ($/M tokens)$2$2
Cost for 1M input + 100K output tokens:
GPT-5 mini (minimal)$0.45
Qwen3.5 35B A3B (Reasoning)$0.45

Speed Comparison

Output Speed (tokens/s) — higher is better
GPT-5 mini (minimal)
98 tok/s
Qwen3.5 35B A3B (Reasoning)
111 tok/s
Time to First Token (seconds) — lower is better
GPT-5 mini (minimal)
0.86s
Qwen3.5 35B A3B (Reasoning)
1.10s

Benchmark Comparison

Data from Artificial Analysis API — 12 benchmarks

Intelligence Index
20.737.1
Coding Index
21.930.3
Math Index
46.7
GPQA Diamond
68.7%84.5%
MMLU-Pro
77.5%
LiveCodeBench
54.5%
AIME 2025
46.7%
MATH-500
Humanity's Last Exam
5.0%19.7%
SciCode
36.9%37.7%
IFBench
45.6%72.5%
TerminalBench
14.4%26.5%
GPT-5 mini (minimal)4 wins
7 winsQwen3.5 35B A3B (Reasoning)

Frequently Asked Questions

Which is cheaper, GPT-5 mini (minimal) or Qwen3.5 35B A3B (Reasoning)?

Both models have similar pricing. Check the detailed breakdown above for input vs output token costs.

Which model performs better on benchmarks?

Qwen3.5 35B A3B (Reasoning) wins 7 out of 12 benchmarks compared to 4 for GPT-5 mini (minimal). See the detailed benchmark chart above for per-category results.

Which is faster for real-time applications?

Qwen3.5 35B A3B (Reasoning) generates tokens faster at 111 tok/s vs 98 tok/s. GPT-5 mini (minimal) also has lower time-to-first-token (0.86s vs 1.10s).

When should I use GPT-5 mini (minimal) vs Qwen3.5 35B A3B (Reasoning)?

Choose based on your priorities: both are similarly priced, Qwen3.5 35B A3B (Reasoning) for stronger benchmark performance, and Qwen3.5 35B A3B (Reasoning) for faster generation. For latency-sensitive apps, check the TTFT comparison above.