Compare/Qwen3 32B (Reasoning) vs Grok 4.1 Fast (Reasoning)

Qwen3 32B (Reasoning)vsGrok 4.1 Fast (Reasoning)

Side-by-side comparison of pricing, 12 benchmarks, and generation speed.

Alibaba

Qwen3 32B (Reasoning)

Input
$0.195/M
Output
$0.52/M
Speed
104 tok/s
TTFT
1.07s
xAI

Grok 4.1 Fast (Reasoning)

Input
$0.2/M
Output
$0.5/M
Speed
101 tok/s
TTFT
8.89s

Winner by Category

Cheaper
Grok 4.1 Fast (Reasoning)
Faster (tok/s)
Qwen3 32B (Reasoning)
Lower Latency
Qwen3 32B (Reasoning)
Benchmarks (1-11)
Grok 4.1 Fast (Reasoning)

Pricing Comparison

MetricQwen3 32B (Reasoning)Grok 4.1 Fast (Reasoning)
Input ($/M tokens)$0.195$0.2
Output ($/M tokens)$0.52$0.5
Cost for 1M input + 100K output tokens:
Qwen3 32B (Reasoning)$0.25
Grok 4.1 Fast (Reasoning)$0.25

Speed Comparison

Output Speed (tokens/s) — higher is better
Qwen3 32B (Reasoning)
104 tok/s
Grok 4.1 Fast (Reasoning)
101 tok/s
Time to First Token (seconds) — lower is better
Qwen3 32B (Reasoning)
1.07s
Grok 4.1 Fast (Reasoning)
8.89s

Benchmark Comparison

Data from Artificial Analysis API — 12 benchmarks

Intelligence Index
16.538.6
Coding Index
13.830.9
Math Index
73.089.3
GPQA Diamond
66.8%85.3%
MMLU-Pro
79.8%85.4%
LiveCodeBench
54.6%82.2%
AIME 2025
73.0%89.3%
MATH-500
96.1%
Humanity's Last Exam
8.3%17.6%
SciCode
35.4%44.2%
IFBench
36.3%52.7%
TerminalBench
3.0%24.2%
Qwen3 32B (Reasoning)1 wins
11 winsGrok 4.1 Fast (Reasoning)

Frequently Asked Questions

Which is cheaper, Qwen3 32B (Reasoning) or Grok 4.1 Fast (Reasoning)?

Grok 4.1 Fast (Reasoning) is cheaper overall. Its blended price (3:1 input/output ratio) is $0.28/M tokens vs $0.28/M for Qwen3 32B (Reasoning).

Which model performs better on benchmarks?

Grok 4.1 Fast (Reasoning) wins 11 out of 12 benchmarks compared to 1 for Qwen3 32B (Reasoning). See the detailed benchmark chart above for per-category results.

Which is faster for real-time applications?

Qwen3 32B (Reasoning) generates tokens faster at 104 tok/s vs 101 tok/s. Qwen3 32B (Reasoning) also has lower time-to-first-token (1.07s vs 8.89s).

When should I use Qwen3 32B (Reasoning) vs Grok 4.1 Fast (Reasoning)?

Choose based on your priorities: Grok 4.1 Fast (Reasoning) for lower cost, Grok 4.1 Fast (Reasoning) for stronger benchmark performance, and Qwen3 32B (Reasoning) for faster generation. For latency-sensitive apps, check the TTFT comparison above.