Compare/Qwen3 1.7B (Reasoning) vs Cogito v2.1 (Reasoning)

Qwen3 1.7B (Reasoning)vsCogito v2.1 (Reasoning)

Side-by-side comparison of pricing, 12 benchmarks, and generation speed.

Alibaba

Qwen3 1.7B (Reasoning)

Input
$0.11/M
Output
$1.26/M
Speed
140 tok/s
TTFT
0.88s
Deep Cogito

Cogito v2.1 (Reasoning)

Input
$1.25/M
Output
$1.25/M
Speed
92 tok/s
TTFT
0.27s

Winner by Category

Cheaper
Qwen3 1.7B (Reasoning)
Faster (tok/s)
Qwen3 1.7B (Reasoning)
Lower Latency
Cogito v2.1 (Reasoning)
Benchmarks (2-10)
Cogito v2.1 (Reasoning)

Pricing Comparison

MetricQwen3 1.7B (Reasoning)Cogito v2.1 (Reasoning)
Input ($/M tokens)$0.11$1.25
Output ($/M tokens)$1.26$1.25
Cost for 1M input + 100K output tokens:
Qwen3 1.7B (Reasoning)$0.24
Cogito v2.1 (Reasoning)$1.38

Speed Comparison

Output Speed (tokens/s) — higher is better
Qwen3 1.7B (Reasoning)
140 tok/s
Cogito v2.1 (Reasoning)
92 tok/s
Time to First Token (seconds) — lower is better
Qwen3 1.7B (Reasoning)
0.88s
Cogito v2.1 (Reasoning)
0.27s

Benchmark Comparison

Data from Artificial Analysis API — 12 benchmarks

Intelligence Index
8.0
Coding Index
1.424.8
Math Index
38.772.7
GPQA Diamond
35.6%76.8%
MMLU-Pro
57.0%84.9%
LiveCodeBench
30.8%68.8%
AIME 2025
38.7%72.7%
MATH-500
89.4%
Humanity's Last Exam
4.8%11.0%
SciCode
4.3%41.0%
IFBench
26.9%46.3%
TerminalBench
0.0%16.7%
Qwen3 1.7B (Reasoning)2 wins
10 winsCogito v2.1 (Reasoning)

Frequently Asked Questions

Which is cheaper, Qwen3 1.7B (Reasoning) or Cogito v2.1 (Reasoning)?

Qwen3 1.7B (Reasoning) is cheaper overall. Its blended price (3:1 input/output ratio) is $0.40/M tokens vs $1.25/M for Cogito v2.1 (Reasoning).

Which model performs better on benchmarks?

Cogito v2.1 (Reasoning) wins 10 out of 12 benchmarks compared to 2 for Qwen3 1.7B (Reasoning). See the detailed benchmark chart above for per-category results.

Which is faster for real-time applications?

Qwen3 1.7B (Reasoning) generates tokens faster at 140 tok/s vs 92 tok/s. However, Cogito v2.1 (Reasoning) has lower time-to-first-token (0.27s vs 0.88s).

When should I use Qwen3 1.7B (Reasoning) vs Cogito v2.1 (Reasoning)?

Choose based on your priorities: Qwen3 1.7B (Reasoning) for lower cost, Cogito v2.1 (Reasoning) for stronger benchmark performance, and Qwen3 1.7B (Reasoning) for faster generation. For latency-sensitive apps, check the TTFT comparison above.