Magistral Small 1.2 vs Command-R (Mar '24)

Side-by-side comparison of pricing, 12 benchmarks, and generation speed.

Mistral: Magistral Small 1.2

Input: $0.5/M
Output: $1.5/M
Speed: 190 tok/s
TTFT: 0.33s

Cohere: Command-R (Mar '24)

Input: $0.5/M
Output: $1.5/M
Speed: not listed
TTFT: not listed

Winner by Category

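Cheaper: Tie
Faster (tok/s): Magistral Small 1.2
Lower Latency: Command-R (Mar '24) (no TTFT value is listed for this model, so this result cannot be confirmed from the figures shown here)
Benchmarks (11-1): Magistral Small 1.2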

Pricing Comparison

Metric                Magistral Small 1.2    Command-R (Mar '24)
Input ($/M tokens)    $0.5                   $0.5
Output ($/M tokens)   $1.5                   $1.5

Cost for 1M input + 100K output tokens:
Magistral Small 1.2: $0.65
Command-R (Mar '24): $0.65
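
The $0.65 figure follows directly from the per-million-token prices: 1M input tokens at $0.5/M plus 100K output tokens at $1.5/M. A minimal sketch of that arithmetic is below; the function name `request_cost` and its parameters are illustrative, not part of any provider's API.

```python
def request_cost(input_tokens: int, output_tokens: int,
                 input_price_per_m: float, output_price_per_m: float) -> float:
    """Return the dollar cost of a workload at per-million-token prices."""
    input_cost = (input_tokens / 1_000_000) * input_price_per_m
    output_cost = (output_tokens / 1_000_000) * output_price_per_m
    return input_cost + output_cost


# Prices from the pricing table ($ per million tokens); identical for both models.
cost = request_cost(1_000_000, 100_000,
                    input_price_per_m=0.5, output_price_per_m=1.5)
print(f"${cost:.2f}")  # prints $0.65
```

Because both models share the same input and output prices, the result is the same for any mix of input and output tokens.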

Speed Comparison

Output Speed (tokens/s), higher is better
Magistral Small 1.2: 190 tok/s
Command-R (Mar '24): not listed

Time to First Token (seconds), lower is better
Magistral Small 1.2: 0.33s
Command-R (Mar '24): not listed
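
To see how the two speed metrics combine, a rough end-to-end estimate is TTFT plus the output length divided by the output speed. The sketch below assumes a constant generation rate and ignores network overhead and queueing, so treat the result as an approximation; the function name `estimated_response_seconds` is hypothetical. Only Magistral Small 1.2 is shown because it is the only model with both values listed.

```python
def estimated_response_seconds(output_tokens: int, ttft_seconds: float,
                               tokens_per_second: float) -> float:
    """Approximate total response time: wait for the first token, then stream."""
    if tokens_per_second <= 0:
        # A zero or missing speed means there is no measurement to estimate from.
        raise ValueError("tokens_per_second must be positive")
    return ttft_seconds + output_tokens / tokens_per_second


# Magistral Small 1.2 figures from this page: 0.33s TTFT, 190 tok/s.
seconds = estimated_response_seconds(500, ttft_seconds=0.33, tokens_per_second=190)
print(f"{seconds:.2f}s")  # prints 2.96s for a 500-token response
```

For short responses TTFT dominates the total; for long responses the output speed does.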

Benchmark Comparison

Data from Artificial Analysis API — 12 benchmarks

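Scores are listed as Magistral Small 1.2 vs Command-R (Mar '24). Rows marked "single score" show only one value, and the model it belongs to is not indicated.

Intelligence Index: 18.2 vs 7.4
Coding Index: 14.8 (single score)
Math Index: 80.3 (single score)
GPQA Diamond: 66.3% vs 28.4%
MMLU-Pro: 76.8% vs 33.8%
LiveCodeBench: 72.3% vs 4.8%
AIME 2025: 80.3% (single score)
MATH-500: 16.4% (single score)
Humanity's Last Exam: 6.1% vs 4.8%
SciCode: 35.2% vs 6.2%
IFBench: 44.4% (single score)
TerminalBench: 4.5% (single score)

Magistral Small 1.2: 11 wins
Command-R (Mar '24): 1 win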
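
As an illustration of how a head-to-head tally works, the sketch below counts wins over only the six benchmarks where both models have a listed score. It does not reproduce the page's 11-1 total, which also counts the six benchmarks that show a single score; how those are awarded is not stated here. The dictionary layout and the name `tally_wins` are assumptions made for this example.

```python
# (Magistral Small 1.2, Command-R (Mar '24)) scores for benchmarks with both values.
paired_scores = {
    "Intelligence Index": (18.2, 7.4),
    "GPQA Diamond": (66.3, 28.4),
    "MMLU-Pro": (76.8, 33.8),
    "LiveCodeBench": (72.3, 4.8),
    "Humanity's Last Exam": (6.1, 4.8),
    "SciCode": (35.2, 6.2),
}


def tally_wins(scores: dict[str, tuple[float, float]]) -> dict[str, int]:
    """Count per-benchmark wins; higher score wins, equal scores count for neither."""
    wins = {"Magistral Small 1.2": 0, "Command-R (Mar '24)": 0}
    for magistral, command_r in scores.values():
        if magistral > command_r:
            wins["Magistral Small 1.2"] += 1
        elif command_r > magistral:
            wins["Command-R (Mar '24)"] += 1
    return wins


wins = tally_wins(paired_scores)
print(wins)  # Magistral Small 1.2 wins all 6 paired benchmarks
```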

Frequently Asked Questions

Which is cheaper, Magistral Small 1.2 or Command-R (Mar '24)?

Neither. The two models are priced identically: $0.5 per million input tokens and $1.5 per million output tokens, which comes to $0.65 for 1M input plus 100K output tokens on either model.

Which model performs better on benchmarks?

Magistral Small 1.2 wins 11 out of 12 benchmarks compared to 1 for Command-R (Mar '24). See the detailed benchmark chart above for per-category results.

Which is faster for real-time applications?

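Magistral Small 1.2 generates 190 tok/s with a 0.33s time to first token. No output speed or TTFT measurement is listed for Command-R (Mar '24); the 0 tok/s and 0.00s values shown for it reflect missing data rather than measurements, so the two models cannot be compared directly on speed or latency from this page.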

When should I use Magistral Small 1.2 vs Command-R (Mar '24)?

Price does not separate the two, since both cost the same. Magistral Small 1.2 has the stronger benchmark results (11 of 12) and is the only one of the two with a listed generation speed (190 tok/s). For latency-sensitive apps, note that no TTFT value is listed for Command-R (Mar '24), so measure it yourself before relying on it.