Command-R (Mar '24) vs Magistral Small 1.2

Side-by-side comparison of pricing, 12 benchmarks, and generation speed.

Cohere: Command-R (Mar '24)

Input: $0.5/M
Output: $1.5/M
Speed: not listed
TTFT: not listed

Mistral: Magistral Small 1.2

Input: $0.5/M
Output: $1.5/M
Speed: 190 tok/s
TTFT: 0.33s

Winner by Category

Cheaper: Tie
Faster (tok/s): Magistral Small 1.2
Lower Latency: Command-R (Mar '24) (no TTFT value is listed for this model, so this result is unverified)
Benchmarks (1 vs 11 wins): Magistral Small 1.2

Pricing Comparison

Metric: Command-R (Mar '24) | Magistral Small 1.2
Input ($/M tokens): $0.5 | $0.5
Output ($/M tokens): $1.5 | $1.5

Cost for 1M input + 100K output tokens:
Command-R (Mar '24): $0.65
Magistral Small 1.2: $0.65
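
The $0.65 figure follows directly from the per-million-token prices above. A minimal sketch of the arithmetic is shown here; the function name `request_cost` and its parameters are illustrative and do not come from either provider's API.

```python
def request_cost(input_tokens, output_tokens, input_price_per_m, output_price_per_m):
    """Return the cost in dollars, given prices quoted per million tokens."""
    input_cost = (input_tokens / 1_000_000) * input_price_per_m
    output_cost = (output_tokens / 1_000_000) * output_price_per_m
    return input_cost + output_cost

# Both models list $0.5/M input and $1.5/M output, so one call covers either.
cost = request_cost(1_000_000, 100_000, 0.5, 1.5)
print(f"${cost:.2f}")  # $0.65
```

Because the two price lists are identical, any mix of input and output tokens costs the same on both models.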

Speed Comparison

Output Speed (tokens/s; higher is better)
Command-R (Mar '24): not listed
Magistral Small 1.2: 190 tok/s

Time to First Token (seconds; lower is better)
Command-R (Mar '24): not listed
Magistral Small 1.2: 0.33s
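
To see how the two speed metrics combine, a rough estimate of end-to-end response time is TTFT plus output length divided by output speed. The sketch below assumes a constant generation rate, which is a simplification; the function name and the 500-token response length are hypothetical and not taken from the page.

```python
def estimated_response_seconds(output_tokens, tokens_per_second, ttft_seconds):
    """Approximate total time: wait for the first token, then stream the rest."""
    return ttft_seconds + output_tokens / tokens_per_second

# Magistral Small 1.2 figures from the comparison: 190 tok/s, 0.33s TTFT.
# The 500-token response length is an assumed example value.
seconds = estimated_response_seconds(500, 190, 0.33)
print(f"{seconds:.2f}s")  # 2.96s
```

The same estimate cannot be made for Command-R (Mar '24), since no output speed or TTFT is listed for it.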

Benchmark Comparison

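Data from Artificial Analysis API (12 benchmarks). Scores are listed as Command-R (Mar '24) | Magistral Small 1.2. A single value means only one of the two models has a reported score for that benchmark.

Intelligence Index: 7.4 | 18.2
Coding Index: 14.8
Math Index: 80.3
GPQA Diamond: 28.4% | 66.3%
MMLU-Pro: 33.8% | 76.8%
LiveCodeBench: 4.8% | 72.3%
AIME 2025: 80.3%
MATH-500: 16.4%
Humanity's Last Exam: 4.8% | 6.1%
SciCode: 6.2% | 35.2%
IFBench: 44.4%
TerminalBench: 4.5%

Wins: Command-R (Mar '24) 1, Magistral Small 1.2 11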

Frequently Asked Questions

Which is cheaper, Command-R (Mar '24) or Magistral Small 1.2?

Neither. Both models are priced identically at $0.5 per million input tokens and $1.5 per million output tokens, so a workload of 1M input plus 100K output tokens costs $0.65 on either one.

Which model performs better on benchmarks?

Magistral Small 1.2 wins 11 out of 12 benchmarks compared to 1 for Command-R (Mar '24). See the detailed benchmark chart above for per-category results.

Which is faster for real-time applications?

Magistral Small 1.2 generates 190 tok/s with a time-to-first-token of 0.33s. No output speed or TTFT value is listed for Command-R (Mar '24), so the two models cannot be compared directly on speed or latency.

When should I use Command-R (Mar '24) vs Magistral Small 1.2?

Pricing is identical, so the choice comes down to performance. Magistral Small 1.2 has the stronger benchmark results (11 wins to 1) and a measured generation speed of 190 tok/s. For latency-sensitive apps, check the TTFT comparison above.