Compare/Llama 3.2 Instruct 11B (Vision) vs Mistral Small 3.2

Llama 3.2 Instruct 11B (Vision)vsMistral Small 3.2

Side-by-side comparison of pricing, 12 benchmarks, and generation speed.

Meta

Llama 3.2 Instruct 11B (Vision)

Input
$0.245/M
Output
$0.245/M
Speed
85 tok/s
TTFT
0.46s
Mistral

Mistral Small 3.2

Input
$0.087/M
Output
$0.25/M
Speed
121 tok/s
TTFT
0.43s

Winner by Category

Cheaper
Mistral Small 3.2
Faster (tok/s)
Mistral Small 3.2
Lower Latency
Mistral Small 3.2
Benchmarks (1-11)
Mistral Small 3.2

Pricing Comparison

MetricLlama 3.2 Instruct 11B (Vision)Mistral Small 3.2
Input ($/M tokens)$0.245$0.087
Output ($/M tokens)$0.245$0.25
Cost for 1M input + 100K output tokens:
Llama 3.2 Instruct 11B (Vision)$0.27
Mistral Small 3.2$0.11

Speed Comparison

Output Speed (tokens/s) — higher is better
Llama 3.2 Instruct 11B (Vision)
85 tok/s
Mistral Small 3.2
121 tok/s
Time to First Token (seconds) — lower is better
Llama 3.2 Instruct 11B (Vision)
0.46s
Mistral Small 3.2
0.43s

Benchmark Comparison

Data from Artificial Analysis API — 12 benchmarks

Intelligence Index
8.715.1
Coding Index
4.213.3
Math Index
1.727.0
GPQA Diamond
22.1%50.5%
MMLU-Pro
46.4%68.1%
LiveCodeBench
11.0%27.5%
AIME 2025
1.7%27.0%
MATH-500
51.6%88.3%
Humanity's Last Exam
5.2%4.3%
SciCode
11.2%26.4%
IFBench
30.4%33.5%
TerminalBench
0.8%6.8%
Llama 3.2 Instruct 11B (Vision)1 wins
11 winsMistral Small 3.2

Frequently Asked Questions

Which is cheaper, Llama 3.2 Instruct 11B (Vision) or Mistral Small 3.2?

Mistral Small 3.2 is cheaper overall. Its blended price (3:1 input/output ratio) is $0.13/M tokens vs $0.24/M for Llama 3.2 Instruct 11B (Vision).

Which model performs better on benchmarks?

Mistral Small 3.2 wins 11 out of 12 benchmarks compared to 1 for Llama 3.2 Instruct 11B (Vision). See the detailed benchmark chart above for per-category results.

Which is faster for real-time applications?

Mistral Small 3.2 generates tokens faster at 121 tok/s vs 85 tok/s. However, Mistral Small 3.2 has lower time-to-first-token (0.43s vs 0.46s).

When should I use Llama 3.2 Instruct 11B (Vision) vs Mistral Small 3.2?

Choose based on your priorities: Mistral Small 3.2 for lower cost, Mistral Small 3.2 for stronger benchmark performance, and Mistral Small 3.2 for faster generation. For latency-sensitive apps, check the TTFT comparison above.