Compare/Llama 3.2 Instruct 3B vs Solar Mini

Llama 3.2 Instruct 3BvsSolar Mini

Side-by-side comparison of pricing, 12 benchmarks, and generation speed.

Meta

Llama 3.2 Instruct 3B

Input
$0.15/M
Output
$0.15/M
Speed
52 tok/s
TTFT
0.72s
Upstage

Solar Mini

Input
$0.15/M
Output
$0.15/M
Speed
64 tok/s
TTFT
1.03s

Winner by Category

Cheaper
Tie
Faster (tok/s)
Solar Mini
Lower Latency
Llama 3.2 Instruct 3B
Benchmarks (9-1)
Llama 3.2 Instruct 3B

Pricing Comparison

MetricLlama 3.2 Instruct 3BSolar Mini
Input ($/M tokens)$0.15$0.15
Output ($/M tokens)$0.15$0.15
Cost for 1M input + 100K output tokens:
Llama 3.2 Instruct 3B$0.16
Solar Mini$0.16

Speed Comparison

Output Speed (tokens/s) — higher is better
Llama 3.2 Instruct 3B
52 tok/s
Solar Mini
64 tok/s
Time to First Token (seconds) — lower is better
Llama 3.2 Instruct 3B
0.72s
Solar Mini
1.03s

Benchmark Comparison

Data from Artificial Analysis API — 12 benchmarks

Intelligence Index
9.711.9
Coding Index
Math Index
3.3
GPQA Diamond
25.5%
MMLU-Pro
34.7%
LiveCodeBench
8.3%
AIME 2025
3.3%
MATH-500
48.9%33.1%
Humanity's Last Exam
5.2%
SciCode
5.2%
IFBench
26.2%
TerminalBench
Llama 3.2 Instruct 3B9 wins
1 winsSolar Mini

Frequently Asked Questions

Which is cheaper, Llama 3.2 Instruct 3B or Solar Mini?

Both models have similar pricing. Check the detailed breakdown above for input vs output token costs.

Which model performs better on benchmarks?

Llama 3.2 Instruct 3B wins 9 out of 12 benchmarks compared to 1 for Solar Mini. See the detailed benchmark chart above for per-category results.

Which is faster for real-time applications?

Solar Mini generates tokens faster at 64 tok/s vs 52 tok/s. Llama 3.2 Instruct 3B also has lower time-to-first-token (0.72s vs 1.03s).

When should I use Llama 3.2 Instruct 3B vs Solar Mini?

Choose based on your priorities: both are similarly priced, Llama 3.2 Instruct 3B for stronger benchmark performance, and Solar Mini for faster generation. For latency-sensitive apps, check the TTFT comparison above.