Qwen3 8B (Reasoning) vs ERNIE 4.5 300B A47B

Side-by-side comparison of pricing, 12 benchmarks, and generation speed.

Alibaba: Qwen3 8B (Reasoning)

Input: $0.11/M
Output: $1.15/M
Speed: 86 tok/s
TTFT: 1.03s

Baidu: ERNIE 4.5 300B A47B

Input: $0.28/M
Output: $1.10/M
Speed: 24 tok/s
TTFT: 1.63s

Winner by Category

Cheaper: Qwen3 8B (Reasoning)
Faster (tok/s): Qwen3 8B (Reasoning)
Lower Latency: Qwen3 8B (Reasoning)
Benchmarks (11 of 12 wins): ERNIE 4.5 300B A47B

Pricing Comparison

Metric | Qwen3 8B (Reasoning) | ERNIE 4.5 300B A47B
Input ($/M tokens) | $0.11 | $0.28
Output ($/M tokens) | $1.15 | $1.10

Cost for 1M input + 100K output tokens:
Qwen3 8B (Reasoning): $0.22
ERNIE 4.5 300B A47B: $0.39
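
The workload cost above follows directly from the per-million-token prices. A minimal Python sketch, assuming simple linear per-token billing with no caching or volume discounts; the function name `workload_cost` is illustrative and not part of any provider's API:

```python
def workload_cost(input_price_per_m, output_price_per_m, input_tokens, output_tokens):
    """Return the cost in USD, assuming linear per-token billing (an assumption of this sketch)."""
    input_cost = (input_tokens / 1_000_000) * input_price_per_m
    output_cost = (output_tokens / 1_000_000) * output_price_per_m
    return input_cost + output_cost

# Prices taken from the pricing table; workload is 1M input + 100K output tokens.
qwen_cost = workload_cost(0.11, 1.15, 1_000_000, 100_000)
ernie_cost = workload_cost(0.28, 1.10, 1_000_000, 100_000)

print(f"Qwen3 8B (Reasoning): ${qwen_cost:.3f}")
print(f"ERNIE 4.5 300B A47B:  ${ernie_cost:.3f}")
```

Under these assumptions Qwen3 8B (Reasoning) works out to about $0.225, which is listed above as $0.22, and ERNIE 4.5 300B A47B to $0.39.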

Speed Comparison

Output Speed (tokens/s, higher is better)
Qwen3 8B (Reasoning): 86 tok/s
ERNIE 4.5 300B A47B: 24 tok/s

Time to First Token (seconds, lower is better)
Qwen3 8B (Reasoning): 1.03s
ERNIE 4.5 300B A47B: 1.63s

Benchmark Comparison

Data from Artificial Analysis API — 12 benchmarks

Benchmark | Qwen3 8B (Reasoning) | ERNIE 4.5 300B A47B
Intelligence Index | 13.2 | 15.0
Coding Index | 9.0 | 14.5
Math Index | 19.0 | 41.3
GPQA Diamond | 58.9% | 81.1%
MMLU-Pro | 74.3% | 77.6%
LiveCodeBench | 40.6% | 46.7%
AIME 2025 | 19.0% | 41.3%
MATH-500 | 90.4% | 93.1%
Humanity's Last Exam | 4.2% | 3.5%
SciCode | 22.6% | 31.5%
IFBench | 33.5% | 39.1%
TerminalBench | 2.3% | 6.1%

Wins: Qwen3 8B (Reasoning) 1, ERNIE 4.5 300B A47B 11
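
The win tally can be reproduced from the scores listed above. A small Python sketch, assuming the first value in each pair belongs to Qwen3 8B (Reasoning) and the second to ERNIE 4.5 300B A47B (the reading that matches the 1 vs 11 tally), and that a higher score wins on every benchmark:

```python
# Scores as (Qwen3 8B (Reasoning), ERNIE 4.5 300B A47B), copied from the comparison above.
scores = {
    "Intelligence Index": (13.2, 15.0),
    "Coding Index": (9.0, 14.5),
    "Math Index": (19.0, 41.3),
    "GPQA Diamond": (58.9, 81.1),
    "MMLU-Pro": (74.3, 77.6),
    "LiveCodeBench": (40.6, 46.7),
    "AIME 2025": (19.0, 41.3),
    "MATH-500": (90.4, 93.1),
    "Humanity's Last Exam": (4.2, 3.5),
    "SciCode": (22.6, 31.5),
    "IFBench": (33.5, 39.1),
    "TerminalBench": (2.3, 6.1),
}

# Higher is assumed to be better for every benchmark in this sketch.
qwen_wins = [name for name, (qwen, ernie) in scores.items() if qwen > ernie]
ernie_wins = [name for name, (qwen, ernie) in scores.items() if ernie > qwen]

print(f"Qwen3 8B (Reasoning) wins: {len(qwen_wins)} {qwen_wins}")
print(f"ERNIE 4.5 300B A47B wins: {len(ernie_wins)}")
```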

Frequently Asked Questions

Which is cheaper, Qwen3 8B (Reasoning) or ERNIE 4.5 300B A47B?

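Qwen3 8B (Reasoning) is cheaper overall. Its blended price (3:1 input/output ratio) is $0.37/M tokens vs $0.48/M for ERNIE 4.5 300B A47B. ERNIE 4.5 300B A47B has a slightly lower output price ($1.10/M vs $1.15/M), so the gap narrows for output-heavy workloads.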
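
The blended figures are consistent with a weighted average of the input and output prices. A short Python sketch, assuming "3:1 input/output ratio" means three parts input price to one part output price; the helper `blended_price` is hypothetical:

```python
def blended_price(input_price_per_m, output_price_per_m, input_weight=3, output_weight=1):
    """Weighted average price per million tokens (weights are an assumption of this sketch)."""
    total_weight = input_weight + output_weight
    weighted_sum = input_weight * input_price_per_m + output_weight * output_price_per_m
    return weighted_sum / total_weight

qwen_blended = blended_price(0.11, 1.15)
ernie_blended = blended_price(0.28, 1.10)

print(f"Qwen3 8B (Reasoning): ${qwen_blended:.3f}/M")
print(f"ERNIE 4.5 300B A47B:  ${ernie_blended:.3f}/M")
```

With these weights Qwen3 8B (Reasoning) comes to $0.37/M and ERNIE 4.5 300B A47B to about $0.485/M, which is quoted above as $0.48/M.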

Which model performs better on benchmarks?

ERNIE 4.5 300B A47B wins 11 of the 12 benchmarks; Qwen3 8B (Reasoning) wins 1, Humanity's Last Exam (4.2% vs 3.5%). See the benchmark comparison above for per-benchmark results.

Which is faster for real-time applications?

Qwen3 8B (Reasoning) generates tokens faster, at 86 tok/s vs 24 tok/s for ERNIE 4.5 300B A47B. It also has a lower time to first token (1.03s vs 1.63s).
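
To see how the two numbers combine for a real-time application, total response time can be approximated as time to first token plus output length divided by output speed. A rough Python sketch, assuming a constant generation speed and a hypothetical 500-token response; it ignores any extra reasoning tokens a reasoning model may generate before its visible answer:

```python
def estimated_response_time(ttft_seconds, tokens_per_second, output_tokens):
    """Approximate end-to-end time in seconds, assuming constant decoding speed."""
    return ttft_seconds + output_tokens / tokens_per_second

OUTPUT_TOKENS = 500  # hypothetical response length, chosen only for illustration

qwen_time = estimated_response_time(1.03, 86, OUTPUT_TOKENS)
ernie_time = estimated_response_time(1.63, 24, OUTPUT_TOKENS)

print(f"Qwen3 8B (Reasoning): {qwen_time:.1f}s")
print(f"ERNIE 4.5 300B A47B:  {ernie_time:.1f}s")
```

For longer responses the output speed dominates the estimate, while for very short responses the time to first token matters more.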

When should I use Qwen3 8B (Reasoning) vs ERNIE 4.5 300B A47B?

Choose based on your priorities: Qwen3 8B (Reasoning) for lower cost and faster generation, ERNIE 4.5 300B A47B for stronger benchmark performance. For latency-sensitive apps, check the TTFT comparison above.