Qwen3 8B (Non-reasoning)
AI model by Alibaba. Real-time pricing and benchmark data.
Benchmarks
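| Benchmark | Score |
|---|---|
| Coding Index | 7.1 |
| Math Index | 24.3 |
| GPQA Diamond | 45.2% |
| MMLU-Pro | 64.3% |
| LiveCodeBench | 20.2% |
| AIME 2025 | 24.3% |
| MATH-500 | 82.8% |
| SciCode | 16.8% |
| IFBench | 28.6% |
| TerminalBench | 2.3% |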
Compare with similar models
| Model | Input ($/1M tokens) | Output ($/1M tokens) | Speed |
|---|---|---|---|
| Qwen3 8B (Non-reasoning) (current) | $0.18 | $0.70 | 81 tok/s |
| Llama 3.3 Instruct 70B | $0.58 | $0.71 | 99 tok/s |
| Llama 3.2 Instruct 90B (Vision) | $0.72 | $0.72 | 42 tok/s |
| Llama 4 Scout | $0.17 | $0.66 | 129 tok/s |
| NVIDIA Nemotron 3 Super 120B A12B (Reasoning) | $0.30 | $0.75 | 370 tok/s |
| Mercury 2 | $0.25 | $0.75 | 907 tok/s |
Example Costs
| Scenario | Tokens (in / out) | Cost |
|---|---|---|
| Single request | 1.0K in / 500 out | $0.0005 |
| 1K requests/day | 1.0M in / 500.0K out | $0.53 |
| 10K requests/day | 10.0M in / 5.0M out | $5.30 |
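
The example costs above can be reproduced with a short calculation. The sketch below assumes the listed Input and Output prices ($0.18 and $0.70) are USD per 1M tokens, which the page does not state outright but which matches the figures shown. The function name `request_cost` and the constants are hypothetical, chosen for illustration.

```python
# Prices for Qwen3 8B (Non-reasoning) from the comparison table.
# Assumption: both are USD per 1M tokens.
INPUT_PRICE_PER_M = 0.18
OUTPUT_PRICE_PER_M = 0.70


def request_cost(input_tokens: int, output_tokens: int, requests: int = 1) -> float:
    """Estimate the USD cost of `requests` identical requests."""
    per_request = (
        input_tokens * INPUT_PRICE_PER_M + output_tokens * OUTPUT_PRICE_PER_M
    ) / 1_000_000
    return per_request * requests


# The three scenarios from the page: 1.0K tokens in, 500 tokens out per request.
for label, n in [("Single request", 1), ("1K requests/day", 1_000), ("10K requests/day", 10_000)]:
    print(f"{label}: ${request_cost(1_000, 500, n):.5f}")
```

Under this assumption a single request comes to $0.00053, which the page rounds to $0.0005, and the daily totals scale linearly with request count.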