IBM
Granite 3.3 8B (Non-reasoning)
AI model by IBM. Real-time pricing and benchmark data.
Benchmarks
Coding Index3.4
Math Index6.7
GPQA Diamond33.8%
MMLU-Pro46.8%
LiveCodeBench12.7%
AIME 20256.7%
MATH-50066.5%
SciCode10.1%
IFBench22.4%
TerminalBench0.0%
Compare with similar models
| Model | Input | Output | Speed |
|---|---|---|---|
Granite 3.3 8B (Non-reasoning)Current | $0.03 | $0.25 | 380 tok/s |
| Llama 2 Chat 7B | $0.05 | $0.25 | 98 tok/s |
| Gemma 3 27B Instruct | $0.11 | $0.25 | 0 tok/s |
| Mistral Small 3.2 | $0.09 | $0.25 | 121 tok/s |
| Llama 3.2 Instruct 11B (Vision) | $0.24 | $0.24 | 85 tok/s |
| Nova Lite | $0.06 | $0.24 | 219 tok/s |
Compare Granite 3.3 8B (Non-reasoning) with
Example Costs
Single Request
$0.0002
1.0K in / 500 out
1K Requests/day
$0.1550
1.0M in / 500.0K out
10K Requests/day
$1.55
10.0M in / 5.0M out