IBM
Granite 3.3 8B (Non-reasoning)
AI model by IBM. Real-time pricing and benchmark data.
Benchmarks
Coding Index3.4
Math Index6.7
GPQA Diamond33.8%
MMLU-Pro46.8%
LiveCodeBench12.7%
AIME 20256.7%
MATH-50066.5%
SciCode10.1%
IFBench22.4%
TerminalBench0.0%
Compare with similar models
| Model | Input | Output | Speed |
|---|---|---|---|
Granite 3.3 8B (Non-reasoning)Current | $0.03 | $0.25 | 386 tok/s |
| Llama 2 Chat 7B | $0.05 | $0.25 | 120 tok/s |
| Mistral 7B Instruct | $0.25 | $0.25 | 188 tok/s |
| NVIDIA Nemotron 3 Nano 30B A3B (Reasoning) | $0.06 | $0.24 | 138 tok/s |
| Nova Lite | $0.06 | $0.24 | 190 tok/s |
| DeepSeek R1 Distill Qwen 32B | $0.27 | $0.27 | 59 tok/s |
Compare Granite 3.3 8B (Non-reasoning) with
Example Costs
Single Request
$0.0002
1.0K in / 500 out
1K Requests/day
$0.1550
1.0M in / 500.0K out
10K Requests/day
$1.55
10.0M in / 5.0M out