IBM
Granite 4.0 H Small
AI model by IBM. Real-time pricing and benchmark data.
Benchmarks
Coding Index: 8.5
Math Index: 13.7
GPQA Diamond: 41.6%
MMLU-Pro: 62.4%
LiveCodeBench: 25.1%
AIME 2025: 13.7%
SciCode: 20.9%
IFBench: 31.5%
TerminalBench: 2.3%
Compare with similar models
| Model | Input ($/1M tokens) | Output ($/1M tokens) | Speed |
|---|---|---|---|
| Granite 4.0 H Small (current) | $0.06 | $0.25 | 524 tok/s |
| Llama 2 Chat 7B | $0.05 | $0.25 | 120 tok/s |
| Mistral 7B Instruct | $0.25 | $0.25 | 188 tok/s |
| NVIDIA Nemotron 3 Nano 30B A3B (Reasoning) | $0.06 | $0.24 | 138 tok/s |
| Nova Lite | $0.06 | $0.24 | 190 tok/s |
| DeepSeek R1 Distill Qwen 32B | $0.27 | $0.27 | 59 tok/s |
Example Costs
| Scenario | Tokens | Cost |
|---|---|---|
| Single request | 1.0K in / 500 out | $0.0002 |
| 1K requests/day | 1.0M in / 500.0K out | $0.1850 |
| 10K requests/day | 10.0M in / 5.0M out | $1.85 |
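
The example costs above can be reproduced with a short calculation. The sketch below assumes the listed prices ($0.06 input, $0.25 output) are in USD per 1M tokens, which is inferred from the example figures rather than stated on the page. The function name `estimate_cost` is hypothetical and used only for illustration.

```python
# Assumed prices for Granite 4.0 H Small, read as USD per 1M tokens.
INPUT_PRICE_PER_M = 0.06
OUTPUT_PRICE_PER_M = 0.25
TOKENS_PER_M = 1_000_000


def estimate_cost(input_tokens: int, output_tokens: int, requests: int = 1) -> float:
    """Return the estimated USD cost for `requests` identical requests."""
    per_request = (
        input_tokens * INPUT_PRICE_PER_M / TOKENS_PER_M
        + output_tokens * OUTPUT_PRICE_PER_M / TOKENS_PER_M
    )
    return per_request * requests


if __name__ == "__main__":
    # Each scenario uses 1,000 input tokens and 500 output tokens per request.
    for label, n in [("Single request", 1), ("1K requests/day", 1_000), ("10K requests/day", 10_000)]:
        print(f"{label}: ${estimate_cost(1_000, 500, n):.4f}")
```

Under these assumptions a single request costs about $0.000185, which the page rounds to $0.0002. The daily totals scale linearly with the number of requests.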