IBM

Granite 3.3 8B (Non-reasoning)

AI model by IBM. Real-time pricing and benchmark data.

Pricing (per 1M tokens)

Input: $0.03
Output: $0.25
Blended (3:1): $0.09

Source: Artificial Analysis
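
The blended figure can be reproduced as a weighted average of the input and output prices. The sketch below assumes that "3:1" means three input tokens for every output token; the page does not spell this out, and the function name `blended_price` is hypothetical.

```python
def blended_price(input_price: float, output_price: float, ratio: float = 3.0) -> float:
    """Weighted average price per 1M tokens.

    Assumption: `ratio` is the number of input tokens per output token.
    """
    return (ratio * input_price + output_price) / (ratio + 1.0)


# Prices per 1M tokens as listed above for Granite 3.3 8B.
granite_blended = blended_price(0.03, 0.25)
print(f"{granite_blended:.3f}")  # about 0.085, which the page shows rounded as $0.09
```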

Performance

Output Speed: 380 tok/s
Time to First Token: 20300 ms

Median values from Artificial Analysis

Compare with similar models

Model                                     Input ($/1M)  Output ($/1M)  Speed
Granite 3.3 8B (Non-reasoning) (current)  $0.03         $0.25          380 tok/s
Llama 2 Chat 7B                           $0.05         $0.25          98 tok/s
Gemma 3 27B Instruct                      $0.11         $0.25          0 tok/s
Mistral Small 3.2                         $0.09         $0.25          121 tok/s
Llama 3.2 Instruct 11B (Vision)           $0.24         $0.24          85 tok/s
Nova Lite                                 $0.06         $0.24          219 tok/s

Example Costs

Single Request: $0.0002 (1.0K in / 500 out)
1K Requests/day: $0.1550 (1.0M in / 500.0K out)
10K Requests/day: $1.55 (10.0M in / 5.0M out)
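
The example costs follow from the per-1M-token prices listed under Pricing. This is a minimal sketch, assuming each request uses 1,000 input tokens and 500 output tokens as in the single-request example; `request_cost` is a hypothetical helper, not part of any published API.

```python
PER_MILLION = 1_000_000


def request_cost(input_tokens: int, output_tokens: int,
                 input_price: float = 0.03, output_price: float = 0.25) -> float:
    """Cost in USD for one request, with prices quoted per 1M tokens."""
    return (input_tokens * input_price + output_tokens * output_price) / PER_MILLION


single = request_cost(1_000, 500)   # about $0.000155, shown as $0.0002 on the page
daily_1k = 1_000 * single           # about $0.155 per day
daily_10k = 10_000 * single         # about $1.55 per day

print(f"{single:.6f} {daily_1k:.4f} {daily_10k:.2f}")
```

Output tokens account for most of the cost here: 500 output tokens cost about four times as much as 1,000 input tokens at these prices.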