NVIDIA

Nemotron Cascade 2 30B A3B

AI model by NVIDIA. Real-time pricing and benchmark data.

Pricing (per 1M tokens)

Input$0.00
Output$0.00
Blended (3:1)$0.00

Source: Artificial Analysis

Performance

Output Speed0 tok/s
Time to First Token0ms

Median values from Artificial Analysis

Compare with similar models

ModelInputOutputSpeed
Nemotron Cascade 2 30B A3BCurrent
$0.00$0.000 tok/s
gpt-oss-20B (low)
$0.06$0.20321 tok/s
gpt-oss-120B (low)
$0.15$0.60267 tok/s
gpt-oss-120B (high)
$0.15$0.60277 tok/s
GPT-5.4 nano (medium)
$0.20$1.25186 tok/s
GPT-5.4 mini (xhigh)
$0.75$4.50220 tok/s

Example Costs

Single Request
<$0.0001
1.0K in / 500 out
1K Requests/day
<$0.0001
1.0M in / 500.0K out
10K Requests/day
<$0.0001
10.0M in / 5.0M out