NVIDIA

Nemotron Cascade 2 30B A3B

AI model by NVIDIA. Real-time pricing and benchmark data.

Pricing (per 1M tokens)

Input: $0.00
Output: $0.00
Blended (3:1): $0.00

Source: Artificial Analysis
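
The blended figure can be reproduced from the input and output prices. The sketch below assumes "3:1" means three input tokens for every output token, so the blended price is a weighted average of the two; the function name `blended_price` is illustrative and not part of any published API. The sample prices are taken from the comparison table on this page.

```python
def blended_price(input_price: float, output_price: float, ratio: float = 3.0) -> float:
    """Weighted average price per 1M tokens.

    Assumes `ratio` input tokens are consumed for every 1 output token
    (3:1 by default). Prices are in USD per 1M tokens.
    """
    return (ratio * input_price + output_price) / (ratio + 1.0)


# Prices as listed on this page (USD per 1M tokens).
print(f"Nemotron Cascade 2 30B A3B: ${blended_price(0.00, 0.00):.2f}")
print(f"gpt-oss-20B (low):          ${blended_price(0.06, 0.20):.3f}")
print(f"o3:                         ${blended_price(2.00, 8.00):.2f}")
```

With both listed prices at $0.00, the blended price is also $0.00, which matches the value shown above.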

Performance

Output Speed: 0 tok/s
Time to First Token: 0 ms

Median values from Artificial Analysis
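
The two metrics can be combined into a rough end-to-end estimate. The sketch below assumes a simple model in which response time is the time to first token plus the output length divided by the output speed; this is a common approximation, not a formula stated by the source. The function name and the sample numbers in the usage line are hypothetical. A speed of 0 tok/s, as listed here, gives no usable estimate, so the function returns None in that case.

```python
from typing import Optional


def estimated_response_seconds(
    ttft_ms: float, output_tokens: int, tokens_per_second: float
) -> Optional[float]:
    """Approximate total response time in seconds.

    Assumed model: time to first token + output_tokens / output speed.
    Returns None when the speed is zero or negative (no estimate possible).
    """
    if tokens_per_second <= 0:
        return None
    return ttft_ms / 1000.0 + output_tokens / tokens_per_second


# Values listed on this page: 0 ms and 0 tok/s, so no estimate is available.
print(estimated_response_seconds(0, 500, 0))

# Hypothetical numbers for illustration only: 500 ms TTFT, 100 tok/s.
print(estimated_response_seconds(500, 500, 100))
```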

Compare with similar models

Model                                  Input    Output    Speed
Nemotron Cascade 2 30B A3B (current)   $0.00    $0.00     0 tok/s
GPT-5.4 nano (xhigh)                   $0.20    $1.25     149 tok/s
gpt-oss-20B (low)                      $0.06    $0.20     277 tok/s
gpt-oss-120B (low)                     $0.15    $0.60     264 tok/s
o3                                     $2.00    $8.00     91 tok/s
GPT-5.5 (high)                         $5.00    $30.00    65 tok/s

Example Costs

Single Request (1.0K in / 500 out): <$0.0001
1K Requests/day (1.0M in / 500.0K out): <$0.0001
10K Requests/day (10.0M in / 5.0M out): <$0.0001
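
The example costs follow from the per-1M-token prices and the token volumes listed above. The sketch below assumes each request uses 1,000 input tokens and 500 output tokens, as in the Single Request row, and that daily cost scales linearly with request count; `request_cost` and `daily_cost` are illustrative names. The o3 prices from the comparison table are included to show a non-zero result.

```python
TOKENS_PER_MILLION = 1_000_000


def request_cost(
    input_tokens: int, output_tokens: int, input_price: float, output_price: float
) -> float:
    """Cost in USD of one request, given prices in USD per 1M tokens."""
    return (
        input_tokens / TOKENS_PER_MILLION * input_price
        + output_tokens / TOKENS_PER_MILLION * output_price
    )


def daily_cost(requests_per_day: int, per_request: float) -> float:
    """Daily cost in USD, assuming every request has the same token counts."""
    return requests_per_day * per_request


# 1.0K in / 500 out per request, as in the Single Request row.
for name, in_price, out_price in [
    ("Nemotron Cascade 2 30B A3B", 0.00, 0.00),
    ("o3", 2.00, 8.00),
]:
    single = request_cost(1_000, 500, in_price, out_price)
    print(
        f"{name}: single ${single:.4f}, "
        f"1K/day ${daily_cost(1_000, single):.2f}, "
        f"10K/day ${daily_cost(10_000, single):.2f}"
    )
```

At the listed $0.00 prices every row evaluates to $0, consistent with the "<$0.0001" entries.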