Compare/Jamba 1.6 Large vs Qwen3 VL 235B A22B (Reasoning)

Jamba 1.6 LargevsQwen3 VL 235B A22B (Reasoning)

Side-by-side comparison of pricing, 12 benchmarks, and generation speed.

AI21 Labs

Jamba 1.6 Large

Input
$2/M
Output
$8/M
Speed
59 tok/s
TTFT
0.73s
Alibaba

Qwen3 VL 235B A22B (Reasoning)

Input
$0.7/M
Output
$8.4/M
Speed
58 tok/s
TTFT
1.11s

Winner by Category

Cheaper
Qwen3 VL 235B A22B (Reasoning)
Faster (tok/s)
Jamba 1.6 Large
Lower Latency
Jamba 1.6 Large
Benchmarks (1-11)
Qwen3 VL 235B A22B (Reasoning)

Pricing Comparison

MetricJamba 1.6 LargeQwen3 VL 235B A22B (Reasoning)
Input ($/M tokens)$2$0.7
Output ($/M tokens)$8$8.4
Cost for 1M input + 100K output tokens:
Jamba 1.6 Large$2.80
Qwen3 VL 235B A22B (Reasoning)$1.54

Speed Comparison

Output Speed (tokens/s) — higher is better
Jamba 1.6 Large
59 tok/s
Qwen3 VL 235B A22B (Reasoning)
58 tok/s
Time to First Token (seconds) — lower is better
Jamba 1.6 Large
0.73s
Qwen3 VL 235B A22B (Reasoning)
1.11s

Benchmark Comparison

Data from Artificial Analysis API — 12 benchmarks

Intelligence Index
10.627.6
Coding Index
20.9
Math Index
88.3
GPQA Diamond
38.7%77.2%
MMLU-Pro
56.5%83.6%
LiveCodeBench
17.2%64.6%
AIME 2025
88.3%
MATH-500
58.0%
Humanity's Last Exam
4.0%10.1%
SciCode
18.4%39.9%
IFBench
56.5%
TerminalBench
11.4%
Jamba 1.6 Large1 wins
11 winsQwen3 VL 235B A22B (Reasoning)

Frequently Asked Questions

Which is cheaper, Jamba 1.6 Large or Qwen3 VL 235B A22B (Reasoning)?

Qwen3 VL 235B A22B (Reasoning) is cheaper overall. Its blended price (3:1 input/output ratio) is $2.63/M tokens vs $3.50/M for Jamba 1.6 Large.

Which model performs better on benchmarks?

Qwen3 VL 235B A22B (Reasoning) wins 11 out of 12 benchmarks compared to 1 for Jamba 1.6 Large. See the detailed benchmark chart above for per-category results.

Which is faster for real-time applications?

Jamba 1.6 Large generates tokens faster at 59 tok/s vs 58 tok/s. Jamba 1.6 Large also has lower time-to-first-token (0.73s vs 1.11s).

When should I use Jamba 1.6 Large vs Qwen3 VL 235B A22B (Reasoning)?

Choose based on your priorities: Qwen3 VL 235B A22B (Reasoning) for lower cost, Qwen3 VL 235B A22B (Reasoning) for stronger benchmark performance, and Jamba 1.6 Large for faster generation. For latency-sensitive apps, check the TTFT comparison above.