Compare/GPT-4 Turbo vs Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort)

GPT-4 TurbovsClaude Sonnet 4.6 (Adaptive Reasoning, Max Effort)

Side-by-side comparison of pricing, 12 benchmarks, and generation speed.

OpenAI

GPT-4 Turbo

Input
$10/M
Output
$30/M
Speed
30 tok/s
TTFT
0.87s
Anthropic

Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort)

Input
$3/M
Output
$15/M
Speed
66 tok/s
TTFT
33.99s

Winner by Category

Cheaper
Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort)
Faster (tok/s)
Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort)
Lower Latency
GPT-4 Turbo
Benchmarks (3-7)
Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort)

Pricing Comparison

MetricGPT-4 TurboClaude Sonnet 4.6 (Adaptive Reasoning, Max Effort)
Input ($/M tokens)$10$3
Output ($/M tokens)$30$15
Cost for 1M input + 100K output tokens:
GPT-4 Turbo$13.00
Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort)$4.50

Speed Comparison

Output Speed (tokens/s) — higher is better
GPT-4 Turbo
30 tok/s
Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort)
66 tok/s
Time to First Token (seconds) — lower is better
GPT-4 Turbo
0.87s
Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort)
33.99s

Benchmark Comparison

Data from Artificial Analysis API — 12 benchmarks

Intelligence Index
13.751.7
Coding Index
21.550.9
Math Index
GPQA Diamond
87.5%
MMLU-Pro
69.4%
LiveCodeBench
29.1%
AIME 2025
MATH-500
73.7%
Humanity's Last Exam
3.3%30.0%
SciCode
31.9%46.8%
IFBench
56.6%
TerminalBench
53.0%
GPT-4 Turbo3 wins
7 winsClaude Sonnet 4.6 (Adaptive Reasoning, Max Effort)

Frequently Asked Questions

Which is cheaper, GPT-4 Turbo or Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort)?

Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) is cheaper overall. Its blended price (3:1 input/output ratio) is $6.00/M tokens vs $15.00/M for GPT-4 Turbo.

Which model performs better on benchmarks?

Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) wins 7 out of 12 benchmarks compared to 3 for GPT-4 Turbo. See the detailed benchmark chart above for per-category results.

Which is faster for real-time applications?

Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) generates tokens faster at 66 tok/s vs 30 tok/s. GPT-4 Turbo also has lower time-to-first-token (0.87s vs 33.99s).

When should I use GPT-4 Turbo vs Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort)?

Choose based on your priorities: Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) for lower cost, Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) for stronger benchmark performance, and Claude Sonnet 4.6 (Adaptive Reasoning, Max Effort) for faster generation. For latency-sensitive apps, check the TTFT comparison above.