All AI Models

510 models across 52 providers. Click any model for detailed pricing, benchmarks, and speed data.

OpenAI

61 models
GPT-5.5 (xhigh)
$11.2563 t/s
GPT-5.5 (high)
$11.2561 t/s
GPT-5.4 (xhigh)
$5.6380 t/s
GPT-5.5 (medium)
$11.2561 t/s
GPT-5.3 Codex (xhigh)
$4.8180 t/s
GPT-5.2 (xhigh)
$4.8169 t/s
GPT-5.5 (low)
$11.2558 t/s
GPT-5.2 Codex (xhigh)
$4.8194 t/s
GPT-5.4 mini (xhigh)
$1.69178 t/s
GPT-5.4 (low)
$5.6363 t/s
GPT-5.1 (high)
$3.44118 t/s
GPT-5.2 (medium)
$4.810 t/s
GPT-5 Codex (high)
$3.44171 t/s
GPT-5 (high)
$3.4477 t/s
GPT-5.4 nano (xhigh)
$0.46147 t/s
GPT-5.1 Codex (high)
$3.44183 t/s
GPT-5 (medium)
$3.4476 t/s
GPT-5 mini (high)
$0.6986 t/s
GPT-5.5 (Non-reasoning)
$11.2557 t/s
o3-pro
$35.0026 t/s
GPT-5 (low)
$3.4472 t/s
GPT-5 mini (medium)
$0.6977 t/s
GPT-5.1 Codex mini (high)
$0.69176 t/s
o3
$3.5095 t/s
GPT-5.4 nano (medium)
$0.46156 t/s
GPT-5.4 mini (medium)
$1.69172 t/s
GPT-5.4 (Non-reasoning)
$5.6362 t/s
GPT-5.2 (Non-reasoning)
$4.8168 t/s
gpt-oss-120B (high)
$0.26208 t/s
o4-mini (high)
$1.93152 t/s
o1
$26.2594 t/s
GPT-5.1 (Non-reasoning)
$3.44125 t/s
GPT-5 nano (high)
$0.14147 t/s
GPT-4.1
$3.50105 t/s
GPT-5 nano (medium)
$0.14157 t/s
o3-mini
$1.93143 t/s
o1-pro
$262.500 t/s
o3-mini (high)
$1.93138 t/s
gpt-oss-20B (high)
$0.09256 t/s
gpt-oss-120B (low)
$0.26232 t/s
GPT-5.4 nano (Non-Reasoning)
$0.46164 t/s
GPT-5 (minimal)
$3.4476 t/s
o1-preview
$28.880 t/s
GPT-5.4 mini (Non-Reasoning)
$1.69155 t/s
GPT-4.1 mini
$0.7080 t/s
GPT-5 (ChatGPT)
$3.44166 t/s
gpt-oss-20B (low)
$0.10266 t/s
GPT-5 mini (minimal)
$0.6986 t/s
o1-mini
$0.000 t/s
GPT-4.5 (Preview)
$0.000 t/s
GPT-4o (Aug '24)
$4.38101 t/s
GPT-4o (March 2025, chatgpt-4o-latest)
$0.000 t/s
GPT-4o (Nov '24)
$4.38126 t/s
GPT-4o (May '24)
$7.50108 t/s
GPT-4o (ChatGPT)
$0.000 t/s
GPT-5 nano (minimal)
$0.14149 t/s
GPT-4 Turbo
$15.0033 t/s
GPT-4.1 nano
$0.17112 t/s
GPT-4
$37.5030 t/s
GPT-4o mini
$0.2673 t/s
GPT-3.5 Turbo
$0.7593 t/s

Anthropic

30 models

Google

50 models
Gemini 3.1 Pro Preview
$4.50133 t/s
Gemini 3 Pro Preview (high)
$4.50125 t/s
Gemini 3 Flash Preview (Reasoning)
$1.13197 t/s
Gemini 3 Pro Preview (low)
$4.500 t/s
Gemma 4 31B (Reasoning)
$0.0035 t/s
Gemini 3 Flash Preview (Non-reasoning)
$1.13194 t/s
Gemini 2.5 Pro
$3.44132 t/s
Gemini 3.1 Flash-Lite Preview
$0.56277 t/s
Gemma 4 31B (Non-reasoning)
$0.000 t/s
Gemma 4 26B A4B (Reasoning)
$0.200 t/s
Gemini 2.5 Flash Preview (Sep '25) (Reasoning)
$0.000 t/s
Gemini 2.5 Pro Preview (Mar' 25)
$0.000 t/s
Gemini 2.5 Pro Preview (May' 25)
$3.440 t/s
Gemma 4 26B A4B (Non-reasoning)
$0.000 t/s
Gemini 2.5 Flash (Reasoning)
$0.85212 t/s
Gemini 2.5 Flash Preview (Sep '25) (Non-reasoning)
$0.000 t/s
Gemini 2.5 Flash Preview (Reasoning)
$0.000 t/s
Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning)
$0.170 t/s
Gemini 2.5 Flash (Non-reasoning)
$0.85199 t/s
Gemini 2.0 Flash Thinking Experimental (Jan '25)
$0.000 t/s
Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning)
$0.170 t/s
Gemma 4 E4B (Reasoning)
$0.540 t/s
Gemini 2.0 Flash (Feb '25)
$0.260 t/s
Gemini 2.0 Pro Experimental (Feb '25)
$0.000 t/s
Gemini 2.5 Flash Preview (Non-reasoning)
$0.000 t/s
Gemini 2.5 Flash-Lite (Reasoning)
$0.17230 t/s
Gemini 2.0 Flash (experimental)
$0.000 t/s
Gemini 1.5 Pro (Sep '24)
$0.000 t/s
Gemma 4 E2B (Reasoning)
$0.000 t/s
Gemma 4 E4B (Non-reasoning)
$0.540 t/s
Gemini 2.0 Flash-Lite (Feb '25)
$0.000 t/s
Gemini 2.0 Flash-Lite (Preview)
$0.000 t/s
Gemini 1.5 Flash (Sep '24)
$0.000 t/s
Gemini 2.5 Flash-Lite (Non-reasoning)
$0.17226 t/s
Gemini 2.0 Flash Thinking Experimental (Dec '24)
$0.000 t/s
Gemma 4 E2B (Non-reasoning)
$0.000 t/s
Gemini 1.5 Pro (May '24)
$0.000 t/s
Gemini 1.5 Flash-8B
$0.000 t/s
Gemini 1.5 Flash (May '24)
$0.000 t/s
Gemma 3 27B Instruct
$0.140 t/s
Gemini 1.0 Ultra
$0.000 t/s
Gemma 3n E4B Instruct Preview (May '25)
$0.000 t/s
Gemma 3 12B Instruct
$0.140 t/s
PALM-2
$0.000 t/s
Gemini 1.0 Pro
$0.000 t/s
Gemma 3 270M
$0.000 t/s
Gemma 3n E4B Instruct
$0.0342 t/s
Gemma 3 4B Instruct
$0.050 t/s
Gemma 3 1B Instruct
$0.000 t/s
Gemma 3n E2B Instruct
$0.000 t/s

DeepSeek

31 models

Meta

17 models

Mistral

32 models

xAI

18 models

Alibaba / Qwen

79 models
Qwen3.6 Max Preview
$2.9236 t/s
Qwen3.6 Plus
$1.1352 t/s
Qwen3.6 27B (Reasoning)
$1.3564 t/s
Qwen3.5 397B A17B (Reasoning)
$1.3552 t/s
Qwen3.6 35B A3B (Reasoning)
$0.56183 t/s
Qwen3.5 27B (Reasoning)
$0.8290 t/s
Qwen3.5 122B A10B (Reasoning)
$1.10158 t/s
Qwen3.5 397B A17B (Non-reasoning)
$1.3552 t/s
Qwen3 Max Thinking
$2.4049 t/s
Qwen3.5 Omni Plus
$1.5056 t/s
Qwen3.5 27B (Non-reasoning)
$0.8389 t/s
Qwen3.6 27B (Non-reasoning)
$1.3564 t/s
Qwen3.5 35B A3B (Reasoning)
$0.69119 t/s
Qwen3.5 122B A10B (Non-reasoning)
$1.10153 t/s
Qwen3 Max Thinking (Preview)
$2.4045 t/s
Qwen3.5 9B (Reasoning)
$0.1160 t/s
Qwen3.6 35B A3B (Non-reasoning)
$0.84182 t/s
Qwen3 Max
$3.0533 t/s
Qwen3.5 35B A3B (Non-reasoning)
$0.69121 t/s
Qwen3 235B A22B 2507 (Reasoning)
$0.8458 t/s
Qwen3 Coder Next
$0.56155 t/s
Qwen3 VL 235B A22B (Reasoning)
$2.1734 t/s
Qwen3.5 9B (Non-reasoning)
$0.000 t/s
Qwen3.5 4B (Reasoning)
$0.06202 t/s
Qwen3 Next 80B A3B (Reasoning)
$1.88171 t/s
Qwen3 Max (Preview)
$2.4047 t/s
Qwen3.5 Omni Flash
$0.28249 t/s
Qwen3 235B A22B 2507 Instruct
$0.3668 t/s
Qwen3 Coder 480B A35B Instruct
$0.6867 t/s
Qwen3 VL 32B (Reasoning)
$2.6397 t/s
Qwen3.5 4B (Non-reasoning)
$0.06191 t/s
Qwen3 30B A3B 2507 (Reasoning)
$0.67149 t/s
Qwen3 VL 235B A22B Instruct
$0.7046 t/s
Qwen3 Next 80B A3B Instruct
$0.88171 t/s
Qwen3 Coder 30B A3B Instruct
$0.35106 t/s
Qwen3 235B A22B (Reasoning)
$2.6365 t/s
Qwen3 VL 30B A3B (Reasoning)
$0.34126 t/s
QwQ 32B
$0.7431 t/s
Qwen3 4B 2507 (Reasoning)
$0.000 t/s
Qwen3 VL 32B Instruct
$1.2363 t/s
Qwen3 235B A22B (Non-reasoning)
$0.7969 t/s
Qwen3 VL 8B (Reasoning)
$0.66134 t/s
Qwen3 32B (Reasoning)
$0.28104 t/s
Qwen3.5 2B (Reasoning)
$0.040 t/s
Qwen2.5 Max
$2.8049 t/s
Qwen3 14B (Reasoning)
$0.7364 t/s
Qwen3 VL 30B A3B Instruct
$0.30124 t/s
Qwen3 Omni 30B A3B (Reasoning)
$0.4389 t/s
Qwen2.5 Instruct 72B
$0.3756 t/s
Qwen3 30B A3B (Reasoning)
$0.1867 t/s
QwQ 32B-Preview
$0.000 t/s
Qwen3 30B A3B 2507 Instruct
$0.21122 t/s
Qwen3.5 2B (Non-reasoning)
$0.04333 t/s
Qwen3 32B (Non-reasoning)
$0.26103 t/s
Qwen3 VL 8B Instruct
$0.31142 t/s
Qwen3 4B (Reasoning)
$0.40104 t/s
Qwen3 VL 4B (Reasoning)
$0.000 t/s
Qwen3 8B (Reasoning)
$0.3792 t/s
Qwen2.5 Instruct 32B
$0.000 t/s
Qwen2.5 Coder Instruct 32B
$0.000 t/s
Qwen3 4B 2507 Instruct
$0.000 t/s
Qwen3 14B (Non-reasoning)
$0.3865 t/s
Qwen3 4B (Non-reasoning)
$0.19104 t/s
Qwen3 30B A3B (Non-reasoning)
$0.1366 t/s
Qwen2.5 Turbo
$0.0972 t/s
Qwen2 Instruct 72B
$0.000 t/s
Qwen3 Omni 30B A3B Instruct
$0.43110 t/s
Qwen3 8B (Non-reasoning)
$0.1892 t/s
Qwen3.5 0.8B (Reasoning)
$0.020 t/s
Qwen2.5 Coder Instruct 7B
$0.000 t/s
Qwen3.5 0.8B (Non-reasoning)
$0.02210 t/s
Qwen3 VL 4B Instruct
$0.000 t/s
Qwen1.5 Chat 110B
$0.000 t/s
Qwen Chat 72B
$0.000 t/s
Qwen3 1.7B (Reasoning)
$0.40139 t/s
Qwen Chat 14B
$0.000 t/s
Qwen3 1.7B (Non-reasoning)
$0.19141 t/s
Qwen3 0.6B (Reasoning)
$0.40226 t/s
Qwen3 0.6B (Non-reasoning)
$0.19224 t/s

NVIDIA

17 models

Amazon

14 models

Microsoft

4 models

Cohere

4 models

Moonshot

8 models

ai2

10 models

ai21-labs

7 models

arcee

1 models

baidu

2 models

bytedance_seed

2 models

china-mobile

2 models

databricks

1 models

deepcogito

1 models

ibm

10 models

inception

1 models

inclusionai

7 models

korea-telecom

2 models

kwaikat

2 models

lg

7 models

liquidai

8 models

longcat

1 models

mbzuai

4 models

minimax

6 models

motif-technologies

1 models

nanbeige

1 models

nous-research

7 models

openbmb

1 models

openchat

1 models

perplexity

5 models

prime-intellect

1 models

reka-ai

2 models

sarvam

3 models

servicenow

2 models

snowflake

1 models

stepfun

3 models

swiss-ai-initiative

2 models

tencent

2 models

tii-uae

1 models

trillionlabs

2 models

upstage

7 models

xiaomi

9 models

zai

18 models

zyphra

1 models