All AI Models

446 models across 47 providers. Click any model for detailed pricing, benchmarks, and speed data.

OpenAI

55 models
GPT-5.4 (xhigh)
$5.6384 t/s
GPT-5.3 Codex (xhigh)
$4.8178 t/s
GPT-5.2 (xhigh)
$4.8163 t/s
GPT-5.2 Codex (xhigh)
$4.81123 t/s
GPT-5.4 mini (xhigh)
$1.69219 t/s
GPT-5.1 (high)
$3.4485 t/s
GPT-5.2 (medium)
$4.810 t/s
GPT-5 (high)
$3.4494 t/s
GPT-5 Codex (high)
$3.44216 t/s
GPT-5.4 nano (xhigh)
$0.46178 t/s
GPT-5.1 Codex (high)
$3.44139 t/s
GPT-5 (medium)
$3.4469 t/s
GPT-5 mini (high)
$0.6989 t/s
o3-pro
$35.0021 t/s
GPT-5 (low)
$3.4466 t/s
GPT-5 mini (medium)
$0.6972 t/s
GPT-5.1 Codex mini (high)
$0.69188 t/s
o3
$3.5094 t/s
GPT-5.4 nano (medium)
$0.46169 t/s
GPT-5.4 mini (medium)
$1.69200 t/s
GPT-5.4 (Non-reasoning)
$5.6361 t/s
GPT-5.2 (Non-reasoning)
$4.8163 t/s
gpt-oss-120B (high)
$0.26252 t/s
o4-mini (high)
$1.93143 t/s
o1
$26.25113 t/s
GPT-5.1 (Non-reasoning)
$3.4477 t/s
GPT-5 nano (high)
$0.14151 t/s
GPT-4.1
$3.50102 t/s
GPT-5 nano (medium)
$0.14164 t/s
o3-mini
$1.93153 t/s
o1-pro
$262.500 t/s
o3-mini (high)
$1.93147 t/s
gpt-oss-120B (low)
$0.26253 t/s
gpt-oss-20B (high)
$0.09316 t/s
GPT-5.4 nano (Non-Reasoning)
$0.46177 t/s
GPT-5 (minimal)
$3.4482 t/s
o1-preview
$28.880 t/s
GPT-5.4 mini (Non-Reasoning)
$1.69202 t/s
GPT-4.1 mini
$0.7079 t/s
GPT-5 (ChatGPT)
$3.44148 t/s
gpt-oss-20B (low)
$0.09320 t/s
GPT-5 mini (minimal)
$0.6997 t/s
o1-mini
$0.000 t/s
GPT-4.5 (Preview)
$0.000 t/s
GPT-4o (Aug '24)
$4.3884 t/s
GPT-4o (March 2025, chatgpt-4o-latest)
$0.000 t/s
GPT-4o (Nov '24)
$4.38114 t/s
GPT-4o (May '24)
$7.5075 t/s
GPT-4o (ChatGPT)
$0.000 t/s
GPT-5 nano (minimal)
$0.14158 t/s
GPT-4 Turbo
$15.0030 t/s
GPT-4.1 nano
$0.17159 t/s
GPT-4
$37.5039 t/s
GPT-4o mini
$0.2653 t/s
GPT-3.5 Turbo
$0.7597 t/s

Anthropic

28 models

Google

42 models
Gemini 3.1 Pro Preview
$4.50115 t/s
Gemini 3 Pro Preview (high)
$4.50115 t/s
Gemini 3 Flash Preview (Reasoning)
$1.13180 t/s
Gemini 3 Pro Preview (low)
$4.50111 t/s
Gemini 3 Flash Preview (Non-reasoning)
$1.13190 t/s
Gemini 2.5 Pro
$3.44131 t/s
Gemini 3.1 Flash-Lite Preview
$0.56216 t/s
Gemini 2.5 Flash Preview (Sep '25) (Reasoning)
$0.000 t/s
Gemini 2.5 Pro Preview (Mar' 25)
$0.000 t/s
Gemini 2.5 Pro Preview (May' 25)
$3.440 t/s
Gemini 2.5 Flash (Reasoning)
$0.85226 t/s
Gemini 2.5 Flash Preview (Sep '25) (Non-reasoning)
$0.000 t/s
Gemini 2.5 Flash Preview (Reasoning)
$0.000 t/s
Gemini 2.5 Flash-Lite Preview (Sep '25) (Reasoning)
$0.17365 t/s
Gemini 2.5 Flash (Non-reasoning)
$0.85212 t/s
Gemini 2.0 Flash Thinking Experimental (Jan '25)
$0.000 t/s
Gemini 2.5 Flash-Lite Preview (Sep '25) (Non-reasoning)
$0.17344 t/s
Gemini 2.0 Flash (Feb '25)
$0.260 t/s
Gemini 2.0 Pro Experimental (Feb '25)
$0.000 t/s
Gemini 2.5 Flash Preview (Non-reasoning)
$0.000 t/s
Gemini 2.5 Flash-Lite (Reasoning)
$0.17318 t/s
Gemini 2.0 Flash (experimental)
$0.000 t/s
Gemini 1.5 Pro (Sep '24)
$0.000 t/s
Gemini 2.0 Flash-Lite (Feb '25)
$0.000 t/s
Gemini 2.0 Flash-Lite (Preview)
$0.000 t/s
Gemini 1.5 Flash (Sep '24)
$0.000 t/s
Gemini 2.5 Flash-Lite (Non-reasoning)
$0.17328 t/s
Gemini 2.0 Flash Thinking Experimental (Dec '24)
$0.000 t/s
Gemini 1.5 Pro (May '24)
$0.000 t/s
Gemini 1.5 Flash-8B
$0.000 t/s
Gemini 1.5 Flash (May '24)
$0.000 t/s
Gemma 3 27B Instruct
$0.0032 t/s
Gemma 3n E4B Instruct Preview (May '25)
$0.000 t/s
Gemini 1.0 Ultra
$0.000 t/s
Gemma 3 12B Instruct
$0.0032 t/s
PALM-2
$0.000 t/s
Gemini 1.0 Pro
$0.000 t/s
Gemma 3 270M
$0.000 t/s
Gemma 3n E4B Instruct
$0.0327 t/s
Gemma 3 4B Instruct
$0.0031 t/s
Gemma 3 1B Instruct
$0.0046 t/s
Gemma 3n E2B Instruct
$0.0041 t/s

DeepSeek

25 models

Meta

16 models

Mistral

31 models

xAI

14 models

Alibaba / Qwen

71 models
Qwen3.5 397B A17B (Reasoning)
$1.3554 t/s
Qwen3.5 27B (Reasoning)
$0.8290 t/s
Qwen3.5 122B A10B (Reasoning)
$1.10134 t/s
Qwen3.5 397B A17B (Non-reasoning)
$1.3555 t/s
Qwen3 Max Thinking
$2.4034 t/s
Qwen3.5 27B (Non-reasoning)
$0.8290 t/s
Qwen3.5 35B A3B (Reasoning)
$0.69111 t/s
Qwen3.5 122B A10B (Non-reasoning)
$1.10148 t/s
Qwen3 Max Thinking (Preview)
$2.4045 t/s
Qwen3.5 9B (Reasoning)
$0.1155 t/s
Qwen3 Max
$2.4033 t/s
Qwen3.5 35B A3B (Non-reasoning)
$0.69119 t/s
Qwen3 235B A22B 2507 (Reasoning)
$2.6345 t/s
Qwen3 Coder Next
$0.60165 t/s
Qwen3 VL 235B A22B (Reasoning)
$2.6358 t/s
Qwen3.5 9B (Non-reasoning)
$0.000 t/s
Qwen3.5 4B (Reasoning)
$0.000 t/s
Qwen3 Next 80B A3B (Reasoning)
$1.88151 t/s
Qwen3 Max (Preview)
$2.4047 t/s
Qwen3 235B A22B 2507 Instruct
$1.2372 t/s
Qwen3 Coder 480B A35B Instruct
$3.0072 t/s
Qwen3 VL 32B (Reasoning)
$2.6397 t/s
Qwen3.5 4B (Non-reasoning)
$0.000 t/s
Qwen3 30B A3B 2507 (Reasoning)
$0.75142 t/s
Qwen3 VL 235B A22B Instruct
$1.2357 t/s
Qwen3 Next 80B A3B Instruct
$0.88154 t/s
Qwen3 Coder 30B A3B Instruct
$0.9026 t/s
Qwen3 235B A22B (Reasoning)
$2.6349 t/s
QwQ 32B
$0.7433 t/s
Qwen3 VL 30B A3B (Reasoning)
$0.75130 t/s
Qwen3 4B 2507 (Reasoning)
$0.000 t/s
Qwen3 VL 32B Instruct
$1.2378 t/s
Qwen3 235B A22B (Non-reasoning)
$1.2347 t/s
Qwen3 VL 8B (Reasoning)
$0.66135 t/s
Qwen3 32B (Reasoning)
$2.63103 t/s
Qwen3.5 2B (Reasoning)
$0.000 t/s
Qwen2.5 Max
$2.8048 t/s
Qwen3 14B (Reasoning)
$1.3165 t/s
Qwen3 VL 30B A3B Instruct
$0.35129 t/s
Qwen3 Omni 30B A3B (Reasoning)
$0.43102 t/s
Qwen2.5 Instruct 72B
$0.0056 t/s
Qwen3 30B A3B (Reasoning)
$0.7568 t/s
QwQ 32B-Preview
$0.1459 t/s
Qwen3 30B A3B 2507 Instruct
$0.3548 t/s
Qwen3.5 2B (Non-reasoning)
$0.000 t/s
Qwen3 32B (Non-reasoning)
$1.23104 t/s
Qwen3 VL 8B Instruct
$0.31141 t/s
Qwen3 4B (Reasoning)
$0.40103 t/s
Qwen3 VL 4B (Reasoning)
$0.000 t/s
Qwen3 8B (Reasoning)
$0.6690 t/s
Qwen2.5 Instruct 32B
$0.000 t/s
Qwen2.5 Coder Instruct 32B
$0.000 t/s
Qwen3 4B 2507 Instruct
$0.000 t/s
Qwen3 14B (Non-reasoning)
$0.6166 t/s
Qwen3 4B (Non-reasoning)
$0.19105 t/s
Qwen3 30B A3B (Non-reasoning)
$0.3568 t/s
Qwen2.5 Turbo
$0.0968 t/s
Qwen2 Instruct 72B
$0.000 t/s
Qwen3 Omni 30B A3B Instruct
$0.43108 t/s
Qwen3 8B (Non-reasoning)
$0.3181 t/s
Qwen3.5 0.8B (Reasoning)
$0.000 t/s
Qwen2.5 Coder Instruct 7B
$0.000 t/s
Qwen3.5 0.8B (Non-reasoning)
$0.000 t/s
Qwen3 VL 4B Instruct
$0.000 t/s
Qwen1.5 Chat 110B
$0.000 t/s
Qwen Chat 72B
$0.000 t/s
Qwen3 1.7B (Reasoning)
$0.40140 t/s
Qwen Chat 14B
$0.000 t/s
Qwen3 1.7B (Non-reasoning)
$0.19139 t/s
Qwen3 0.6B (Reasoning)
$0.40205 t/s
Qwen3 0.6B (Non-reasoning)
$0.19211 t/s

NVIDIA

15 models

Amazon

13 models

Microsoft

4 models

Cohere

4 models

Moonshot

6 models

ai2

10 models

ai21-labs

7 models

baidu

2 models

bytedance_seed

2 models

databricks

1 models

deepcogito

1 models

ibm

7 models

inception

1 models

inclusionai

5 models

korea-telecom

2 models

kwaikat

1 models

lg

6 models

liquidai

8 models

longcat

1 models

mbzuai

4 models

minimax

6 models

motif-technologies

1 models

nanbeige

1 models

nous-research

7 models

openchat

1 models

perplexity

5 models

prime-intellect

1 models

reka-ai

2 models

sarvam

3 models

servicenow

2 models

snowflake

1 models

stepfun

2 models

swiss-ai-initiative

2 models

tii-uae

1 models

trillionlabs

2 models

upstage

6 models

xiaomi

5 models

zai

15 models