Live speed data from Artificial Analysis API

AI Model Speed Rankings

Compare 510+ AI models by response speed, latency, and throughput. Find the fastest models for your use case.

510 models · click headers to sort
#
Model
Throughput
TTFT
$/1M
$/Speed
Price×TTFT
1
Mercury 2
Inception
736 t/s
3.75s
$0.38
$0.001
1408.1
2
380 t/s
20.30s
$0.09
$0.000
1725.5
3
341 t/s
640ms
$0.06
$0.000
39.0
4
333 t/s
229ms
$0.04
$0.000
9.2
5
308 t/s
580ms
$0.13
$0.000
76.0
6
284 t/s
260ms
$1.2
$0.004
312.0
7
283 t/s
345ms
$0.10
$0.000
34.5
8
277 t/s
5.00s
$0.56
$0.002
2813.9
9
267 t/s
8.77s
$0.11
$0.000
938.0
10
266 t/s
420ms
$0.10
$0.000
39.9
11
256 t/s
342ms
$0.09
$0.000
30.1
12
249 t/s
1.05s
$0.28
$0.001
290.1
13
247 t/s
1.21s
$0.00
14
234 t/s
714ms
$0.30
$0.001
214.2
15
232 t/s
549ms
$0.26
$0.001
143.8
16
230 t/s
19.07s
$0.17
$0.001
3336.5
17
LFM2 24B A2B
Liquid AI
227 t/s
292ms
$0.05
$0.000
15.2
18
227 t/s
865ms
$0.85
$0.004
735.3
19
226 t/s
911ms
$0.40
$0.002
362.6
20
226 t/s
1.36s
$0.17
$0.001
238.7
21
224 t/s
877ms
$0.19
$0.001
164.9
22
223 t/s
478ms
$0.10
$0.000
47.8
23
217 t/s
397ms
$0.15
$0.001
59.5
24
Nova Lite
Amazon
214 t/s
662ms
$0.10
$0.000
69.5
25
212 t/s
10.54s
$0.85
$0.004
8955.6
26
210 t/s
282ms
$0.02
$0.000
5.6
27
208 t/s
545ms
$0.26
$0.001
142.8
28
202 t/s
227ms
$0.06
$0.000
13.6
29
199 t/s
524ms
$0.85
$0.004
445.4
30
197 t/s
5.65s
$1.1
$0.006
6358.5
31
194 t/s
780ms
$1.1
$0.006
877.5
32
191 t/s
231ms
$0.06
$0.000
13.9
33
183 t/s
1.41s
$0.56
$0.003
787.6
34
183 t/s
3.80s
$3.4
$0.019
13067.8
35
182 t/s
798ms
$0.25
$0.001
199.5
36
182 t/s
1.39s
$0.84
$0.005
1171.5
37
178 t/s
3.93s
$1.7
$0.009
6640.6
38
176 t/s
5.30s
$0.69
$0.004
3645.7
39
173 t/s
467ms
$0.35
$0.002
163.5
40
172 t/s
3.48s
$1.7
$0.010
5865.8
41
171 t/s
1.07s
$1.9
$0.011
1998.8
42
171 t/s
1.06s
$0.88
$0.005
931.9
43
171 t/s
6.50s
$3.4
$0.020
22329.8
44
166 t/s
655ms
$3.4
$0.021
2251.9
45
165 t/s
573ms
$0.26
$0.002
150.1
46
164 t/s
503ms
$0.46
$0.003
232.9
47
164 t/s
793ms
$0.00
48
161 t/s
968ms
$0.41
$0.003
398.8
49
161 t/s
506ms
$0.26
$0.002
132.6
50
159 t/s
2.26s
$0.00
51
158 t/s
1.04s
$1.1
$0.007
1147.3
52
158 t/s
10.59s
$0.85
$0.005
9002.4
53
158 t/s
551ms
$0.30
$0.002
165.3
54
157 t/s
511ms
$1.5
$0.010
766.5
55
157 t/s
34.56s
$0.14
$0.001
4769.1
56
156 t/s
3.90s
$0.46
$0.003
1805.7
57
155 t/s
597ms
$1.7
$0.011
1007.7
58
155 t/s
863ms
$0.56
$0.004
485.9
59
154 t/s
881ms
$0.15
$0.001
132.1
60
154 t/s
493ms
$0.10
$0.001
51.3
61
153 t/s
506ms
$0.14
$0.001
69.8
62
153 t/s
818ms
$3.0
$0.020
2454.0
63
153 t/s
1.11s
$1.1
$0.007
1221.0
64
152 t/s
19.29s
$1.9
$0.013
37129.4
65
152 t/s
2.31s
$0.00
66
152 t/s
5.62s
$0.85
$0.006
4780.4
67
149 t/s
1.01s
$0.67
$0.005
683.1
68
149 t/s
20.02s
$0.85
$0.006
17017.8
69
149 t/s
794ms
$0.14
$0.001
109.6
70
148 t/s
5.24s
$3.4
$0.023
18025.4
71
148 t/s
743ms
$3.4
$0.023
2554.4
72
148 t/s
1.26s
$0.00
73
147 t/s
3.04s
$0.46
$0.003
1405.2
74
147 t/s
72.61s
$0.14
$0.001
10019.9
75
o3-mini
OpenAI
143 t/s
9.20s
$1.9
$0.013
17715.8
76
142 t/s
1.08s
$0.31
$0.002
333.6
77
142 t/s
1.60s
$0.15
$0.001
240.1
78
141 t/s
1.43s
$0.15
$0.001
214.4
79
141 t/s
1.05s
$0.19
$0.001
196.8
80
139 t/s
8.33s
$2.2
$0.016
18217.3
81
139 t/s
977ms
$0.40
$0.003
388.8
82
139 t/s
700ms
$0.09
$0.001
60.2
83
139 t/s
1.19s
$0.00
84
138 t/s
24.49s
$1.9
$0.014
47145.2
85
137 t/s
1.56s
$0.15
$0.001
234.6
86
134 t/s
1.14s
$0.66
$0.005
749.1
87
134 t/s
635ms
$0.40
$0.003
250.8
88
133 t/s
546ms
$0.29
$0.002
159.4
89
133 t/s
23.51s
$4.5
$0.034
105777.0
90
132 t/s
16.14s
$3.4
$0.026
55506.5
91
131 t/s
13.69s
$3.4
$0.026
47066.2
92
128 t/s
416ms
$0.06
$0.000
26.2
93
126 t/s
628ms
$4.4
$0.035
2747.5
94
126 t/s
1.09s
$0.34
$0.003
368.1
95
125 t/s
727ms
$3.4
$0.027
2499.4
96
125 t/s
69.37s
$4.5
$0.036
312174.0
97
124 t/s
316ms
$0.00
98
124 t/s
1.02s
$0.30
$0.002
305.7
99
123 t/s
947ms
$0.15
$0.001
144.9
100
122 t/s
1.14s
$0.21
$0.002
243.2
Showing top 100 of 510 models. Use search/filter to narrow down.

Speed Metrics Guide

Throughput (tokens/s)

Output generation speed in tokens per second. Higher is better.

Good: >50 t/s · Excellent: >100 t/s
Time to First Token (TTFT)

Delay before the first token appears. Lower is better.

Good: <500ms · Excellent: <200ms
Price/Performance

Cost efficiency ratios. Lower values indicate better value.

$/Speed: price per t/s · Price×TTFT: latency penalty

Compare pricing for all models side by side

Open AI API Cost Calculator →