Live data from Artificial Analysis API

AI Model Benchmarks

Compare 446+ AI models across 12 benchmarks — Intelligence, Coding, Math, Science, and more. Data updated hourly.

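| # | Model | Speed | $/1M tokens | AA Index | GPQA | MMLU-Pro | LiveCodeBench | AIME | HLE |
|---|---|---|---|---|---|---|---|---|---|
| 1 | | 84 t/s | $5.6 | 57.2 | 92.0% | - | - | - | 41.6% |
| 2 | | 115 t/s | $4.5 | 57.2 | 94.1% | - | - | - | 44.7% |
| 3 | | 78 t/s | $4.8 | 54.0 | 91.5% | - | - | - | 39.9% |
| 4 | | 54 t/s | $10.0 | 53.0 | 89.6% | - | - | - | 36.7% |
| 5 | | 66 t/s | $6.0 | 51.7 | 87.5% | - | - | - | 30.0% |
| 6 | | 63 t/s | $4.8 | 51.3 | 90.3% | 87.4% | 88.9% | 99.0% | 35.4% |
| 7 | | 60 t/s | $1.6 | 49.8 | 82.0% | - | - | - | 27.2% |
| 8 | | 64 t/s | $10.0 | 49.7 | 86.6% | 89.5% | 87.1% | 91.3% | 28.4% |
| 9 | | 46 t/s | $0.53 | 49.6 | 87.4% | - | - | - | 28.1% |
| 10 | | 83 t/s | $1.5 | 49.2 | 87.0% | - | - | - | 28.3% |
| 11 | | 123 t/s | $4.8 | 49.0 | 89.9% | - | - | - | 33.5% |
| 12 | | 238 t/s | $3.0 | 48.5 | 88.5% | - | - | - | 30.0% |
| 13 | | 115 t/s | $4.5 | 48.4 | 90.8% | 89.8% | 91.7% | 95.7% | 37.2% |
| 14 | | 219 t/s | $1.7 | 48.1 | 87.5% | - | - | - | 26.6% |
| 15 | | 85 t/s | $3.4 | 47.7 | 87.3% | 87.0% | 86.8% | 94.0% | 26.5% |
| 16 | | 36 t/s | $1.2 | 46.8 | 87.9% | - | - | - | 29.4% |
| 17 | | - | $0.00 | 46.8 | 84.7% | - | - | - | 25.4% |
| 18 | | - | $4.8 | 46.6 | 86.4% | 85.9% | 89.4% | 96.7% | 24.9% |
| 19 | | 55 t/s | $10.0 | 46.5 | 84.0% | - | - | - | 18.6% |
| 20 | | 180 t/s | $1.1 | 46.4 | 89.8% | 89.0% | 90.8% | 97.0% | 34.7% |
| 21 | | 54 t/s | $1.4 | 45.0 | 89.3% | - | - | - | 27.3% |
| 22 | | 94 t/s | $3.4 | 44.6 | 85.4% | 87.1% | 84.6% | 94.3% | 26.5% |
| 23 | | 216 t/s | $3.4 | 44.6 | 83.7% | 86.5% | 84.0% | 98.7% | 25.6% |
| 24 | | 178 t/s | $0.46 | 44.4 | 81.7% | - | - | - | 26.5% |
| 25 | | 57 t/s | $6.0 | 44.4 | 79.9% | - | - | - | 13.2% |
| 26 | | - | $0.00 | 43.4 | 82.8% | - | - | - | 19.9% |
| 27 | | 139 t/s | $3.4 | 43.1 | 86.0% | 86.0% | 84.9% | 95.7% | 23.4% |
| 28 | | 62 t/s | $10.0 | 43.1 | 81.0% | 88.9% | 73.8% | 62.7% | 12.9% |
| 29 | | 58 t/s | $6.0 | 43.0 | 83.4% | 87.5% | 71.4% | 88.0% | 17.3% |
| 30 | | 63 t/s | $6.0 | 42.6 | 79.7% | - | - | - | 10.8% |
| 31 | | 90 t/s | $0.82 | 42.1 | 85.8% | - | - | - | 22.2% |
| 32 | | 79 t/s | $1.0 | 42.1 | 85.9% | 85.6% | 89.4% | 95.0% | 25.1% |
| 33 | | 69 t/s | $3.4 | 42.0 | 84.2% | 86.7% | 70.3% | 91.7% | 23.5% |
| 34 | | 41 t/s | $30.0 | 42.0 | 80.9% | 88.0% | 65.4% | 80.3% | 11.9% |
| 35 | | 47 t/s | $0.53 | 41.9 | 84.8% | - | - | - | 19.1% |
| 36 | | 31 t/s | $0.32 | 41.7 | 84.0% | 86.2% | 86.2% | 92.0% | 22.2% |
| 37 | | 134 t/s | $1.1 | 41.6 | 85.7% | - | - | - | 23.4% |
| 38 | | 131 t/s | $0.15 | 41.5 | 83.5% | - | - | - | 20.0% |
| 39 | | 47 t/s | $6.0 | 41.5 | 87.7% | 86.6% | 81.9% | 92.7% | 23.9% |
| 40 | | 111 t/s | $4.5 | 41.3 | 88.7% | 89.5% | 85.7% | 86.7% | 27.6% |
| 41 | | 89 t/s | $0.69 | 41.2 | 82.8% | 83.7% | 83.8% | 90.7% | 19.7% |
| 42 | | 104 t/s | $1.1 | 40.9 | 83.8% | 84.8% | 85.3% | 94.7% | 22.3% |
| 43 | o3-pro (OpenAI) | 21 t/s | $35.0 | 40.7 | 84.5% | - | - | - | - |
| 44 | | 62 t/s | $1.6 | 40.6 | 66.6% | - | - | - | 7.2% |
| 45 | | 55 t/s | $1.4 | 40.1 | 86.1% | - | - | - | 18.8% |
| 46 | | 34 t/s | $2.4 | 39.9 | 86.1% | - | - | - | 26.2% |
| 47 | | 50 t/s | $0.53 | 39.4 | 83.0% | 87.5% | 81.0% | 82.7% | 22.2% |
| 48 | | 66 t/s | $3.4 | 39.2 | 80.8% | 86.0% | 76.3% | 83.0% | 18.4% |
| 49 | | 125 t/s | $0.15 | 39.2 | 84.6% | 84.3% | 86.8% | 96.3% | 21.1% |
| 50 | | 40 t/s | $30.0 | 39.0 | 79.6% | 87.3% | 63.6% | 73.3% | 11.7% |
| 51 | | 72 t/s | $0.69 | 38.9 | 80.3% | 82.8% | 69.2% | 85.0% | 14.6% |
| 52 | | 57 t/s | $6.0 | 38.7 | 77.7% | 84.2% | 65.5% | 74.3% | 9.6% |
| 53 | | 188 t/s | $0.69 | 38.6 | 81.3% | 82.0% | 83.6% | 91.7% | 16.9% |
| 54 | | 139 t/s | $0.28 | 38.6 | 85.3% | 85.4% | 82.2% | 89.3% | 17.6% |
| 55 | o3 (OpenAI) | 94 t/s | $3.5 | 38.4 | 82.7% | 85.3% | 80.8% | 88.3% | 20.0% |
| 56 | | 169 t/s | $0.46 | 38.1 | 76.1% | - | - | - | 14.7% |
| 57 | | 93 t/s | $0.15 | 37.8 | 83.1% | - | - | - | 19.1% |
| 58 | | 200 t/s | $1.7 | 37.7 | 82.3% | - | - | - | 17.1% |
| 59 | | 33 t/s | $1.2 | 37.3 | 78.9% | - | - | - | 12.3% |
| 60 | | 90 t/s | $0.82 | 37.2 | 84.2% | - | - | - | 13.2% |
| 61 | | 139 t/s | $2.0 | 37.1 | 67.2% | 76.0% | 61.5% | 83.7% | 9.7% |
| 62 | | 111 t/s | $0.69 | 37.1 | 84.5% | - | - | - | 19.7% |
| 63 | | 53 t/s | $6.0 | 37.1 | 72.7% | 86.0% | 59.0% | 37.0% | 7.1% |
| 64 | MiniMax-M2 (MiniMax) | 49 t/s | $0.53 | 36.1 | 77.7% | 82.0% | 82.6% | 78.3% | 12.5% |
| 65 | | 363 t/s | $0.41 | 36.0 | 80.0% | - | - | - | 19.2% |
| 66 | | 38 t/s | $0.53 | 36.0 | 76.4% | 81.3% | 74.7% | 94.7% | 33.4% |
| 67 | | 35 t/s | $30.0 | 36.0 | - | - | - | - | - |
| 68 | | 148 t/s | $1.1 | 35.9 | 82.7% | - | - | - | 14.8% |
| 69 | | 154 t/s | $3.4 | 35.7 | 78.5% | 83.0% | 73.0% | 89.0% | 8.9% |
| 70 | | 61 t/s | $5.6 | 35.4 | 74.8% | - | - | - | 10.6% |
| 71 | | 120 t/s | $0.28 | 35.1 | 84.7% | 85.0% | 83.2% | 89.7% | 17.0% |
| 72 | | 190 t/s | $1.1 | 35.0 | 81.2% | 88.2% | 79.7% | 55.7% | 14.1% |
| 73 | | - | $6.0 | 34.7 | 77.2% | 83.7% | 47.3% | 56.3% | 10.3% |
| 74 | | 131 t/s | $3.4 | 34.6 | 84.4% | 86.2% | 80.1% | 87.7% | 21.1% |
| 75 | | 77 t/s | $0.94 | 34.2 | 66.4% | 79.4% | 56.2% | 48.0% | 6.1% |
| 76 | | - | $0.80 | 33.9 | 79.2% | 85.1% | 79.8% | 89.7% | 15.2% |
| 77 | | 63 t/s | $4.8 | 33.6 | 71.2% | 81.4% | 66.9% | 51.0% | 7.3% |
| 78 | | 216 t/s | $0.56 | 33.5 | 82.2% | - | - | - | 16.2% |
| 79 | Doubao Seed Code (ByteDance Seed) | - | $0.00 | 33.5 | 76.4% | 85.4% | 76.6% | 79.3% | 13.3% |
| 80 | | 252 t/s | $0.26 | 33.3 | 78.2% | 80.8% | 87.8% | 93.4% | 18.5% |
| 81 | | 143 t/s | $1.9 | 33.1 | 78.4% | 83.2% | 85.9% | 90.7% | 17.5% |
| 82 | | 39 t/s | $30.0 | 33.0 | 70.1% | 86.0% | 54.2% | 36.3% | 5.9% |
| 83 | | 51 t/s | $6.0 | 33.0 | 68.3% | 83.7% | 44.9% | 38.0% | 4.0% |
| 84 | | 32 t/s | $0.32 | 32.9 | 79.7% | 85.0% | 78.9% | 87.7% | 13.8% |
| 85 | Mercury 2 (Inception) | 907 t/s | $0.38 | 32.8 | 77.0% | - | - | - | 15.5% |
| 86 | | 84 t/s | $0.98 | 32.5 | 78.0% | 82.9% | 69.5% | 86.0% | 13.3% |
| 87 | | 45 t/s | $2.4 | 32.5 | 77.6% | 82.4% | 53.5% | 82.3% | 12.0% |
| 88 | | 55 t/s | $0.11 | 32.4 | 80.6% | - | - | - | 13.3% |
| 89 | | 33 t/s | $0.32 | 32.1 | 75.1% | 83.7% | 59.3% | 59.0% | 10.5% |
| 90 | | 196 t/s | $0.35 | 32.1 | 79.1% | 82.8% | 69.6% | 84.7% | 11.1% |
| 91 | K-EXAONE (Reasoning) (LG AI Research) | - | $0.00 | 32.1 | 78.3% | 83.8% | 76.8% | 90.3% | 13.1% |
| 92 | | 149 t/s | $3.4 | 31.9 | 75.1% | 82.2% | 63.8% | 63.3% | 5.2% |
| 93 | Qwen3 Max (Alibaba) | 33 t/s | $2.4 | 31.4 | 76.4% | 84.1% | 76.7% | 80.7% | 11.1% |
| 94 | | 120 t/s | $2.0 | 31.1 | 64.6% | 80.0% | 51.1% | 39.0% | 4.3% |
| 95 | | - | $0.00 | 31.1 | 79.3% | 84.2% | 71.3% | 78.3% | 12.7% |
| 96 | | 64 t/s | $1.1 | 30.9 | 76.7% | 81.9% | 61.0% | 57.3% | 6.3% |
| 97 | o1 (OpenAI) | 113 t/s | $26.3 | 30.8 | 74.7% | 84.1% | 67.9% | 72.3% | 7.7% |
| 98 | | - | $6.0 | 30.8 | 65.6% | 80.3% | 39.4% | 21.0% | 4.8% |
| 99 | | 119 t/s | $0.69 | 30.7 | 81.9% | - | - | - | 12.8% |
| 100 | | 130 t/s | $0.15 | 30.4 | 65.6% | 74.4% | 40.2% | 67.7% | 8.0% |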
Showing top 100 of 446 models.
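
For readers who want to compare price against score rather than sort by a single column, here is a minimal Python sketch. It uses only the six rows whose model names appear in the table above, and it ranks them by AA Index points per dollar. The `Row` type, the `index_per_dollar` metric, and the `rank_by_value` helper are illustrative assumptions, not something the page or the Artificial Analysis API defines.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Row:
    model: str
    price_per_1m: float  # USD, taken from the "$/1M" column
    aa_index: float      # taken from the "AA Index" column


# The six rows whose model names are visible in the table.
ROWS = [
    Row("o3-pro", 35.0, 40.7),
    Row("o3", 3.5, 38.4),
    Row("MiniMax-M2", 0.53, 36.1),
    Row("Mercury 2", 0.38, 32.8),
    Row("Qwen3 Max", 2.4, 31.4),
    Row("o1", 26.3, 30.8),
]


def index_per_dollar(row: Row) -> float:
    """Hypothetical value metric: AA Index points per dollar of the $/1M price."""
    return row.aa_index / row.price_per_1m


def rank_by_value(rows):
    """Sort rows from best to worst value, skipping $0.00 rows (division by zero)."""
    priced = [r for r in rows if r.price_per_1m > 0]
    return sorted(priced, key=index_per_dollar, reverse=True)


if __name__ == "__main__":
    for r in rank_by_value(ROWS):
        print(f"{r.model:<12} {index_per_dollar(r):6.1f}")
```

A ratio like this is only one way to weigh cost against quality. It favors very cheap models heavily, so it is best read alongside the raw columns rather than in place of them.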

Benchmark Guide

Intelligence: Composite score across math, science, and coding

GPQA Diamond: Graduate-level science Q&A

MMLU-Pro: Knowledge and reasoning across 14 subject areas

LiveCodeBench: Live coding benchmark with new problems

AIME 2025: American Invitational Mathematics Examination

MATH-500: Competition-level math problems

HLE: Humanity's Last Exam, hardest questions

Coding: Composite coding benchmark score

Math: Composite math benchmark score

SciCode: Scientific coding problems

IFBench: Instruction-following benchmark

TerminalBench: Terminal/CLI task completion