Compare 446+ AI models across 12 benchmarks — Intelligence, Coding, Math, Science, and more. Data updated hourly.
Composite score across math, science, coding
Graduate-level science Q&A (Diamond)
Knowledge & reasoning across 57 subjects
Live coding benchmark with new problems
American Invitational Math Exam
Competition-level math problems
Humanity's Last Exam - hardest questions
Composite coding benchmark score
Composite math benchmark score
Scientific coding problems
Instruction following benchmark
Terminal/CLI task completion
Compare pricing for all models side by side
Open AI API Cost Calculator →