AIME 2025
reasoning
AIME (American Invitational Mathematics Examination) 2025 consists of 15 extremely challenging mathematics problems. AIME is a prestigious competition that serves as a qualifier for the USA Mathematical Olympiad.
Models Tested: 7
Best Score: 96.7
Average Score: 87.3
Scale Range: 0–100
Weight: 1.4x
How It Works
Models solve 15 problems, each with an integer answer from 0 to 999. The problems require sophisticated mathematical reasoning across algebra, geometry, number theory, and combinatorics. Because the problems were published in 2025, they are unlikely to appear in the training data of models trained before then.
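The grading described above can be sketched in a few lines. This is a minimal illustration, not the leaderboard's actual harness: the function names `parse_answer` and `score`, the rule that unparseable output counts as wrong, and the mapping of the fraction correct onto the 0–100 scale are all assumptions.

```python
from typing import Optional, Sequence


def parse_answer(text: str) -> Optional[int]:
    """Return a valid AIME answer (integer 0-999) or None if the text is not one.

    Hypothetical helper: leading zeros such as "073" are accepted because
    AIME answer sheets use three-digit entries.
    """
    s = text.strip()
    if not (s.isascii() and s.isdigit()):
        return None
    value = int(s)
    return value if value <= 999 else None


def score(predictions: Sequence[str], answer_key: Sequence[int]) -> float:
    """Percentage of problems answered correctly, on an assumed 0-100 scale."""
    if len(predictions) != len(answer_key):
        raise ValueError("predictions and answer_key must have the same length")
    # An unparseable prediction yields None, which never equals an int answer.
    correct = sum(parse_answer(p) == a for p, a in zip(predictions, answer_key))
    return 100.0 * correct / len(answer_key)


if __name__ == "__main__":
    # Made-up answers for illustration only; these are not real AIME 2025 answers.
    key = [70, 588, 16]
    preds = ["70", " 588 ", "seventeen"]
    print(round(score(preds, key), 1))
```

Exact-match grading on the final integer keeps scoring unambiguous, which is one reason this answer format suits automated evaluation.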
Why It Matters
AIME 2025 is particularly valuable because the problems are recent enough to limit the risk of data contamination. Its difficulty (only roughly the top 5% of AMC 12 scorers qualify to sit the exam) makes it an excellent discriminator of frontier model reasoning.
Limitations
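With only 15 problems, scores have high variance: on the 0–100 scale, a single problem is worth about 6.7 points. Grading only the final integer answer ignores the reasoning process, so a flawed derivation that happens to reach the right number earns full credit. The problems are written in competition style and do not represent real-world applications of mathematics.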
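To give a rough sense of how noisy a 15-problem score is, the sketch below computes a binomial standard error. It assumes, as a simplification, that problems are independent and equally likely to be solved, which real AIME problems are not; the function name `score_standard_error` is hypothetical, and the 87.3 average score is used only as an illustrative accuracy.

```python
import math


def score_standard_error(accuracy: float, n_problems: int = 15) -> float:
    """Binomial standard error of a 0-100 score.

    Assumes independent problems with the same solve probability `accuracy`
    (a simplification made for illustration).
    """
    if not 0.0 <= accuracy <= 1.0:
        raise ValueError("accuracy must be between 0 and 1")
    return 100.0 * math.sqrt(accuracy * (1.0 - accuracy) / n_problems)


# Each problem moves the score by 100 / 15 points on the 0-100 scale.
points_per_problem = 100.0 / 15

if __name__ == "__main__":
    print(f"points per problem: {points_per_problem:.2f}")
    print(f"standard error at 87.3% accuracy: {score_standard_error(0.873):.1f}")
```

Under these assumptions the standard error at 87.3% accuracy is roughly 8.6 points, more than the value of a single problem, so small gaps between models on a single run should be read with caution.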