AI Math Performance Jumps from 2% to 40% in Just Months

What Happened Epoch AI’s Frontier Math benchmark has become an unexpected showcase for the explosive pace of AI development. When the non-profit research organization quietly released this standardized test in November 2024, state-of-the-art AI models could solve less than 2% of its challenging mathematical problems. Today, the landscape has transformed dramatically. The best publicly available AI models are now solving over 40% of Frontier Math’s original 300 problems (tiers 1-3), which span from advanced undergraduate to early graduate-level mathematics.

Read more →