OpenAI just won gold at the world's most prestigious math competition. Here's why that's a big deal.

OpenAI just won gold at the world's most prestigious math competition. Here's why that's a big deal.

In a groundbreaking achievement, OpenAI's latest experimental reasoning model has secured gold medal-level performance at the International Math Olympiad (IMO), one of the world's most revered mathematics competitions. Alexander Wei, a technical staff member at OpenAI, proudly announced this milestone on social media, highlighting that this accomplishment addresses a long-standing challenge in artificial intelligence. The IMO, which began in 1959 in Romania, is renowned for its difficulty. The competition spans two days, with participants tackling a rigorous four-and-a-half-hour exam that poses three complex problems. Notable past winners include Grigori Perelman, known for his contributions to geometry, and Terence Tao, a Fields Medal laureate considered one of the foremost mathematicians alive today. In a recent podcast, Tao expressed skepticism about AI's prospects in the IMO, suggesting that researchers might want to set their sights lower than this elite competition. Despite these doubts, OpenAI's model successfully solved five out of the six problems presented, operating under the same conditions as human competitors. Noam Brown, a colleague of Wei, commented on the model's extraordinary endurance during the exam, noting that IMO problems require a level of sustained creative thought that surpasses previous benchmarks. He stated, "This model thinks for a long time," emphasizing its advanced capabilities. Wei characterized the model as a significant upgrade in general intelligence, claiming it is "breaking new ground in general-purpose reinforcement learning." In contrast to DeepMind's AlphaGeometry, which is specifically tailored for mathematical tasks, OpenAI's model represents a broader pursuit of general intelligence. OpenAI's CEO, Sam Altman, also reflected on the model's accomplishment, stating that when the organization was founded, such a feat seemed like a distant dream. However, he acknowledged that a model showcasing this level of capability will not be accessible to the public for several months. This achievement illustrates the rapid advancements in AI technology. Just a year ago, AI labs were assessing models based on elementary school math. Tech entrepreneur Peter Thiel previously predicted that it would take at least three more years for AI to tackle problems from the US Math Olympiad. Despite the excitement, some experts remain cautious. Gary Marcus, a prominent AI critic, labeled the model's performance as "genuinely impressive" but raised questions about the training methods, the true extent of its general intelligence, practical applications for the broader public, and the costs associated with each problem. He also noted that the IMO has yet to independently verify these results.

Sources : Business Insider

Published On : Jul 19, 2025, 22:35

AI
Anthropic Eyes Major Funding Boost Valued at $170 Billion Amid Changing Strategies

Anthropic is currently in negotiations to secure between $3 billion and $5 billion in a funding round led by Iconiq Capi...

CNBC | Jul 29, 2025, 19:15
Anthropic Eyes Major Funding Boost Valued at $170 Billion Amid Changing Strategies
Gaming
Memes: The Modern Evolution of Comics in the Digital Age

The Internet has undeniably transformed the landscape of cartooning, providing artists with innovative tools and new ave...

Ars Technica | Jul 29, 2025, 19:25
Memes: The Modern Evolution of Comics in the Digital Age
Computing
EU's Stance on Big Tech Payments Remains Unclear Amid US Claims

Recent statements from the White House suggested that the European Union has decided to abandon a contentious proposal r...

Ars Technica | Jul 29, 2025, 19:55
EU's Stance on Big Tech Payments Remains Unclear Amid US Claims
Startups
Harnessing Plasma: SOSV's Ambitious Plan to Revolutionize Multiple Industries

Investors at SOSV are making bold moves, believing that plasma technology could redefine various sectors, from energy to...

TechCrunch | Jul 29, 2025, 18:35
Harnessing Plasma: SOSV's Ambitious Plan to Revolutionize Multiple Industries
Cybersecurity
Google Denies Receiving UK Backdoor Demands for User Data

The UK government appears to be retracting its earlier request for Apple to create a covert backdoor that would grant au...

TechCrunch | Jul 29, 2025, 20:30
Google Denies Receiving UK Backdoor Demands for User Data
View All News