Google Deep Mind makes AI history with gold medal win at world’s toughest math competition

Google Deep Mind makes AI history with gold medal win at world’s toughest math competition

Google DeepMind has achieved a groundbreaking milestone with its enhanced Gemini AI model, which recently earned a gold medal at the prestigious International Mathematical Olympiad (IMO). This remarkable performance marks the first time an AI has received an official gold-level rating from the competition organizers, solving five out of six complex mathematical problems. The success showcases significant advancements in AI reasoning abilities, positioning Google at the forefront of the ongoing competition among tech giants vying to develop next-generation artificial intelligence. Notably, this achievement illustrates AI's capability to address intricate mathematical challenges through natural language comprehension, eliminating the need for specialized programming languages. Demis Hassabis, CEO of Google DeepMind, proudly shared the news on social media, stating, “Official results are in — Gemini achieved gold-medal level in the International Mathematical Olympiad! An advanced version was able to solve 5 out of 6 problems. Incredible progress.” The IMO, which has been held annually since 1959, is recognized as the world's foremost mathematics competition for pre-university students. Each participating nation sends a team of six elite young mathematicians to tackle six exceptionally challenging problems that span various mathematical disciplines, including algebra, geometry, and number theory. Historically, only about 8% of human participants earn gold medals. Google's latest achievement significantly surpasses its 2024 performance, where its earlier AI systems, Alpha Proof and Alpha Geometry, only managed to secure silver medal status by solving four problems. This year’s innovative breakthrough stemmed from the Gemini Deep Think system, which employs a method known as “parallel thinking.” Unlike traditional AI models that follow a single reasoning path, Deep Think explores multiple potential solutions concurrently before finalizing an answer. Hassabis elaborated on the system's capabilities, stating that it operated entirely in natural language, generating rigorous mathematical proofs directly from the problem statements within the competition's strict 4.5-hour limit. The model earned an impressive 35 out of a possible 42 points, comfortably surpassing the threshold for a gold medal. The solutions were described by IMO President Prof. Dr. Gregor Dolinar as “astonishing in many respects,” highlighting their clarity and precision. Against a backdrop of increasing scrutiny within the AI sector regarding competitive practices and transparency, Google DeepMind's careful approach to announcing its results has received commendation, particularly when compared to rival OpenAI's handling of similar achievements. Hassabis mentioned that the timing of their announcement was in respect of the IMO Board’s request for all AI labs to share results only after independent verification. Criticism has been directed at OpenAI for announcing its own mathematical performance prematurely, without adhering to the official evaluation process. Social media commentators have noted the contrast in approaches, praising Google for its integrity and alignment with ethical standards. DeepMind's triumph is attributed to innovative training techniques that extend beyond conventional methods, utilizing advanced reinforcement learning to enhance multi-step reasoning and theorem-proving capabilities. This model benefited from access to a curated database of high-quality mathematical solutions and received tailored guidance for IMO-style problems. AI experts have recognized the broader implications of this achievement, noting that it signifies a shift from rote memorization to true cognitive understanding in AI. The model's adeptness was particularly highlighted in one problem where many human competitors resorted to advanced mathematical concepts. According to DeepMind researcher Junehyuk Jung, Gemini “made a brilliant observation and utilized only elementary number theory to devise a self-contained proof,” showcasing a more elegant solution than its human counterparts. As the AI industry remains competitive, Google plans to provide a version of the Deep Think model for mathematicians to test before making it available to subscribers of their premium Google AI Ultra service. The timing of this announcement underscores the ongoing rivalry among major AI laboratories, as various companies continue to unveil new capabilities, albeit some of which have faced backlash. This victory at the mathematical olympiad is not merely a matter of competitive pride. Gemini's performance indicates that AI systems are now capable of matching human-level reasoning in complex tasks that require creativity and abstract thinking. This evolution from needing specialized programming languages to functioning entirely in natural language suggests that AI is becoming increasingly intuitive and accessible. Businesses may soon leverage these advanced analytical capabilities without the need for domain-specific expertise, potentially democratizing access to sophisticated problem-solving tools across various industries. However, challenges remain in applying these reasoning skills to the more chaotic complexities of real-world scenarios. Looking ahead, Google DeepMind aims to return to next year's IMO with aspirations for a perfect score, believing that AI combining natural language fluency with rigorous reasoning will be invaluable for mathematicians, scientists, and researchers alike. Yet, a telling aspect of this competition was that when confronted with the toughest problem, Gemini began with an incorrect assumption and did not recover. Ultimately, only five human students successfully solved that particular problem, serving as a reminder that even gold medal-winning AI has much to learn from its human counterparts.

Sources : VentureBeat

Published On : Jul 22, 2025, 23:05

Computing
Meta and Nvidia Forge Major Alliance to Transform AI Infrastructure

Meta is significantly enhancing its partnership with Nvidia through a groundbreaking agreement described as 'multigenera...

Business Insider | Feb 18, 2026, 22:45
Meta and Nvidia Forge Major Alliance to Transform AI Infrastructure
AI
India Poised to Lead AI Revolution, Says DeepMind CEO at 2026 Summit

During the India AI Impact Summit 2026, Demis Hassabis, the CEO of Google DeepMind, expressed strong confidence that Ind...

Business Today | Feb 19, 2026, 07:30
India Poised to Lead AI Revolution, Says DeepMind CEO at 2026 Summit
AI
Sundar Pichai Highlights AI's Transformative Power at India AI Impact Summit 2026

At the India AI Impact Summit 2026 held at Bharat Mandapam, Google and Alphabet CEO Sundar Pichai underscored the monume...

Business Today | Feb 19, 2026, 06:00
Sundar Pichai Highlights AI's Transformative Power at India AI Impact Summit 2026
Startups
Etsy Transfers Depop to eBay in $1.2 Billion Deal

In a significant move, Etsy has announced the sale of its secondhand clothing platform, Depop, to eBay for a staggering ...

TechCrunch | Feb 18, 2026, 23:25
Etsy Transfers Depop to eBay in $1.2 Billion Deal
Gadgets
China's Robotic Revolution: A Dazzling Display of Kung Fu and Innovation

During the recent Lunar New Year celebrations, China showcased its burgeoning robotics industry in a spectacular fashion...

Business Insider | Feb 19, 2026, 05:25
China's Robotic Revolution: A Dazzling Display of Kung Fu and Innovation
View All News