OpenAI solves IMO math tasks better than most humans
The math world just witnessed a historic event. An experimental reasoning model from OpenAI has solved math tasks from the International Mathematical Olympiad (IMO) at a gold medal level. Link in the description. Although the exact model name hasn’t been disclosed, it is known that it hasn’t been released yet and it’s not GPT-5.
The AI system successfully solved 5 out of 6 math tasks. The evaluation followed the same rules used for human participants: the model had 9 hours to think, no internet access, and was required to provide fully reasoned proofs in natural language.
In total, the AI scored 35 out of a possible 42 points — enough for a solid gold medal. No AI model had ever achieved such impressive results at the Math Olympiad before.
Interestingly, researchers at Google DeepMind were also ready to announce their own model’s success on the same IMO math tasks, also at gold medal level. However, they had to wait for marketing approval, so their official announcement is expected later this week. Meanwhile, OpenAI CEO Sam Altman has already declared the achievement and received widespread recognition.
Autor: AIvengo
For 5 years I have been working with machine learning and artificial intelligence. And this field never ceases to amaze, inspire and interest me.
Latest News
Grok 4 Fast operates 10x faster with 2 million token contextGrok 4 Fast enters the AI arena! Elon Musk's company introduced a revolutionary update to its flagship model. Available in early access for premium users. According to TestingCatalog, the newcomer functions 10x faster than standard Grok 4. While maintaining all advantages of the full reasoning model.
$200 USB cable transforms into autonomous AI hackerResearchers from Palisade Research created a new cybersecurity threat. A modified USB cable that becomes a conduit for autonomous AI into computer systems. The $200 device contains a programmable microchip that loads a digital agent directly onto the target machine.
xAI lays off 500 annotators for Grok's expert specializationA strategic pivot from xAI is emerging. The company is radically changing its approach to training its Grok language model! Elon Musk's team fired 500 universal annotators in one day. Instead, it's increasing the number of specialized AI tutors by 10 times.
Gemini content review time reduced from 30 to 15 minutesAlarming signals from Google's internal kitchen were published by The Guardian. Content evaluators for the Gemini model shared interesting information about declining review standards. Employees of contractor GlobalLogic, responsible for assessing quality and safety of AI responses before release, are sounding alarms.