The second batch of “First Proof” problems is meant to evaluate AI’s usefulness for research-level math. The best model got ...
The closest the field has come to solving the planar unit distance problem, first proposed in the 1940s, was in 1984. Now, OpenAI claims an internal model has cracked the puzzle.
Unlock the full InfoQ experience by logging in! Stay updated with your favorite authors and topics, engage with content, and download exclusive resources. Dany Lepage discusses the architectural ...
24-year-old founder and CEO Carina Hong created Axiom Math in March 2025 and has recruited a team of ten employees, most of whom are from Meta, to build a math-focused AI model. Last fall, Carina Hong ...
Chinese AI lab DeepSeek has quietly updated Prover, its AI model that’s designed to solve math-related proofs and theorems. According to South China Morning Post, DeepSeek uploaded the latest version ...
Mathematician Will Sawin discusses his experience reviewing and refining a mathematical proof devised by OpenAI's internal ...
This study introduces MathEval, a comprehensive benchmarking framework designed to systematically evaluate the mathematical reasoning capabilities of large language models (LLMs). Addressing key ...
A Google DeepMind researcher and OpenAI’s former CTO are posing questions about the validity of OpenAI’s claim about its gold-medal score. OpenAI’s latest model has achieved a gold-level score at the ...