Base Models for Math - Search News

AI scores a ‘C–’ on its hardest math test yet

The second batch of “First Proof” problems is meant to evaluate AI’s usefulness for research-level math. The best model got ...

Live Science

OpenAI's internal AI model just solved an 80-year-old math problem, and mathematicians verified it

The closest the field has come to solving the planar unit distance problem, first proposed in the 1940s, was in 1984. Now, OpenAI claims an internal model has cracked the puzzle.

InfoQ

Alibaba Releases Two Open-Weight Language Models for Math and Voice Chat

Forbes

Meet The Stanford Dropout Building An AI To Solve Math’s Hardest Problems—And Create Harder Ones

24-year-old founder and CEO Carina Hong created Axiom Math in March 2025 and has recruited a team of ten employees, most of whom are from Meta, to build a math-focused AI model. Last fall, Carina Hong ...

TechCrunch

DeepSeek upgrades its math-focused AI model Prover

Chinese AI lab DeepSeek has quietly updated Prover, its AI model that’s designed to solve math-related proofs and theorems. According to South China Morning Post, DeepSeek uploaded the latest version ...

An OpenAI Model ‘Disproved’ a Famous Math Conjecture. This Mathematician Couldn’t Leave It Alone

Mathematician Will Sawin discusses his experience reviewing and refining a mathematical proof devised by OpenAI's internal ...

EurekAlert!

MathEval: a comprehensive benchmark for evaluating large language models on mathematical reasoning capabilities

This study introduces MathEval, a comprehensive benchmarking framework designed to systematically evaluate the mathematical reasoning capabilities of large language models (LLMs). Addressing key ...

TechRepublic

OpenAI Model Wins Gold at International Mathematical Olympiad – or Did It?

A Google DeepMind researcher and OpenAI’s former CTO are posing questions about the validity of OpenAI’s claim about its gold-medal score. OpenAI’s latest model has achieved a gold-level score at the ...
