AI Systems Solve Erdős Problem Without Specialized Training
A non-mathematician used ChatGPT to solve Erdős problem #1196. The solution employed an approach that differed from prior human attempts. Researchers at OpenAI and Google DeepMind reported continued progress on longer proofs.
forbes.comLiam Price, who has no formal mathematics training, used ChatGPT last month to solve Erdős problem #1196. The problem, posed in 1966, concerns primitive sets of whole numbers in which no number divides another. Price collaborated with Cambridge undergraduate Kevin Barreto on earlier Erdős problems.
Mathematician Jared Duker Lichtman at Stanford University posted that the solution used a strategy no prior human solver had considered. Terence Tao at the University of California, Los Angeles, noted that GPT solved the problem in its original formulation rather than converting it to probability language.
Daniel Litt at the University of Toronto described the result as reasonably interesting.
Bubeck at OpenAI stated that a year earlier researchers expected large language models to remain limited to their training data. Thang Luong, who leads the Superhuman Reasoning team at Google DeepMind, said models tested internally can now produce proofs up to ten pages.
Current public models remain limited to proofs of three or four pages. Lauren Williams at Harvard University said human referees already face heavy workloads evaluating mathematics papers and that AI-generated submissions are increasing that burden.
She added that models can produce outputs that appear convincing but require substantial time to verify. Luong said scaling compute and improving algorithmic efficiency are expected to extend proof length further. He noted that reaching one-hundred-page proofs is not currently possible but remains a stated goal.
Key Facts
Story Timeline
3 events- 1966
Paul Erdős posed problem #1196 on primitive sets.
1 source@Nature - May 2026
Liam Price used ChatGPT to solve Erdős problem #1196.
1 source@Nature - May 2026
Jared Duker Lichtman posted comparison to novel chess opening.
1 source@Nature
Potential Impact
- 01
Mathematics journal editors may receive more AI-generated submissions requiring verification time.
- 02
Researchers may test general-purpose language models on additional unsolved problems.
Transparency Panel
Related Stories
thesouthafrican.comSouth African Researchers Develop Quantum and AI Tools for Cybersecurity
Scientists and startup companies in South Africa are applying quantum communication and AI-powered tools to address rising global cyber threats. The work focuses on strengthening data protection methods.
France 24EU Discusses Readiness for Artificial Intelligence Changes
A France 24 program examined whether European Union policies can address the effects of artificial intelligence. The discussion covered potential impacts across daily life and economic sectors.
reason.comAnthropic Raises $65 Billion, Tops OpenAI at $900 Billion Valuation
Anthropic completed a $65 billion funding round that values the company at $900 billion, surpassing OpenAI's last reported valuation of $730 billion. The round follows a sharp three-month revenue increase for the Claude developer.