Substrate
ai

AI Systems Solve Erdős Problem Without Specialized Training

A non-mathematician used ChatGPT to solve Erdős problem #1196. The solution employed an approach that differed from prior human attempts. Researchers at OpenAI and Google DeepMind reported continued progress on longer proofs.

NA
1 source·May 23, 10:37 AM(6 days ago)·1m read
|
AI Systems Solve Erdős Problem Without Specialized Trainingforbes.com
Audio version
Tap play to generate a narrated version.

Liam Price, who has no formal mathematics training, used ChatGPT last month to solve Erdős problem #1196. The problem, posed in 1966, concerns primitive sets of whole numbers in which no number divides another. Price collaborated with Cambridge undergraduate Kevin Barreto on earlier Erdős problems.

Mathematician Jared Duker Lichtman at Stanford University posted that the solution used a strategy no prior human solver had considered. Terence Tao at the University of California, Los Angeles, noted that GPT solved the problem in its original formulation rather than converting it to probability language.

Daniel Litt at the University of Toronto described the result as reasonably interesting.

Bubeck at OpenAI stated that a year earlier researchers expected large language models to remain limited to their training data. Thang Luong, who leads the Superhuman Reasoning team at Google DeepMind, said models tested internally can now produce proofs up to ten pages.

Current public models remain limited to proofs of three or four pages. Lauren Williams at Harvard University said human referees already face heavy workloads evaluating mathematics papers and that AI-generated submissions are increasing that burden.

She added that models can produce outputs that appear convincing but require substantial time to verify. Luong said scaling compute and improving algorithmic efficiency are expected to extend proof length further. He noted that reaching one-hundred-page proofs is not currently possible but remains a stated goal.

Key Facts

Erdős problem #1196
Solved by ChatGPT without probability rephrasing
Proof length limit
Current models reach three to four pages
Internal Google models
Reported to reach ten-page proofs
Fields Medal goal
Luong hopes for joint AI-mathematician win by 2030

Story Timeline

3 events
  1. 1966

    Paul Erdős posed problem #1196 on primitive sets.

    1 source@Nature
  2. May 2026

    Liam Price used ChatGPT to solve Erdős problem #1196.

    1 source@Nature
  3. May 2026

    Jared Duker Lichtman posted comparison to novel chess opening.

    1 source@Nature

Potential Impact

  1. 01

    Mathematics journal editors may receive more AI-generated submissions requiring verification time.

  2. 02

    Researchers may test general-purpose language models on additional unsolved problems.

Transparency Panel

Sources cross-referenced1
Confidence score75%
Synthesized bySubstrate AI
Word count234 words
PublishedMay 23, 2026, 10:37 AM
Bias signals removed2 across 1 outlet
Signal Breakdown
Loaded 1Speculative 1

Related Stories

South African Researchers Develop Quantum and AI Tools for Cybersecuritythesouthafrican.com
ai31 min agoDeveloping

South African Researchers Develop Quantum and AI Tools for Cybersecurity

Scientists and startup companies in South Africa are applying quantum communication and AI-powered tools to address rising global cyber threats. The work focuses on strengthening data protection methods.

Reuters
1 source
EU Discusses Readiness for Artificial Intelligence ChangesFrance 24
ai4 hrs agoDeveloping

EU Discusses Readiness for Artificial Intelligence Changes

A France 24 program examined whether European Union policies can address the effects of artificial intelligence. The discussion covered potential impacts across daily life and economic sectors.

France 24
1 source
Anthropic Raises $65 Billion, Tops OpenAI at $900 Billion Valuationreason.com
ai22 hrs agoDeveloping

Anthropic Raises $65 Billion, Tops OpenAI at $900 Billion Valuation

Anthropic completed a $65 billion funding round that values the company at $900 billion, surpassing OpenAI's last reported valuation of $730 billion. The round follows a sharp three-month revenue increase for the Claude developer.

cnbc.com
UN
KO
The New York Times
MarketWatch
5 sources