Close Menu
  • Home
  • Aerospace & Defense
    • Automation & Process Control
      • Automotive & Transportation
  • Banking & Finance
    • Chemicals & Materials
    • Consumer Goods & Services
  • Economy
    • Electronics & Semiconductor
  • Energy & Resources
    • Food & Beverage
    • Hospitality & Tourism
    • Information Technology
  • Agriculture
What's Hot

Canada’s Mark Carney aims for reset with important China visit | Political News

Mira Murati’s startup Thinking Machines Lab loses two co-founders to OpenAI

Venezuela’s Rodriguez vows to release more prisoners, talks by phone with President Trump | Nicolás Maduro News

Facebook X (Twitter) Instagram
USA Business Watch – Insightful News on Economy, Finance, Politics & Industry
  • Home
  • Aerospace & Defense
    • Automation & Process Control
      • Automotive & Transportation
  • Banking & Finance
    • Chemicals & Materials
    • Consumer Goods & Services
  • Economy
    • Electronics & Semiconductor
  • Energy & Resources
    • Food & Beverage
    • Hospitality & Tourism
    • Information Technology
  • Agriculture
  • Home
  • About Us
  • Market Research Reports and Company
  • Contact us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
USA Business Watch – Insightful News on Economy, Finance, Politics & Industry
Home » AI models are starting to decipher high-level math problems
Information Technology

AI models are starting to decipher high-level math problems

Bussiness InsightsBy Bussiness InsightsJanuary 14, 2026No Comments4 Mins Read
Share Facebook Twitter Pinterest Copy Link Telegram LinkedIn Tumblr Email
Share
Facebook Twitter LinkedIn Pinterest Email


Neel Somani, a software engineer, former quantitative researcher, and startup founder, was testing the math skills of OpenAI’s new models last weekend when he made an unexpected discovery. After pasting the problem into ChatGPT and letting it think for 15 minutes, I came back with a complete solution. He evaluated the proof and formalized it using a tool called Harmonic, and everything went well.

“I was interested in establishing a baseline for when LLMs can effectively solve unsolved math problems compared to when they are struggling,” Somani said. What surprised me was that Frontier started to move forward little by little with the latest model.

ChatGPT’s chain of thought is even more impressive, rattling off mathematical axioms such as Legendre’s formula, Bertrand’s postulate, and the Star of David theorem. Eventually, the model found a 2013 Math Overflow post. There, Harvard mathematician Noam Elkies had an elegant solution to a similar problem. However, ChatGPT’s final proof differed from Elkies’ work in important ways and provided a more complete solution to the version of the problem posed by legendary mathematician Paul Erdős. His vast collection of unsolved problems has become a testing ground for AI.

For machine intelligence skeptics, this is a surprising result, but it’s not the only one. From formalization-oriented LLMs like Harmonic’s Aristotle to literature review tools like OpenAI’s Deep Research, AI tools are widespread in mathematics. But since the release of GPT 5.2, which Somani says is “anecdotally more proficient at mathematical reasoning than previous versions,” it has become difficult to ignore the sheer volume of problems solved, raising new questions about the ability of large-scale language models to push the frontiers of human knowledge.

Mr. Somani was paying attention to the Erdos issue. Erdos Problems is a set of over 1,000 conjectures by Hungarian mathematicians maintained online. These problems vary widely in both subject matter and difficulty, making them attractive targets for AI-driven mathematics. The first batch of autonomous solutions was delivered in November with a Gemini-powered model called AlphaEvolve. But recently, Somani and colleagues discovered that GPT 5.2 is very good at high-level mathematics.

Since Christmas, 15 issues have been changed from “open” to “resolved” on the Erdos website, with 11 of the resolutions specifically acknowledging that an AI model is involved in the process.

Respected mathematician Terence Tao offers a more nuanced analysis of the progress on his GitHub page, counting eight different cases where AI models have made meaningful autonomous progress on the Erdos problem, and six other cases where they have discovered and built on prior research. Although we have a long way to go before AI systems can perform mathematics without human intervention, it is clear that large-scale models have an important role to play.

tech crunch event

san francisco
|
October 13-15, 2026

Regarding Mastodon, Tao speculates that the scalable nature of AI systems makes them well-suited to “systematically apply to the ‘long tail’ of Erdos problems, many of which actually have simple solutions.”

“Many of these simple Erdos problems are therefore more likely to be solved by purely AI-based methods than by human or hybrid means,” Tao continued.

Another driver is the recent move toward formalization, a labor-intensive task that facilitates the validation and extension of mathematical reasoning. Formalization does not require the use of AI or computers, but the advent of new automated tools has made the process much easier. Lean, an open source “proof assistant” developed at Microsoft Research in 2013, has become widely used in the field as a way to formalize proofs, and AI tools like Harmonic’s Aristotle are expected to automate much of the formalization work.

For Harmonic founder Tudor Achim, the fact that Erdos’ problem was suddenly solved is less important than the fact that the world’s greatest mathematicians are starting to take these tools seriously. “I’m more concerned about the fact that math and computer science professors are using it. [AI tools]”These people have reputations to protect, so when they say they’re using Aristotle or they’re using ChatGPT, that’s real evidence,” Achim said.



Source link

Follow on Google News Follow on Flipboard
Share. Facebook Twitter Pinterest LinkedIn Tumblr Email Copy Link
Previous ArticleBanks bet on being able to circumvent price controls
Next Article Digg unveils new Reddit rival to the public
Bussiness Insights
  • Website

Related Posts

Mira Murati’s startup Thinking Machines Lab loses two co-founders to OpenAI

January 15, 2026

California authorities launch investigation, Musk denies knowledge of Grok’s images of minors

January 15, 2026

Bandcamp takes action against AI music, bans it from platform

January 14, 2026
Leave A Reply Cancel Reply

Latest Posts

‘Hardest year’ for British arable farming, with Frontier profits more than halved

Up to 90 jobs at risk as Muller restructures Skelmersdale dairy facility

‘Farmers need to be treated better’ in new rail plan

Scottish farmers face flat funding again this year under Budget 2026-27

Latest Posts

Boeing will surpass Airbus’ sales in 2025 for the first time since 2018

January 13, 2026

Delta Air Lines (DAL) 2025 Q4 Earnings

January 13, 2026

Greenland and Venezuela crises accelerate huge spending in Europe’s war economy

January 13, 2026

Subscribe to News

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Recent Posts

  • Canada’s Mark Carney aims for reset with important China visit | Political News
  • Mira Murati’s startup Thinking Machines Lab loses two co-founders to OpenAI
  • Venezuela’s Rodriguez vows to release more prisoners, talks by phone with President Trump | Nicolás Maduro News
  • California authorities launch investigation, Musk denies knowledge of Grok’s images of minors
  • FTC’s data sharing order against GM finally settled

Recent Comments

  1. one_jdpa on Hundreds gather in Barcelona to protest overtourism in southern Europe
  2. Salvatore Carnevale on Connect category management to the shopper experience
  3. Jerold Lush on Connect category management to the shopper experience
  4. FrankMoone on 100% tariffs on Trump’s drugs: What we know | Donald Trump News
  5. remont_pcka on Hundreds gather in Barcelona to protest overtourism in southern Europe

Welcome to USA Business Watch – your trusted source for real-time insights, in-depth analysis, and industry trends across the American and global business landscape.

At USABusinessWatch.com, we aim to inform decision-makers, professionals, entrepreneurs, and curious minds with credible news and expert commentary across key sectors that shape the economy and society.

Facebook X (Twitter) Instagram Pinterest YouTube

Subscribe to Updates

Subscribe to our newsletter and never miss our latest news

Subscribe my Newsletter for New Posts & tips Let's stay updated!

Archives

  • January 2026
  • December 2025
  • November 2025
  • October 2025
  • September 2025
  • August 2025
  • July 2025
  • June 2025
  • March 2022
  • January 2021

Categories

  • Aerospace & Defense
  • Agriculture
  • Automation & Process Control
  • Automotive & Transportation
  • Banking & Finance
  • Chemicals & Materials
  • Consumer Goods & Services
  • Economy
  • Economy
  • Electronics & Semiconductor
  • Energy & Resources
  • Food & Beverage
  • Hospitality & Tourism
  • Information Technology
  • Political
Facebook X (Twitter) Instagram Pinterest
  • Home
  • About Us
  • Market Research Reports and Company
  • Contact us
  • DMCA
  • Privacy Policy
  • Terms & Conditions
© 2026 usabusinesswatch. Designed by usabusinesswatch.

Type above and press Enter to search. Press Esc to cancel.