Back to List
OpenAI Reasoning Model Disproves Longstanding Erdős Conjecture in Discrete Geometry
Research BreakthroughOpenAIMathematicsArtificial Intelligence

OpenAI Reasoning Model Disproves Longstanding Erdős Conjecture in Discrete Geometry

On May 20, 2026, OpenAI announced a major research milestone: an internal general-purpose reasoning model has disproved a central conjecture in discrete geometry. The breakthrough concerns the planar unit distance problem, a question first posed by Paul Erdős in 1946 regarding the maximum number of unit-distance pairs among n points in a plane. For nearly 80 years, mathematicians believed that square grid constructions were optimal for this problem. However, the OpenAI model identified an infinite family of examples providing a polynomial improvement over previous theories. Verified by external mathematicians, this result is particularly significant because it was achieved by a general-purpose model rather than a system specifically trained for mathematics, signaling a new era for AI in frontier scientific research.

Hacker News

Key Takeaways

  • Historical Breakthrough: An OpenAI model has disproved the planar unit distance problem conjecture, a challenge that has occupied mathematicians since Paul Erdős first posed it in 1946.
  • Polynomial Improvement: The model discovered an infinite family of examples that surpass the efficiency of the "square grid" constructions previously thought to be optimal.
  • General-Purpose Success: The proof was generated by a general-purpose reasoning model rather than a specialized mathematical tool or a system scaffolded for proof searching.
  • External Verification: A group of external mathematicians has checked the proof and authored a companion paper to provide context and explain the argument's significance.

In-Depth Analysis

The Planar Unit Distance Problem

The planar unit distance problem is a fundamental question in combinatorial geometry. It asks a deceptively simple question: if you place n points in a plane, what is the maximum number of pairs of points that can be exactly a distance of 1 apart? Despite its simplicity, the problem has proven remarkably difficult to resolve. In the 2005 book Research Problems in Discrete Geometry, authors Brass, Moser, and Pach described it as "possibly the best known (and simplest to explain) problem in combinatorial geometry."

For decades, the mathematical community, including Paul Erdős himself, suspected that square grid constructions were essentially the optimal way to maximize these unit-distance pairs. Erdős, who considered this one of his favorite problems, even offered a monetary prize for its resolution. The OpenAI model's discovery of an infinite family of examples that provide a polynomial improvement fundamentally changes the understanding of this 80-year-old problem.

A New Paradigm for Mathematical Discovery

Perhaps as significant as the mathematical result itself is the method by which it was found. The proof did not come from a specialized AI trained specifically for mathematics or a system designed to search through known proof strategies. Instead, it originated from a new general-purpose reasoning model. This model was evaluated on a collection of Erdős problems as part of a broader effort to determine if advanced AI can contribute to frontier research.

This achievement suggests that general-purpose reasoning capabilities are reaching a level where they can tackle open problems in pure science. The model produced a complete proof that was robust enough to be verified by leading external mathematicians, such as Noga Alon of Princeton. The transition from AI as a supportive tool to AI as a primary discoverer of new mathematical truths marks a shift in how frontier research may be conducted in the future.

Industry Impact

The implications of this breakthrough for the AI industry are profound. First, it demonstrates that the development of general-purpose reasoning models is yielding results that exceed the capabilities of specialized, domain-specific systems in certain high-level tasks. This validates the industry's current focus on scaling general reasoning as a path toward solving complex scientific problems.

Second, this milestone moves AI evaluation beyond standard benchmarks and into the realm of "frontier research." By solving a problem that has remained open for nearly 80 years, OpenAI has provided a concrete example of AI's utility in expanding the boundaries of human knowledge. This is likely to accelerate the integration of AI models into academic and industrial research workflows, particularly in fields like geometry, combinatorics, and theoretical physics where complex structural conjectures are common.

Frequently Asked Questions

Question: What is the significance of the "square grid" in this context?

For nearly 80 years, mathematicians believed that arranging points in a square grid was the most efficient way to maximize the number of unit-distance pairs. The OpenAI model disproved this by finding a different, more efficient family of examples.

Question: Was the AI specifically designed to solve geometry problems?

No. OpenAI stated that the proof came from a general-purpose reasoning model. It was not specifically trained for mathematics or targeted at the unit distance problem, but was tested on a variety of Erdős problems to evaluate its research capabilities.

Question: How do we know the proof is correct?

The proof has been checked and verified by a group of external mathematicians. They have also produced a companion paper that explains the logic of the argument and provides further background on its significance.

Related News

Meituan LongCat Team Unveils WBench: The First Systematic Multi-Round Benchmark for Interactive Video World Models
Research Breakthrough

Meituan LongCat Team Unveils WBench: The First Systematic Multi-Round Benchmark for Interactive Video World Models

The Meituan LongCat team has officially introduced and open-sourced WBench, a pioneering evaluation benchmark designed to measure the capabilities of interactive video world models. As the first systematic framework for multi-round interaction assessment, WBench serves as a diagnostic tool—likened to a 'CT scanner'—to identify the specific technical hurdles AI models face when transitioning from passive observation to active, multi-stage interaction. By testing models across diverse scenarios ranging from lunar environments to futuristic urban settings, WBench establishes a new standard for defining the boundaries of world models. This release marks a significant step in providing the AI research community with the tools necessary to pinpoint and resolve the bottlenecks currently limiting the development of truly interactive artificial intelligence.

Meituan LongCat Team Releases General 365 Benchmark Revealing Significant Reasoning Gaps in Leading AI Models
Research Breakthrough

Meituan LongCat Team Releases General 365 Benchmark Revealing Significant Reasoning Gaps in Leading AI Models

The Meituan LongCat team has officially introduced General 365, a new benchmark designed to evaluate the reasoning capabilities of large language models (LLMs). In a comprehensive assessment of 26 mainstream models, the results indicate a challenging landscape for current AI technology. Even Gemini 3 Pro, currently regarded as one of the most powerful models available, achieved an accuracy rate of only 62.8%. The benchmark results further reveal that the vast majority of tested models failed to reach a 60% accuracy threshold, which is often considered a basic passing grade. This release by Meituan's technical team establishes a rigorous new standard for measuring AI reasoning, highlighting that most current models still struggle with complex logical tasks.

LARYBench Launch: Defining the ImageNet for Embodied Action Representations and Measuring Generalization from Human Video Data
Research Breakthrough

LARYBench Launch: Defining the ImageNet for Embodied Action Representations and Measuring Generalization from Human Video Data

The Meituan Technical Team has introduced LARYBench (Latent Action Representation Yielding Benchmark), a systematic evaluation framework designed to guide the learning of general latent action representations from large-scale visual data. This benchmark serves as a foundational tool, akin to ImageNet for computer vision, but specifically tailored for embodied intelligence. Experimental results from the benchmark reveal a significant discovery: general vision models demonstrate superior performance in action generalization and control precision compared to specialized action expert models designed specifically for embodied AI. This indicates that sophisticated embodied action representations can emerge naturally from training on extensive human video datasets, suggesting a new pathway for developing robotic control systems through general-purpose visual learning.