Olympiad-Level LLM Reasoning



Frontier AI Insights

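A 30B-A3B model reaches gold-medal level on IPhO, IMO, and USAMO problems using test-time self-verification. Lean formal verification combined with evolutionary search solves open Erdős problems, and an OpenAI reasoning model disproves a 1946 Erdős conjecture. Dan Roy flags what is, to his knowledge, the first large-scale empirical study of formal theorem proving by LLM agents. These results align with ongoing threads on RLVR (reinforcement learning with verifiable rewards) and self-distillation.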

Sources (2)

Updated May 28, 2026


Proofs from THE MACHINE. Retrieval, Recombination, Discovery —…

@roydanroy: To my knowledge, this is the first large-scale empirical study of formal theorem proving by LLM agen...