AI 精选动态
智能评分 62
LLM攻克理论CS开放问题
AI 推荐理由
与以往 LLM 数学研究不同,该方法通过验证器循环解决了长期悬而未决的开放问题,值得关注其对 LLM 推理能力边界的拓展。核心解读
Binghui Peng 团队使用 GPT 5.5 Pro 和 Claude Opus 4.8 构建 prover-verifier 循环,解决了 9 个理论计算机科学开放问题,包括 COLT 和 FOCS 会议上的 5 个以及交换代数中的 4 个,并计划扩展到所有科学领域。
全文
Marc Andreessen 🇺🇸 (@pmarca) 转发了 Omri Weinstein (@WeinsteinOmri) 的帖子:
Even @OpenAI's recent Erdős breakthrough didn't convince me that LLMs can do general math research. This changed my mind..
Using a clever 'prover-verifier' LLM loop, this harness solved 9 substantial open problems in Theoretical CS, including one that kept me up at night for 2 years.
Incredible work by my former Columbia collaborator @binghuip, @runzhou_tao, Steven Wang & @HantaoYu_Theory.
The plan is to expand this to ALL fields of science. Stay tuned.
> **引用原帖 Binghui Peng (@binghuip):**
> [1/n] Recent OpenAI research has demonstrated the ability of LLMs to solve frontier problems in mathematics. We design a simple pipeline (using GPT 5.5 Pro and Claude Opus 4.8) that resolves 9 challenging open problems, including open problems from prominent theoretical computer science venues—4 from COLT open problem list and 1 from FOCS —as well as 4 problems from the commutative algebra.
> Project link: https://t.co/YCBzYjfz3N, joint work with @runzhou_tao, Steven Wang & @HantaoYu_Theory
> https://x.com/binghuip/status/2070756087998152855