返回精选
AI 精选动态 智能评分 62

LLM攻克理论CS开放问题

来源: twitter关注列表
作者: Marc Andreessen 🇺🇸 (@pmarca)
发布于: 2026-06-30
收录于: 2026-06-30
AI 推荐理由
与以往 LLM 数学研究不同,该方法通过验证器循环解决了长期悬而未决的开放问题,值得关注其对 LLM 推理能力边界的拓展。
核心解读
Binghui Peng 团队使用 GPT 5.5 Pro 和 Claude Opus 4.8 构建 prover-verifier 循环,解决了 9 个理论计算机科学开放问题,包括 COLT 和 FOCS 会议上的 5 个以及交换代数中的 4 个,并计划扩展到所有科学领域。
全文
Marc Andreessen 🇺🇸 (@pmarca) 转发了 Omri Weinstein (@WeinsteinOmri) 的帖子: Even @OpenAI's recent Erdős breakthrough didn't convince me that LLMs can do general math research. This changed my mind.. Using a clever 'prover-verifier' LLM loop, this harness solved 9 substantial open problems in Theoretical CS, including one that kept me up at night for 2 years. Incredible work by my former Columbia collaborator @binghuip, @runzhou_tao, Steven Wang & @HantaoYu_Theory. The plan is to expand this to ALL fields of science. Stay tuned. > **引用原帖 Binghui Peng (@binghuip):** > [1/n] Recent OpenAI research has demonstrated the ability of LLMs to solve frontier problems in mathematics. We design a simple pipeline (using GPT 5.5 Pro and Claude Opus 4.8) that resolves 9 challenging open problems, including open problems from prominent theoretical computer science venues—4 from COLT open problem list and 1 from FOCS —as well as 4 problems from the commutative algebra. > Project link: https://t.co/YCBzYjfz3N, joint work with @runzhou_tao, Steven Wang & @HantaoYu_Theory > https://x.com/binghuip/status/2070756087998152855
#技术突破#AI模型#研究