AI-curated update | Smart score: 62

LLMs Crack Open Problems in Theoretical CS

Source: Twitter following list

Author: Marc Andreessen 🇺🇸 (@pmarca)

Published: 2026-06-30

Collected: 2026-06-30

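Why the AI recommends this

Unlike earlier LLM math research, this approach uses a prover-verifier loop to resolve long-standing open problems. It is worth watching for how far it pushes the boundary of LLM reasoning.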

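Key takeaways

Binghui Peng's team built a prover-verifier loop on top of GPT 5.5 Pro and Claude Opus 4.8 and resolved 9 open problems: 5 from theoretical computer science venues (4 from the COLT open problem list and 1 from FOCS) and 4 from commutative algebra. According to Omri Weinstein's post, the plan is to expand the approach to all fields of science.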
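The posts do not describe how the pipeline works internally, so the following is only a minimal sketch of what a generic prover-verifier loop might look like. Every name here (`prover_verifier_loop`, `toy_prover`, `toy_verifier`, `max_rounds`) is hypothetical, and the toy functions stand in for model calls; none of this is taken from the project itself.

```python
from typing import Callable, Optional, Tuple

# Hypothetical interfaces (assumptions, not the project's API):
# a prover drafts a proof from the problem plus the latest critique,
# and a verifier returns (accepted, critique) for a given draft.
Prover = Callable[[str, Optional[str]], str]
Verifier = Callable[[str, str], Tuple[bool, str]]


def prover_verifier_loop(
    problem: str,
    prover: Prover,
    verifier: Verifier,
    max_rounds: int = 5,
) -> Tuple[Optional[str], int]:
    """Alternate between drafting and checking until a draft is accepted.

    Returns (accepted_draft, rounds_used), or (None, max_rounds) if no
    draft passes the verifier within the round budget.
    """
    feedback: Optional[str] = None
    for round_no in range(1, max_rounds + 1):
        draft = prover(problem, feedback)
        accepted, critique = verifier(problem, draft)
        if accepted:
            return draft, round_no
        # Feed the critique back so the next draft can address it.
        feedback = critique
    return None, max_rounds


# Toy stand-ins for the two model calls, used only to exercise the loop.
def toy_prover(problem: str, feedback: Optional[str]) -> str:
    if feedback is None:
        return "draft proof with a gap"
    return "revised proof"


def toy_verifier(problem: str, draft: str) -> Tuple[bool, str]:
    if "gap" in draft:
        return False, "step 2 is unjustified"
    return True, ""


result = prover_verifier_loop("toy problem", toy_prover, toy_verifier)
```

In this toy run the first draft is rejected and the second is accepted. A real system would replace the two toy functions with calls to language models and would likely need a much stricter acceptance check than a string match.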

Full text

Marc Andreessen 🇺🇸 (@pmarca) reposted a post by Omri Weinstein (@WeinsteinOmri):

Even @OpenAI's recent Erdős breakthrough didn't convince me that LLMs can do general math research. This changed my mind.. Using a clever 'prover-verifier' LLM loop, this harness solved 9 substantial open problems in Theoretical CS, including one that kept me up at night for 2 years. Incredible work by my former Columbia collaborator @binghuip, @runzhou_tao, Steven Wang & @HantaoYu_Theory. The plan is to expand this to ALL fields of science. Stay tuned.

> **Quoted post from Binghui Peng (@binghuip):**
> [1/n] Recent OpenAI research has demonstrated the ability of LLMs to solve frontier problems in mathematics. We design a simple pipeline (using GPT 5.5 Pro and Claude Opus 4.8) that resolves 9 challenging open problems, including open problems from prominent theoretical computer science venues—4 from COLT open problem list and 1 from FOCS —as well as 4 problems from the commutative algebra.
> Project link: https://t.co/YCBzYjfz3N, joint work with @runzhou_tao, Steven Wang & @HantaoYu_Theory
> https://x.com/binghuip/status/2070756087998152855

#TechBreakthrough #AIModels #Research
