AI 精选动态
智能评分 60
Towards Automating Scientific Review with Google's Paper Assistant Tool
AI 推荐理由
与常规 AI 审稿不同,该工具聚焦客观验证而非接收/拒绝决策,在已知错误检出上显著优于单模型调用,值得关注其后续部署与扩展。核心解读
Google 提出 agentic verification 框架并推出 Paper Assistant Tool,用于自动化科学论文审查。该工具将论文分拆审查,重点检测证明错误、实验漏洞、缺失比较等客观问题。在 STOC 和 ICML 的 author-facing 测试中,工具比单次模型调用发现更多已知错误,多位作者据此修正了理论漏洞或补充了实验。
全文
Big new paper release of Google for external agentic verification for science.
Science now needs AI review agents because AI is making papers faster than humans can check them.
The problem is that AI can help produce more research, but the slow part is still checking whether the work is actually correct.
The paper frames this as verification debt, where every faster research workflow creates more claims, proofs, experiments, and comparisons that someone still has to inspect.
Its main proposal is agentic verification, where AI agents help review papers by splitting them into parts, checking difficult sections deeply, and combining the findings into a review.
Google’s Paper Assistant Tool is the example system, and it focuses on objective checks like proof errors, experimental gaps, missing comparisons, and unclear claims rather than final accept or reject decisions.
The authors tested it on known math and computer science paper errors and in author-facing pilots at STOC and ICML, where authors used it before submission.
The striking result is that Paper Assistant Tool found far more known proof errors than a single model call, and many authors said it led them to fix serious theory gaps or run new experiments.
The big deal is that scientific review may need its own AI stack, with review agents, clear roles, and human oversight, because paper generation is becoming partly automated too.
----
Link – arxiv. org/abs/2606.28277
Title: "Towards Automating Scientific Review with Google's Paper Assistant Tool"
