AI Curated Feed, Smart Score: 60

GLM-5.2 Ties Claude Opus 4.8

Source: Twitter following list

Author: Artificial Analysis (@ArtificialAnlys)

Published: 2026-06-30

Collected: 2026-06-30

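Why AI Recommends This

GLM-5.2 matches Claude Opus 4.8 on physics reasoning while using far fewer resources, so its real-world applicability merits further evaluation.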

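Key Takeaway

On the CritPt benchmark, GLM-5.2 scores 21%, tying Claude Opus 4.8, a model that scores 5 points higher on the overall Artificial Analysis Intelligence Index and costs several times as much per token.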

Full Text

The thinking does pay off where reasoning is important. CritPt is a frontier physics benchmark developed by Argonne and UIUC, with contributions from 60+ researchers globally and applied by Artificial Analysis. On CritPt, GLM-5.2 ties Claude Opus 4.8 outright, both at 21%, matching a model that scores 5 points higher overall in the Artificial Analysis Intelligence Index and costs several times as much per token. ![photo](https://pbs.twimg.com/media/HMFMnnoaMAA96ZE.jpg)

#Models #Analysis #Technology
