返回精选
AI 精选动态 智能评分 60

GLM-5.2持平Claude Opus 4.8

来源: twitter关注列表
作者: Artificial Analysis (@ArtificialAnlys)
发布于: 2026-06-30
收录于: 2026-06-30
AI 推荐理由
GLM-5.2在资源高效的情况下达到了与Claude Opus 4.8相当的物理推理性能,值得进一步评估其实际应用。
核心解读
在CritPt基准上,GLM-5.2取得21%分数,与Claude Opus 4.8持平,后者在总体指数高5分且每token成本数倍。
全文
The thinking does pay off where reasoning is important. CritPt is a frontier physics benchmark developed by Argonne and UIUC, with contributions from 60+ researchers globally and applied by Artificial Analysis. On CritPt, GLM-5.2 ties Claude Opus 4.8 outright, both at 21%, matching a model that scores 5 points higher overall in the Artificial Analysis Intelligence Index and costs several times as much per token. ![photo](https://pbs.twimg.com/media/HMFMnnoaMAA96ZE.jpg)
#模型#分析#技术