AI Curated Updates
Smart score: 60
GLM-5.2 Ties Claude Opus 4.8
Why the AI recommends this
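GLM-5.2 matches Claude Opus 4.8 on physics reasoning while remaining resource-efficient, so its practical applications merit further evaluation.
Key takeaways
On the CritPt benchmark, GLM-5.2 scores 21%, tying Claude Opus 4.8, which scores 5 points higher on the overall index and costs several times as much per token.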
Full text
The thinking does pay off where reasoning is important. CritPt is a frontier physics benchmark developed by Argonne and UIUC, with contributions from 60+ researchers globally, and run by Artificial Analysis.
On CritPt, GLM-5.2 ties Claude Opus 4.8 outright, both at 21%, matching a model that scores 5 points higher overall in the Artificial Analysis Intelligence Index and costs several times as much per token.
