返回精选
AI 精选动态 智能评分 60

GLM-5.2登顶Design Arena

来源: twitter关注列表
作者: elvis (@omarsar0)
发布于: 2026-06-17
收录于: 2026-06-17
AI 推荐理由
展示了开源模型在创意设计基准上首次超越闭源前沿模型,值得关注其在长时任务和复杂指令上的后续表现。
核心解读
Zai_org发布的GLM-5.2模型在Design Arena上以Elo 1360分跃居第一,超越此前榜首Claude Fable 5(现不可用),提升4位和27 Elo点,且模型权重开源。评论指出该模型在游戏、落地页、HTML制品和3D世界设计方面表现良好,但尚未达到专业设计师水平。
全文
I was a bit suspicious of the claim, but GLM-5.2 is pretty good at designing stuff. Obviously not at the level of a professional designer, but it has that Opus-level quality. Great at: - games - landing pages - HTML artifacts - 3D worlds Wish I had Fable 5 to compare with. https://t.co/qco7AKIrCv https://video.twimg.com/amplify_video/2067343441785171969/vid/avc1/1920x1080/4wqAG-uFTv0x18Ia.mp4?tag=28 > **引用原帖 Design Arena (@Designarena):** > BREAKING: GLM-5.2 is now 1st on Design Arena. > With an Elo of 1360, GLM-5.2 has jumped ahead of the now unavailable Claude Fable 5. > And it's open weights. > This is an improvement of 4 positions and 27 Elo points to achieve one of the highest Elo scores in our code categories since Design Arena started. > Huge congratulations to the @Zai_org on the release! > https://x.com/Designarena/status/2066940737011560652 elvis (@omarsar0): YT if you prefer that: https://t.co/4XM1JO0GfS elvis (@omarsar0): The next set of tasks is going to be on long-running tasks. Very curious how it compares with the frontier models on this. Like, how does it work with /loop and /goal? Reporting back soon.
#模型发布#基准测试#开源