AI Picks
Smart score: 60
GLM-5.2 Tops Design Arena
Why the AI recommends it
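It shows an open-source model overtaking closed frontier models on a creative design benchmark for the first time; its follow-up performance on long-running tasks and complex instructions is worth watching.
Key takeaways
GLM-5.2, released by Zai_org, has taken first place on Design Arena with an Elo of 1360, moving ahead of the previous leader, Claude Fable 5 (now unavailable). That is a gain of 4 positions and 27 Elo points, and the model's weights are open. The commenter notes that it performs well on games, landing pages, HTML artifacts, and 3D worlds, but is not yet at the level of a professional designer.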
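For a sense of scale, a 27-point Elo gap can be converted into an expected head-to-head win rate. The sketch below assumes the standard logistic Elo formula with a 400-point scale; Design Arena's actual rating method is not described in the post, and the 1333 figure is simply 1360 minus 27, used here as a hypothetical comparison rating.

```python
def elo_expected_score(rating_a: float, rating_b: float, scale: float = 400.0) -> float:
    """Expected score of A against B under the standard logistic Elo model.

    Assumption: the 400-point scale of classic Elo. The leaderboard in the
    post may use a different scale or a different rating model entirely.
    """
    return 1.0 / (1.0 + 10.0 ** ((rating_b - rating_a) / scale))


# 1360 is the Elo reported in the post; 1333 is a hypothetical rating
# exactly 27 points lower, used only to illustrate the size of the gap.
p_win = elo_expected_score(1360, 1333)
print(f"Expected win rate for a 27-point gap: {p_win:.3f}")
```

Under these assumptions, a 27-point gap works out to an expected win rate of roughly 54%, which is a modest edge rather than a decisive one.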
Full text
I was a bit suspicious of the claim, but GLM-5.2 is pretty good at designing stuff.
Obviously not at the level of a professional designer, but it has that Opus-level quality.
Great at:
- games
- landing pages
- HTML artifacts
- 3D worlds
Wish I had Fable 5 to compare with. https://t.co/qco7AKIrCv
https://video.twimg.com/amplify_video/2067343441785171969/vid/avc1/1920x1080/4wqAG-uFTv0x18Ia.mp4?tag=28
> **Quoted post, Design Arena (@Designarena):**
> BREAKING: GLM-5.2 is now 1st on Design Arena.
> With an Elo of 1360, GLM-5.2 has jumped ahead of the now unavailable Claude Fable 5.
> And it's open weights.
> This is an improvement of 4 positions and 27 Elo points to achieve one of the highest Elo scores in our code categories since Design Arena started.
> Huge congratulations to the @Zai_org on the release!
> https://x.com/Designarena/status/2066940737011560652
elvis (@omarsar0): YT if you prefer that: https://t.co/4XM1JO0GfS
elvis (@omarsar0): The next set of tasks is going to be on long-running tasks. Very curious how it compares with the frontier models on this. Like, how does it work with /loop and /goal? Reporting back soon.