返回精选
AI 精选动态 智能评分 62

Ornith-1.0 开源编码专用 LLM 发布

来源: twitter关注列表
作者: AK (@_akhaliq)
发布于: 2026-06-30
收录于: 2026-07-01
AI 推荐理由
差异点:该模型创新性地通过 RL 同时优化代码生成 scaffold 和解决方案,在多个 agentic coding 基准上刷新开源 SOTA。
核心解读
Ornith 发布 Ornith-1.0 系列开源 LLM,参数规模包括 9B Dense、31B Dense、35B MoE 和 397B MoE,在多个编码基准上达到开源 SOTA,例如 SWE-Bench Verified 82.4、Terminal-Bench 2.1 77.5。模型基于 gemma4 和 qwen3.5 后训练,采用联合优化 scaffold 和解决方案的强化学习策略,全部 MIT 许可证发布。
全文
AK (@_akhaliq) 转发了 Ornith (@ornith_) 的帖子: 🐦Chirp chirp! Ornith-1.0-35B is now available in 🤗 HuggingFace Claude! 🤗Come and push Ornith on the swing ! 🔗https://t.co/GStiFdDueD ![photo](https://pbs.twimg.com/media/HMGVKq_a0AAuMA8.jpg) > **引用原帖 Ornith (@ornith_):** > Aloha! 🌺 Meet Ornith-1.0, a family of open-source LLMs specialized for agentic coding. > Ornith-1.0 spans the full parameter sizes including 9B Dense, 31B Dense, 35B MoE, and 397B MoE. It achieves state-of-the-art performance among open-source models of comparable size on coding benchmarks including: > ✅Terminal-Bench 2.1(77.5) > ✅SWE-Bench(82.4 on verified, 62.2 on pro, 78.9 on Multilingual) > ✅NL2Repo(48.2) > ✅SWE Atlas(41.2 on QnA, 42.6 RF, 39.1 TW) > ✅ClawEval(77.1) > Post-trained on top of gemma4 and qwen3.5, Ornith-1.0 employs a novel self-improving training strategy in which reinforcement learning is used to generate not only solution rollouts, but also the task-specific scaffolds that drive those rollouts. By jointly optimizing the scaffold and the resulting solution, the model generate higher-quality solutions in agentic coding.😎 > All models are released under the MIT license, enabling full commercial and research use. > 📖Tech Blog: https://t.co/qT9N2HYWFn > 🤗Huggingface: https://t.co/PRrwqjeBtM > https://x.com/ornith_/status/2070148887067963854
#模型发布#开源#技术突破