返回精选
AI 精选动态 智能评分 65

OpenAI introduces GPT-5.6 model suite with Sol, Terra, Luna

来源: twitter关注列表
作者: Rohan Paul (@rohanpaul_ai)
发布于: 2026-06-26
收录于: 2026-06-26
AI 推荐理由
新增信息:美国政府要求限制预览,且OpenAI披露了Sol在网络安全上未跨过内部阈值,这是以往少见的透明度。
核心解读
OpenAI发布GPT-5.6模型套件预览,包括旗舰模型Sol、中档Terra和廉价Luna。美国要求先进行小规模可信预览。Sol在agentic任务和Terminal-Bench 2.1上优于GPT-5.5,网络安全能力增强但未达内部Cyber Critical阈值。定价Sol为$5/1M输入token、$30/1M输出token,安全测试使用70万A100等效GPU小时。
全文
BREAKING: OpenAI just dropped the limited preview of its new GPT 5.6 model suite: Sol, the flagship; Terra, a medium-tier model for “high-volume work”; and Luna, a “fast and affordable” everyday model. The most revealing part is the release gate: OpenAI says the U.S. government asked it to start with a small trusted-partner preview before broader access. Sol is the flagship model, and OpenAI claims it is a step above GPT-5.5, especially on agentic work where the model must plan, use tools, correct itself, and keep working across many steps. Terminal-Bench 2.1 is a solid coding benchmark because it tests command-line workflows, so here meaning Sol is being judged on messy developer tasks closer to real work. ---- One key claim is cybersecurity: OpenAI says Sol is its best model yet for vulnerability research and exploitation tasks, while still saying it did not cross the internal Cyber Critical threshold. “GPT‐5.6 is trained to refuse prohibited cyber assistance, including when users attempt to disguise their intent or jailbreak the model.” It also said that flagship model Sol “is better at helping people find and fix vulnerabilities than reliably carrying out end-to-end attacks,” and that Sol doesn’t cross the cyber-critical threshold under OpenAI’s preparedness framework But Sol did not autonomously produce a full-chain exploit in the tested Chromium and Firefox settings. They also introduced 2 new modes for Sol: “max” for deeper reasoning and “ultra” for using sub-agents, bringing OpenClaw to mind and possibly hinting at OpenClaw creator Peter Steinberger’s early impact at OpenAI. ---- Pricing: GPT-5.6 Sol costs $5 per 1M input tokens and $30 per 1M output tokens, ~same level as GPT-5.5. Terra is positioned near GPT-5.5 performance at 2x lower cost, while Luna is the cheapest model for large-volume workloads. -- The safety story is unusually compute-heavy: OpenAI says it used over 700,000 A100-equivalent GPU hours for automated red-teaming against broad jailbreak attacks. Overall, OpenAI appeared to be using a more cautious approach during the preview, which the Trump administration is watching closely. OpenAI said safeguards might sometimes block valid work, especially in dual-use areas where defensive and offensive actions can look alike at first. That is one thing the preview is meant to test. ![photo](https://pbs.twimg.com/media/HLwlY7ubsAA27vh.png) > **引用原帖 OpenAI (@OpenAI):** > Introducing a limited preview of GPT-5.6 Sol, our next generation frontier model, as well as GPT-5.6 Terra, a balanced model for efficient, everyday work, and GPT-5.6 Luna, a fast and affordable model for high-volume work. > https://t.co/OoM83SyISN > https://x.com/OpenAI/status/2070555272230384038 Rohan Paul (@rohanpaul_ai): Sol delivers near-frontier cyber-exploitation capability much more efficiently. GPT-5.6 Sol reaches roughly 70% on ExploitBench with about 120K output tokens, far above GPT-5.5 and the cheaper GPT-5.6 models. Mythos Preview scores slightly higher, but it uses roughly 3x more tokens,
#AI#模型发布#AI模型