AI 精选动态
智能评分 80
OpenAI 发布 GPT-5.6 模型系列:Sol、Terra、Luna
AI 推荐理由
差异点:文章披露了 Sol 在网络安全评估中未达到内部关键阈值,以及美国政府参与预览审批的具体机制,值得原文了解细节。核心解读
OpenAI 发布 GPT-5.6 模型系列,包括旗舰模型 Sol、中端模型 Terra 和低成本模型 Luna。Sol 在 Agent 工作方面超越 GPT-5.5,定价 $5/百万输入 tokens 和 $30/百万输出 tokens。安全测试使用 70 万 A100 等效 GPU 小时进行自动红队测试,Sol 在网络安全方面未达到内部关键阈值。美国政府要求初期仅限可信合作伙伴预览。
全文
OpenAI wrote in their GPT-5.6 official blog post today.
On Trump administration's selective approval process of new model release. https://t.co/XsYgTEpFFY

> **引用原帖 Rohan Paul (@rohanpaul_ai):**
> BREAKING: OpenAI just dropped the limited preview of its new GPT 5.6 model suite: Sol, the flagship; Terra, a medium-tier model for “high-volume work”; and Luna, a “fast and affordable” everyday model.
> The most revealing part is the release gate: OpenAI says the U.S. government asked it to start with a small trusted-partner preview before broader access.
> Sol is the flagship model, and OpenAI claims it is a step above GPT-5.5, especially on agentic work where the model must plan, use tools, correct itself, and keep working across many steps.
> Terminal-Bench 2.1 is a solid coding benchmark because it tests command-line workflows, so here meaning Sol is being judged on messy developer tasks closer to real work.
> ----
> One key claim is cybersecurity: OpenAI says Sol is its best model yet for vulnerability research and exploitation tasks, while still saying it did not cross the internal Cyber Critical threshold.
> “GPT‐5.6 is trained to refuse prohibited cyber assistance, including when users attempt to disguise their intent or jailbreak the model.” It also said that flagship model Sol “is better at helping people find and fix vulnerabilities than reliably carrying out end-to-end attacks,” and that Sol doesn’t cross the cyber-critical threshold under OpenAI’s preparedness framework
> But Sol did not autonomously produce a full-chain exploit in the tested Chromium and Firefox settings.
> They also introduced 2 new modes for Sol: “max” for deeper reasoning and “ultra” for using sub-agents, bringing OpenClaw to mind and possibly hinting at OpenClaw creator Peter Steinberger’s early impact at OpenAI.
> ----
> Pricing: GPT-5.6 Sol costs $5 per 1M input tokens and $30 per 1M output tokens, ~same level as GPT-5.5.
> Terra is positioned near GPT-5.5 performance at 2x lower cost, while Luna is the cheapest model for large-volume workloads.
> --
> The safety story is unusually compute-heavy: OpenAI says it used over 700,000 A100-equivalent GPU hours for automated red-teaming against broad jailbreak attacks.
> Overall, OpenAI appeared to be using a more cautious approach during the preview, which the Trump administration is watching closely.
> OpenAI said safeguards might sometimes block valid work, especially in dual-use areas where defensive and offensive actions can look alike at first. That is one thing the preview is meant to test.
> https://x.com/rohanpaul_ai/status/2070573957271732353