AI 精选动态
智能评分 65
Anthropic 重新上线 Claude Fable 5 并加强安全限制
AI 推荐理由
值得关注后续安全分类器误报率调整及行业框架细节。核心解读
Anthropic 重新上线 Claude Fable 5,加入新分类器以阻止网络安全任务,短期编码和调试任务回退至 Opus 4.8;同时联合亚马逊、微软、谷歌起草评估 AI 越狱严重性的框架,并扩大与美国政府在模型测试和安全措施上的合作。
全文
So uh, Fable will be useless?
Apparently coding work will fall back to Opus. https://t.co/igcGeHSvFG

> **引用原帖 Anthropic (@AnthropicAI):**
> Claude Fable 5 will be available again globally tomorrow.
> After a series of productive conversations with the US government, we're redeploying the model with a new set of classifiers to target and block more cybersecurity tasks. In the near term, some routine tasks like coding and debugging will fall back to Opus 4.8. We’ll continue to refine these classifiers over the coming weeks to reduce false positives and better distinguish genuine misuse from legitimate requests.
> We’ve also begun drafting a consensus framework—with Amazon, Microsoft, Google, and other Glasswing partners—for assessing the severity of AI jailbreaks and how AI developers should respond to them. We invite other industry partners and model providers to join us in this effort.
> Finally, we’re scaling up our collaboration with the US government on model testing and safeguards. This will include pre-release access to models and safeguards for evaluation, information sharing on jailbreaks and misuse, and dedicated resources for joint research.
> Thank you to our users for your patience, and to our partners across the government, industry, and the research community who worked alongside us to make Fable 5 available again.
> Read our full blog: https://t.co/VHyum831ri
> https://x.com/AnthropicAI/status/2072163884430229756