Anthropic apologizes for hidden guardrails in Claude Fable
Anthropic apologizes for secretly throttling Claude Fable 5 with hidden guardrails, undermining researchers and rivals.

Claude Fable">
Anthropic has apologized for stealthily throttling its new AI model, Claude Fable 5, with hidden guardrails that undermine both researchers and rivals using it to develop competing systems. The company says it is reversing course and will be more transparent about when the restrictions kick in, even if that means Fable refuses more queries. Fable is the first widely available model in Anthropic's Mythos class of AI systems, a group the company has spent months warning are too dangerous for public release.
Anthropic says it has addressed some of those risks by launching Fable with safeguards that prevent it from responding to certain "high-risk" queries. The company's decision to throttle Fable with hidden guardrails has sparked criticism from researchers and competitors who rely on transparent and open AI systems to develop their own models. By not disclosing the guardrails, Anthropic may have inadvertently hindered the development of competing systems, raising concerns about the company's commitment to responsible AI development.
Anthropic's move to reverse course and increase transparency around Fable's restrictions is a step towards rebuilding trust with the research community and competitors. However, questions remain about the company's handling of the Mythos class of AI systems and the potential risks associated with their development. Why this matters: The controversy surrounding Anthropic's Claude Fable highlights the challenges of balancing AI safety with transparency and openness in the development process.
As AI systems become increasingly powerful, the need for clear guidelines and disclosure around safety measures grows. Anthropic's decision to prioritize transparency with Fable sets a positive precedent, but raises questions about the company's earlier handling of the Mythos class. For developers and businesses, this incident underscores the importance of considering the potential risks and limitations of AI systems, as well as the need for transparent communication from AI developers.
Ultimately, the development of trustworthy AI systems will depend on the industry's ability to strike a balance between safety, transparency, and openness.
Source: The Verge