AI Security | Jun 9, 2026

Anthropic Launches Claude Fable 5: Mythos-Class AI With Cybersecurity Guardrails

Anthropic launches Claude Fable 5 AI with cybersecurity guardrails, restricting use in sensitive domains.

Summary

Anthropic has released Claude Fable 5, a powerful AI model with built-in safeguards designed to prevent misuse in sensitive areas like cybersecurity. In high-risk domains the model automatically falls back to the less capable Claude Opus 4.8, though early usage data indicates that at least 95% of sessions never trigger the fallback. Anthropic emphasizes its rigorous internal and external testing against adversarial attempts to bypass these safety measures.

Full text

Anthropic on Tuesday announced the general availability of Claude Fable 5, a powerful Mythos-class AI model engineered with new safeguards that specifically restrict its use in high-risk domains, including cybersecurity.

The AI giant says this marks the first time a model of this capability class has been deemed safe enough for widespread public and developer access.

Fable 5 surpasses prior models in software engineering, knowledge work, vision, and long-running tasks, but the company prioritized safety by implementing targeted blocks. In sensitive areas such as cybersecurity and biology, the model automatically falls back to the less capable Claude Opus 4.8 to prevent potential misuse.

Early usage data indicates that at least 95% of sessions run entirely on Fable 5’s capabilities without triggering any fallback.

“The uplift from Mythos-level capabilities is valuable to many adversaries, for instance, those who could financially gain from cyberattacks, and we therefore expect them to be motivated to try to circumvent our safety measures,” Anthropic noted.

The company emphasized the rigor of its safety measures. It conducted extensive internal red-teaming of its classifiers, followed by an external bug bounty program spanning over 1,000 hours that yielded no universal jailbreaks.

Independent external red-teaming also failed to uncover critical bypasses, underscoring the robustness of the safeguards against adversarial attempts to achieve restricted outputs.

Project Glasswing partners gain upgraded Mythos 5

Anthropic also announced on Tuesday that trusted users, including its cybersecurity partners in Project Glasswing, are being upgraded from Claude Mythos Preview to Claude Mythos 5. The company plans to gradually expand this high-privilege access through a structured trusted-access program.
Anthropic announced recently that it is expanding Project Glasswing to add roughly 150 new organizations. The AI giant has not listed the new additions, but several cybersecurity and tech companies have since announced their participation in the project, including Dragos, Tenable, TrendAI (Trend Micro), Netskope, BeyondTrust, Rubrik, BT, Intercontinental Exchange, and Hitachi.

Both Fable 5 and Mythos 5 are priced at $10 per million input tokens and $50 per million output tokens, with the former available immediately via the Claude API for developers.

Written by Eduard Kovacs (@EduardKovacs), senior managing editor at SecurityWeek. He worked as a high school IT teacher before starting a career in journalism in 2011. Eduard holds a bachelor’s degree in industrial informatics and a master’s degree in computer techniques applied in electrical engineering.
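For readers who want to translate the quoted per-token prices into a rough bill, the arithmetic can be sketched as below. This is only an illustration: the helper name `estimate_cost_usd` and the sample token counts are made up for the example, and the calculation assumes flat per-token billing at the rates quoted in the article, with no discounts, caching, or other pricing rules that may apply in practice.

```python
# Prices quoted in the article, in USD per million tokens (Fable 5 and Mythos 5).
INPUT_PRICE_PER_MTOK = 10.0
OUTPUT_PRICE_PER_MTOK = 50.0


def estimate_cost_usd(input_tokens: int, output_tokens: int) -> float:
    """Hypothetical helper: estimate cost assuming flat per-token billing."""
    if input_tokens < 0 or output_tokens < 0:
        raise ValueError("token counts must be non-negative")
    total = (input_tokens * INPUT_PRICE_PER_MTOK
             + output_tokens * OUTPUT_PRICE_PER_MTOK)
    return total / 1_000_000


# Illustrative workload (made-up numbers): 200,000 input and 40,000 output tokens.
print(f"${estimate_cost_usd(200_000, 40_000):.2f}")  # prints $4.00
```

Under these assumptions, output tokens cost five times as much as input tokens, so a workload that generates long responses is dominated by the output side of the bill.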

Entities

Claude Fable 5 (product)
Claude Opus 4.8 (product)
Anthropic (vendor)
Claude Mythos Preview (product)
Claude Mythos 5 (product)
Claude API (product)