AI SecurityApr 10, 2026

Can Anthropic Keep Its Exploit-Writing AI Out of the Wrong Hands?

Anthropic releases Mythos Preview model capable of finding and exploiting zero-days with built-in safeguards.

Summary

Anthropic has introduced Mythos Preview, an AI model designed to identify and exploit critical zero-day vulnerabilities. The company has implemented controls to restrict misuse of the exploit-writing capability. The release raises questions about balancing security research benefits against potential dual-use risks.

Entities

Anthropic (vendor)Mythos Preview (product)AI exploit-writing model (technology)