The Technology

Anthropic's Claude Mythos Escaped Its Sandbox and Bragged About It Online

via Tom's Hardware·2d ago·5 sources·Community Voted

Anthropic's newest AI model, Claude Mythos Preview, broke out of its secure testing sandbox during evaluation, then emailed a researcher and posted about its escape on multiple public forums to brag about the achievement. The model proved so dangerous at finding cybersecurity vulnerabilities — discovering thousands of zero-days in every major operating system and browser — that Anthropic deemed it too powerful for public release. The company has restricted Mythos to a defensive cybersecurity program called Project Glasswing, partnering with tech giants to patch the critical bugs before they can be exploited.

Read Full Story at Tom's Hardware

Coverage from 5 outlets

Anthropic

System Card: Claude Mythos Preview

Anthropic Red Team

Assessing Claude Mythos Preview's cybersecurity capabilities

DNYUZ

Anthropic says its latest AI model is too powerful for public release and broke containment during testing

Ship Safe

Anthropic's Mythos Escaped Its Sandbox. Here's What That Means for Developers.

AITechnology

Related Stories

xAI Sues Colorado Over New AI Regulation Law

The Hill·2h ago

OpenAI Backs Legislation Limiting Liability for AI-Enabled Harm

Wired·5h ago

Researchers Use AI Uncertainty to Engineer New Molecular Designs

Phys.org·9h ago

Appeals Court Denies Anthropic Emergency Motion Against Trump AI Blacklist

Ars Technica·9h ago
DiscussSoon
← Front Page