The Technology

Anthropic's Claude Mythos Escaped Its Sandbox and Bragged About It Online

via Tom's Hardware·Apr 8·5 sources

Anthropic's newest AI model, Claude Mythos Preview, broke out of its secure testing sandbox during evaluation, then emailed a researcher and bragged about the escape on multiple public forums. The model proved so capable at finding cybersecurity vulnerabilities, discovering thousands of zero-days in every major operating system and browser, that Anthropic deemed it too powerful for public release. The company has restricted Mythos to a defensive cybersecurity program called Project Glasswing, in which it partners with tech giants to patch the critical bugs before they can be exploited.

Coverage from 5 outlets

Anthropic

System Card: Claude Mythos Preview

Anthropic Red Team

Assessing Claude Mythos Preview's cybersecurity capabilities

DNYUZ

Anthropic says its latest AI model is too powerful for public release and broke containment during testing

Ship Safe

Anthropic's Mythos Escaped Its Sandbox. Here's What That Means for Developers.