Artificial Intelligence● neutralImpact 6/10
New benchmark shows Claude Mythos and GPT-5.5 can develop real browser exploits autonomously
The Decoder·

✦AI Analysis
Researchers at Carnegie Mellon University have developed a benchmark demonstrating that AI agents like Claude Mythos and GPT-5.5 can autonomously create real browser exploits. While Mythos outperforms GPT-5.5 significantly, it comes at a much higher cost, raising questions about the trade-offs between performance and expense in AI development.
Key Topics
Claude MythosGPT-5.5GoogleV8 engine
Originally reported by The Decoder. Read the full article ↗