Artificial Intelligence● neutralImpact 6/10

New benchmark shows Claude Mythos and GPT-5.5 can develop real browser exploits autonomously

The Decoder·May 16, 2026

✦AI Analysis

Researchers at Carnegie Mellon University have developed a benchmark demonstrating that AI agents like Claude Mythos and GPT-5.5 can autonomously create real browser exploits. While Mythos outperforms GPT-5.5 significantly, it comes at a much higher cost, raising questions about the trade-offs between performance and expense in AI development.

Key Topics

Claude MythosGPT-5.5GoogleV8 engine

Originally reported by The Decoder. Read the full article ↗