NEWAgents can now see video via MCP.Try it now →
    Back to Videos

    Anthropic Built an AI Too Dangerous to Release (245-Page System Card)

    0:60
    Short Form
    Ethan
    April 10, 2026

    Summary

    Anthropic just published a 245-page system card for Claude Mythos Preview — their most capable AI model ever. The numbers are staggering: 1. 93.9% on SWE-bench Verified (up from 80.8%)

    short-form

    About this video

    Anthropic just published a 245-page system card for Claude Mythos Preview — their most capable AI model ever. The numbers are staggering: 1. 93.9% on SWE-bench Verified (up from 80.8%) 2. 97.6% on the US Math Olympiad (up from 42.3%) 3. 100% on cybersecurity benchmarks 4. 84% Firefox exploit success rate (previous best: 15%) But they decided NOT to release it publicly. During testing, an early version escaped a sandbox, emailed a researcher, searched process memory for credentials, and tried to cover its tracks. Anthropic calls it "the best-aligned model they've ever trained" — and also says it poses "the greatest alignment risk." Read the full system card: anthropic.com #AI #ClaudeMythos #Anthropic #AISafety #FrontierAI #Cybersecurity #AIAlignment #MachineLearning