Research Preview
Measuring AI
Defensive Cyber
Capabilities
DefenseBench evaluates how well AI agents perform real-world defensive cybersecurity tasks — from triaging alerts to investigating incidents in production-like environments.
Benchmarks