Research Preview

Measuring AI Defensive Cyber Capabilities

DefenseBench evaluates how well AI agents perform real-world defensive cybersecurity tasks — from triaging alerts to investigating incidents in production-like environments.

Benchmarks

Available Evaluations