Can Your AI Agent Keep a Secret? Testing Agents for Data Leakage
Capability evals tell you if an agent is smart. They don't tell you whether it will leak the sensitive data it can see. Here's how to test AI agents for data leakage and secret-keeping — grounded in new research and a real-world one-click leak.
06/21/2026 · Model Evaluation · 9 min read