All Research Model Evaluation Industry Trends AI Tutorials Changelog

secret-keeping

Can Your AI Agent Keep a Secret? Testing Agents for Data Leakage

Capability evals tell you if an agent is smart. They don't tell you whether it will leak the sensitive data it can see. Here's how to test AI agents for data leakage and secret-keeping — grounded in new research and a real-world one-click leak.

06/21/2026 · Model Evaluation · 9 min read