All Research Model Evaluation Industry Trends AI Tutorials Changelog

data-leakage

Can Your AI Agent Keep a Secret? Testing Agents for Data Leakage

Capability evals tell you if an agent is smart. They don't tell you whether it will leak the sensitive data it can see. Here's how to test AI agents for data leakage and secret-keeping — grounded in new research and a real-world one-click leak.

06/21/2026 · Model Evaluation · 9 min read

Research Agent Data Leakage: Inside the MosaicLeaks Benchmark

Research agent data leakage is a measurable failure mode, not a hypothetical. ServiceNow's MosaicLeaks benchmark shows how deep research agents leak private context through their search queries — and why you can't prompt the problem away.

06/20/2026 · Model Evaluation · 10 min read