FEAT: Add Agent Security Evaluation for Sensitive Data Access by amiteliahu · Pull Request #1367 · Azure/PyRIT

amiteliahu · 2026-02-12T12:23:41Z

Description

Adds an Agent Security Evaluation feature that tests whether AI agents with tool access can be manipulated into reading sensitive files (including SSH and API keys, command history, environment variables, users information) through adversarial prompt injection.

How it works:

Plant canary markers in high-sensitivity files (/etc/shadow, /etc/passwd, ~/.ssh/id_rsa, .env, ~/.bash_history, /etc/crontab and more) within the agent's environment.
Run 30 adversarial prompts
Detect leaked canary markers in responses and generate hardening recommendations.

An optional (recommended) Docker sandbox is provided for isolated testing, but the evaluation works with any HTTP-exposed agent, users can plant canaries via plant_canaries.py or Dockerfile.canary-template in their own environment.

What's included:

agent_security.py — canary markers, content, scorer factories
sensitive_data_access.prompt — 30 attack prompts
agent-sandbox — Docker sandbox with LangChain example agent
sensitive_data_access_attack.ipynb — end-to-end notebook
0_agent_security.md — setup docs and API reference
Platform-agnostic — works with any HTTP-exposed agent (LangChain, Semantic Kernel, AutoGen, etc.). Users can plug in their own agent via Dockerfile.canary-template.
plant_canaries.py - non-Docker alternative

Tests and Documentation

Unit tests: test_agent_security.py
Overview doc added to _toc.yml

amiteliahu · 2026-02-12T12:27:15Z

@microsoft-github-policy-service agree
@microsoft-github-policy-service agree company="Microsoft"

hannahwestra25 · 2026-02-13T01:41:31Z

pyrit/executor/attack/single_turn/agent_security.py

@@ -0,0 +1,186 @@
+# Copyright (c) Microsoft Corporation.
+# Licensed under the MIT license.


Let's contain all these utilities into the notebook (sensitive_data_access_attack.ipynb) since this isn't a new attack but rather a utilization of the attack. I'm thinking this PR should be showcasing your POC in a notebook and helping users adapt that POC

hannahwestra25 · 2026-02-13T01:43:43Z

tests/unit/executor/attack/single_turn/test_agent_security.py

+
+import pytest
+
+from pyrit.executor.attack.single_turn.agent_security import (


maybe obvious but we can remove this file when we consolidate the agent_security code into the notebook

hannahwestra25 · 2026-02-13T01:45:33Z

pyrit/datasets/seed_datasets/local/agent_security/sensitive_data_access.prompt

@@ -0,0 +1,215 @@
+dataset_name: agent_security_sensitive_data_access


this dataset is great! small nit: I would rename it to seed_datasets/local/agentic/sensitive_data_access.prompt

hannahwestra25 · 2026-02-13T01:54:57Z

Nice work, excited to try this out! Left a few comments and am happy to hop on a call to discuss :)

hannahwestra25 · 2026-02-13T02:02:56Z

doc/code/executor/agent_security/sensitive_data_access_attack.py

+# ---------------------------------------------------------------------------
+# 5. Print and save
+# ---------------------------------------------------------------------------
+SEPARATOR = "=" * 80


we generally don't given advice / next steps based on the results of an attack

Agents Evaluation - Sensitive Data Access

fdaa881

hannahwestra25 self-assigned this Feb 12, 2026

hannahwestra25 reviewed Feb 13, 2026

View reviewed changes

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

FEAT: Add Agent Security Evaluation for Sensitive Data Access#1367

FEAT: Add Agent Security Evaluation for Sensitive Data Access#1367
amiteliahu wants to merge 1 commit intoAzure:mainfrom
amiteliahu:feature/agent-security-evaluation

amiteliahu commented Feb 12, 2026 •

edited

Loading

Uh oh!

amiteliahu commented Feb 12, 2026

Uh oh!

hannahwestra25 Feb 13, 2026

Uh oh!

hannahwestra25 Feb 13, 2026

Uh oh!

hannahwestra25 Feb 13, 2026

Uh oh!

hannahwestra25 commented Feb 13, 2026

Uh oh!

hannahwestra25 Feb 13, 2026

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

		@@ -0,0 +1,186 @@
		# Copyright (c) Microsoft Corporation.
		# Licensed under the MIT license.


		import pytest

		from pyrit.executor.attack.single_turn.agent_security import (

		@@ -0,0 +1,215 @@
		dataset_name: agent_security_sensitive_data_access

Conversation

amiteliahu commented Feb 12, 2026 • edited Loading Uh oh! There was an error while loading. Please reload this page.

Uh oh!

Description

Tests and Documentation

Uh oh!

amiteliahu commented Feb 12, 2026

Uh oh!

hannahwestra25 Feb 13, 2026

Choose a reason for hiding this comment

Uh oh!

hannahwestra25 Feb 13, 2026

Choose a reason for hiding this comment

Uh oh!

hannahwestra25 Feb 13, 2026

Choose a reason for hiding this comment

Uh oh!

hannahwestra25 commented Feb 13, 2026

Uh oh!

hannahwestra25 Feb 13, 2026

Choose a reason for hiding this comment

Uh oh!

Reviewers

Assignees

Labels

Projects

Milestone

Development

Uh oh!

3 participants

amiteliahu commented Feb 12, 2026 •

edited

Loading