Skip to main content
AI Security & GovernanceMay 15, 2026 · 5 min read

The Execution Layer: Where AI Agent Security Falls Apart

82% of enterprises run unknown AI agents. Security teams lock down models but leave the execution layer — where agents actually do damage — completely ungoverned.

By SpringVanta

Security teams have spent the last two years locking down which AI models employees can use, what data those models can access, and which vendors pass procurement review. That work matters. But across the enterprise, the place where AI agents actually do damage — the execution layer, is running without controls. The execution layer is where an AI agent translates a reasoning step into an action. It calls an API, writes to a database, triggers a workflow, sends an email. Most enterprises have no governance at this layer. Tool invocations are trusted by default. There is no risk scoring before execution, no policy enforcement at the connector level, and no audit trail connecting agent actions to specific identities.

The numbers tell the story

Three surveys released in the first half of 2026 paint a consistent picture:

  • 82% of enterprises have unknown AI agents running in their IT infrastructure, according to a Cloud Security Alliance survey of 418 IT and security professionals. The same survey found that 65% experienced AI agent-related incidents in the past 12 months, resulting in data exposure (61%), operational ion (43%), and financial losses (35%).
  • 88% of organizations reported confirmed or suspected AI agent security incidents in the last year, per Gravitee's State of AI Agent Security report. In healthcare, that figure reaches 92.7%.
  • 82% of executives say they are confident that existing policies protect against unauthorized agent actions, yet only 14.4% send agents to production with full security or IT approval.
    The Confidence-Controls Gap in AI Agent Security
    The gap between confidence and controls is the defining problem of enterprise AI security in 2026. Policy documentation and runtime enforcement are not the same thing.

Why the execution layer gets ignored

When security teams evaluate AI tools, they focus on the model: what it can generate, what data it was trained on, whether it leaks sensitive information. This is the right instinct for chatbots and copilots. But autonomous agents are different. They do not just generate text , they take actions through tool invocations. A prompt injection attack does not need to breach your perimeter. It only needs to manipulate an agent into using a tool it already has access to. An attacker embeds instructions in a document, an email, or an API response. The agent reads the content, interprets the embedded instruction as a legitimate task, and acts on it using real credentials through a real access path. No malware binary, no exploit code, just text. Stanford's Trustworthy AI Research Lab found that model-level guardrails alone are insufficient: fine-tuning attacks bypassed Claude Haiku in 72% of test cases and GPT-4o in 57%. Securing what the model says does not secure what the agent does.

Shadow AI compounds the exposure

Many agents operating inside enterprise environments were deployed by individual teams without security review. They connect to tools, MCP servers, and external APIs that the security team has never mapped or approved. Gravitee found that only 24.4% of organizations have full visibility into which AI agents are communicating with each other. More than half of all agents run without security oversight or logging. The average organization now manages 37 deployed agents, and that number grows every quarter. CSA's survey flagged agent decommissioning as a particular risk: only 21% of organizations have formal decommissioning processes. Decommissioned agents retain permissions and credentials long after their intended use, creating what CSA calls "retirement debt" , a structural exposure that accumulates quietly until something breaks.

The identity problem at the core

Most organizations treat AI agents as extensions of human users, assigning them to shared service accounts or existing credentials. 45.6% of technical teams rely on shared API keys for agent-to-agent authentication, according to Gravitee. When multiple agents share credentials, attribution becomes impossible. Only 21.9% of teams treat AI agents as independent, identity-bearing entities with their own access scopes and audit trails. The organizations that do can attribute actions, scope blast radius, and isolate a compromised agent without taking down entire workflows.

The vendor response is accelerating

Microsoft's Agent 365 May 2026 update is the most visible vendor attempt to close this gap. The platform now includes a centralized registry for all AI agents (Microsoft-built, internal, and third-party), a real-time dashboard for monitoring agent activity, shadow AI detection that integrates with Defender and Intune, admin approval workflows, automated governance rules, conditional access policies for agents, and MCP server management. Cisco's AI Defense solution expanded in February 2026 to add runtime protections against tool abuse and supply chain manipulation at the MCP layer. CrowdStrike has also moved to address execution-layer attacks specifically. These tools are arriving faster than the frameworks meant to guide them. NIST's AI Agent Standards initiative is a multi-year effort with first deliverables expected in late 2026. The EU AI Act and ISO 42001 predate agentic AI entirely. CSA's own AI Controls Matrix is being updated, but the current version was not designed for autonomous agents that create and instruct other agents.

What to do now

If your organization is deploying AI agents , and the data says 80.9% of technical teams have moved past planning into active testing or production, here is what security practitioners recommend: 1. Inventory every agent. You cannot govern what you cannot see. Continuous discovery should cover homegrown automations, SaaS-based agents, browser extensions, and MCP server connections. 2. Treat agents as identities. Give each agent its own credentials, access scopes, and audit trails. Stop sharing service accounts across agents. 3. Gate the execution layer. Deploy controls between agents and their tools. Every tool invocation should be evaluated against policy before execution. High-risk actions should route to human approval. 4. Build decommissioning into the lifecycle. Require credential revocation and access cleanup as part of every agent's retirement process. Retirement debt is real and compounding. 5. Monitor behavior, not just deployment state. Approved agents drift. Behavioral monitoring catches the drift before it becomes an incident. The framework gap is real, but it is not an excuse to wait. The tools for execution-layer security exist today. The question is whether your organization will deploy them before an incident forces the issue.

Sources:

  • Cloud Security Alliance, "Autonomous but Not Controlled" (April 2026)
  • Gravitee, "State of AI Agent Security" (2026)
  • Stanford Trustworthy AI Research Lab (2026)
  • Microsoft Agent 365 May 2026 Update
  • AGAT Software / Token Security research (2026)
Read more

Like this kind of writing?

One email when something good ships — usually once or twice a month.