Identity-first Zero Trust, agent governance, and platform vulnerabilities for agentic AI
Agentic AI Security & Identity
The cybersecurity landscape in 2026 continues to be profoundly shaped by the rapid adoption and evolution of agentic AI—autonomous, decision-making AI agents integrated deeply into enterprise systems. This transformation unlocks remarkable operational efficiencies but simultaneously introduces complex identity-first security challenges, novel attack surfaces, and governance demands that require a foundational overhaul of security paradigms.
Identity-First Zero Trust: The Cornerstone for Securing Agentic AI Ecosystems
As agentic AI agents become ubiquitous across cloud-native infrastructures, development pipelines, and operational technology (OT) environments, identity-first Zero Trust has emerged as the indispensable security framework. Traditional perimeter defenses prove inadequate against increasingly sophisticated attacks exploiting AI invocation mechanisms, OAuth token flows, and ephemeral credentials. Recent advancements reinforce this approach through:
- Continuous Cryptographic Identity Attestation: Moving beyond static identity assertions, enterprises now implement persistent cryptographic verification of AI agents and users throughout runtime sessions. This continuous attestation enables real-time detection of session hijacking, token misuse, and unauthorized agent activation. The strategic acquisition of StrongDM by Delinea highlights the market’s drive toward platforms delivering continuous identity authorization tailored for AI-native environments.
- Ephemeral and Just-In-Time (JIT) Credentialing: Short-lived, narrowly scoped credentials have become a best practice to minimize attack vectors. By strictly limiting token lifespan and privilege scope—particularly for non-human identities like AI agents—organizations effectively hinder lateral movement and privilege escalation attacks.
- Managed Identities and Privileged Access Management (PAM) for AI Agents: Vendors such as N-able now emphasize identity governance solutions focused on securing AI agent identities within backup, recovery, and cloud ecosystems. PAM for non-human identities has transitioned from a niche concern to a core security imperative, preventing credential theft that could cascade into ransomware outbreaks or supply chain compromises.
- Hardened Runtime Sandboxing and Invocation Controls: AI agent execution environments are rigorously sandboxed with tight controls over GPU access, memory allocation, and model invocation permissions. Enhanced input validation policies specifically target known exploitation vectors—ranging from calendar invites to chat-based triggers—effectively blocking unauthorized or malicious agent activations.
- Living SBOM/AIBOM with Cryptographic Anchors: The dynamic Software Bill of Materials (SBOM) and AI Bill of Materials (AIBOM) frameworks, anchored cryptographically, now provide immutable real-time provenance of AI-generated code and artifacts. This “living” provenance is crucial for detecting supply chain tampering in AI workflows and complying with emergent global regulatory mandates.
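The JIT credentialing pattern above can be sketched in a few lines. This is a minimal illustration, not a production token format: the HMAC signing key, agent names, and scope strings are all hypothetical, and a real deployment would use a KMS/HSM-backed key and a standard token format such as JWT.

```python
import base64
import hashlib
import hmac
import json
import time

SIGNING_KEY = b"demo-key-rotate-me"  # hypothetical; back this with a KMS/HSM in practice

def mint_jit_token(agent_id, scope, ttl_seconds=300):
    """Mint a short-lived, narrowly scoped credential for a non-human identity."""
    claims = {"sub": agent_id, "scope": scope, "exp": int(time.time()) + ttl_seconds}
    payload = base64.urlsafe_b64encode(json.dumps(claims).encode())
    sig = hmac.new(SIGNING_KEY, payload, hashlib.sha256).hexdigest()
    return payload.decode() + "." + sig

def verify_token(token, required_scope):
    """Reject tampered, expired, or over-broad tokens before any agent action."""
    payload_b64, _, sig = token.rpartition(".")
    expected = hmac.new(SIGNING_KEY, payload_b64.encode(), hashlib.sha256).hexdigest()
    if not hmac.compare_digest(sig, expected):
        return False  # signature mismatch: token was tampered with
    claims = json.loads(base64.urlsafe_b64decode(payload_b64))
    if time.time() > claims["exp"]:
        return False  # expired: short lifespan limits lateral movement
    return required_scope in claims["scope"]

token = mint_jit_token("ai-agent-42", ["repo:read"], ttl_seconds=300)
print(verify_token(token, "repo:read"))   # scoped action allowed
print(verify_token(token, "repo:write"))  # out-of-scope action denied
```

The key property is that every credential carries its own expiry and an explicit scope list, so a stolen token is useful for at most a few minutes and only for the narrow action it was minted for.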
Expanding Threat Landscape: Exploiting Agentic AI’s Unique Attack Surfaces
The attack surface for agentic AI continues to grow in complexity and sophistication, with recent incidents underscoring the urgency for adaptive defense:
- Weaponized OAuth Redirection and Invocation Exploits: Malicious actors exploit subtle flaws in OAuth redirection logic to stealthily inject payloads into AI workflows. Recent campaigns have leveraged trusted OAuth endpoints to hijack token flows, granting persistent unauthorized access across cloud platforms and developer tools. Combined with calendar and chat-based AI invocation exploits—such as triggering Google Gemini AI assistants via innocuous calendar invites—these attack vectors bypass traditional input validation and evade detection.
- Persistent Remote Code Execution (RCE) in AI-Powered Coding Assistants: The notable breach of Anthropic’s Claude Code assistant, which exposed over 150GB of sensitive Mexican government data, remains a stark warning. Attackers exploited RCE vulnerabilities to execute arbitrary commands within AI-powered developer environments, enabling credential theft, supply chain manipulation, and data exfiltration.
- Agentic AI Botnets Targeting CI/CD Pipelines: Autonomous AI-driven botnets like hackerbot-claw have intensified attacks on cloud-native CI/CD pipelines, especially GitHub Actions. Recent compromises of Microsoft and DataDog infrastructures demonstrate attackers’ capabilities to inject malicious code into automated build processes, threatening software supply chain integrity and downstream applications.
- Platform-Level AI-Assisted Exploitation: Over 500 FortiGate firewall breaches have been attributed to AI-powered credential attack engines infiltrating OT networks to stage ransomware and sabotage campaigns. Vulnerabilities in AI-native browsers—such as Chrome’s Gemini live assistant—have enabled extension hijacking and unauthorized AI model invocations, exposing fundamental weaknesses in AI invocation controls.
- Geopolitical Focus: Australia’s Elevated Cybersecurity Risks in 2026: Regional threat analyses reveal Australia’s heightened exposure to AI-driven risks, cloud misconfigurations, and insider threats. Australian enterprises are increasingly targeted through AI’s trust vectors, underscoring the global reach of these challenges and the necessity for localized threat intelligence integration.
- New Insights: Large-Scale AI-Powered Vulnerability Discovery by OpenAI Codex Security: Complementing defensive efforts, OpenAI Codex Security recently scanned 1.2 million code commits across major open source projects—such as GnuPG, GnuTLS, GOGS, PHP, and Chromium—uncovering critical vulnerabilities that had eluded traditional detection. This large-scale AI-powered vulnerability research highlights both the dual-use nature of AI in cybersecurity and the imperative for continuous auditing of open source software foundations.
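A common root cause of the OAuth redirection exploits described above is permissive redirect_uri matching. A defensive sketch, assuming a hypothetical pre-registered allowlist (the URLs and client names here are illustrative only): validate the redirect target by exact match rather than prefix or pattern match, and reject structurally suspicious URLs outright.

```python
from urllib.parse import urlsplit

# Hypothetical allowlist of exact, pre-registered redirect URIs for one OAuth client.
REGISTERED_REDIRECTS = {
    "https://app.example.com/oauth/callback",
}

def is_safe_redirect(redirect_uri):
    """Exact-match validation of an OAuth redirect_uri.

    Prefix- and pattern-matching are precisely what redirection-logic exploits
    abuse, so this check insists on byte-for-byte equality plus a few
    structural sanity checks evaluated first.
    """
    parts = urlsplit(redirect_uri)
    if parts.scheme != "https":
        return False          # rejects http://, javascript:, data:, etc.
    if "@" in parts.netloc:
        return False          # rejects userinfo tricks like https://trusted.com@evil.com/
    return redirect_uri in REGISTERED_REDIRECTS

print(is_safe_redirect("https://app.example.com/oauth/callback"))         # True
print(is_safe_redirect("https://app.example.com@evil.example/callback"))  # False
print(is_safe_redirect("https://app.example.com/oauth/callback/../x"))    # False
```

Exact matching is deliberately strict: it trades flexibility for the guarantee that a token can only ever be delivered to a URI the client operator explicitly registered.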
AI-Augmented Detection and Automated Governance: Meeting AI Threats at Scale
Defending agentic AI environments demands detection and governance solutions that leverage AI’s power to keep pace with evolving threats:
- Agent-Aware Telemetry and Advanced ML Detection Models: Continuous telemetry enriched with cryptographic identity attestations feeds machine learning models designed to detect subtle runtime anomalies and novel attack patterns. Formal coverage of MITRE ATT&CK technique T1497.003 (Virtualization/Sandbox Evasion: Time Based Evasion) institutionalizes these monitoring capabilities, enabling security operations centers (SOCs) to preempt complex AI threats.
- LLM-Driven Automated YARA Rule Generation: Breakthrough demonstrations at Black Hat USA 2026 unveiled large language models automatically generating explainable YARA detection rules using file DNA hashing techniques. This innovation equips security teams to rapidly develop and deploy signatures for polymorphic AI-generated malware, significantly accelerating response times.
- AI-Enhanced Cloud Security Automation (CIEM/CSPM): Integration of agentic AI tools—such as Anthropic’s Claude AI—with Cloud Infrastructure Entitlement Management (CIEM) and Cloud Security Posture Management (CSPM) platforms automates detection and remediation of cloud misconfigurations and privilege deviations. Dynamic enforcement of least privilege is crucial in AI-native cloud environments where manual governance cannot scale.
- Automated Compliance and Audit Pipelines: Platforms combining solutions like Wazuh SIEM with AI agents generate cryptographically verifiable penetration testing audit trails. These innovations reduce operational overhead for incident investigations and regulatory reporting. Educational initiatives such as Project 8: Automate Security Compliance on AWS with Lambda & Python and CNV - Protecting Your Application from Code to Cloud CNAPP provide practical frameworks for embedding continuous compliance into DevOps pipelines.
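The "cryptographically verifiable audit trail" idea in the last bullet reduces, at its core, to hash chaining: each log entry commits to the hash of its predecessor, so editing any earlier entry invalidates every later one. A minimal sketch (the event fields and agent names are hypothetical; real platforms would also sign the chain head):

```python
import hashlib
import json
import time

def append_entry(chain, event):
    """Append an audit event linked to the previous entry's hash."""
    prev_hash = chain[-1]["hash"] if chain else "0" * 64
    record = {"ts": event.get("ts", int(time.time())), "event": event, "prev": prev_hash}
    # Canonical JSON (sorted keys) so verification recomputes the identical bytes.
    record["hash"] = hashlib.sha256(
        json.dumps(record, sort_keys=True).encode()
    ).hexdigest()
    chain.append(record)
    return chain

def verify_chain(chain):
    """Recompute every link; a single edited or reordered entry fails."""
    prev = "0" * 64
    for rec in chain:
        if rec["prev"] != prev:
            return False  # broken linkage
        body = {k: rec[k] for k in ("ts", "event", "prev")}
        if hashlib.sha256(json.dumps(body, sort_keys=True).encode()).hexdigest() != rec["hash"]:
            return False  # entry contents were altered after the fact
        prev = rec["hash"]
    return True

chain = []
append_entry(chain, {"ts": 1, "action": "port_scan", "agent": "pentest-bot"})
append_entry(chain, {"ts": 2, "action": "report_generated", "agent": "pentest-bot"})
print(verify_chain(chain))                      # True
chain[0]["event"]["action"] = "nothing_to_see"  # tamper with history
print(verify_chain(chain))                      # False
```

Auditors only need the chain itself (plus a trusted copy of the final hash) to confirm that no entry was edited, inserted, or dropped after the fact.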
Operational Recommendations: Orchestrating Human-AI Collaboration in Hybrid SOCs
Effectively securing agentic AI ecosystems requires a balanced integration of human expertise and AI-driven automation:
- Embed Identity-First Controls Organization-Wide: Deploy continuous cryptographic identity attestation and ephemeral/JIT credentialing across AI agents, users, and cloud components to establish a resilient security foundation.
- Harden Agent Invocation and Runtime Isolation: Enforce strict sandboxing, GPU/memory governance, and comprehensive input validation to prevent unauthorized or malicious agent activations—particularly those exploiting calendar invites or chat inputs.
- Leverage AI-Augmented Detection and Response: Integrate agent-aware telemetry into machine learning detection pipelines to swiftly identify AI-specific attack patterns and runtime anomalies.
- Automate Governance and Compliance Workflows: Utilize AI-powered penetration testing coupled with cryptographic proof generation to automate compliance reporting and maintain audit readiness with reduced manual effort.
- Foster Human-AI Collaboration in SOCs: Combine human analytic judgment and intuition with AI automation to accelerate threat hunting, incident response, and regulatory compliance alignment, ensuring adaptive defense against evolving threats.
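The invocation-hardening recommendation above amounts to a deny-by-default policy gate evaluated before any agent is activated. A minimal sketch, in which the policy table, agent names, and channel labels are all hypothetical placeholders:

```python
from dataclasses import dataclass

# Hypothetical policy: which channels may activate which agents, and whether
# content originating from an external sender may trigger activation at all.
INVOCATION_POLICY = {
    "code-review-agent": {"channels": {"ci_pipeline", "cli"}, "allow_external": False},
    "scheduling-agent":  {"channels": {"chat"}, "allow_external": False},
}

@dataclass
class InvocationRequest:
    agent: str
    channel: str              # e.g. "calendar_invite", "chat", "ci_pipeline"
    from_external_sender: bool

def authorize_invocation(req):
    """Deny-by-default gate: every failure mode returns False."""
    policy = INVOCATION_POLICY.get(req.agent)
    if policy is None:
        return False  # unknown agent: no implicit trust
    if req.channel not in policy["channels"]:
        return False  # channel not registered for this agent (blocks calendar invites)
    if req.from_external_sender and not policy["allow_external"]:
        return False  # external content cannot activate the agent
    return True

print(authorize_invocation(InvocationRequest("code-review-agent", "ci_pipeline", False)))   # True
print(authorize_invocation(InvocationRequest("scheduling-agent", "calendar_invite", True))) # False
```

Because the default answer is "no," a newly discovered trigger channel (the calendar-invite vector, for instance) is blocked until someone explicitly adds it to the policy, rather than being implicitly trusted until someone notices the abuse.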
The Mozilla-Anthropic and OpenAI Codex Security Partnerships: AI as a Double-Edged Sword in Vulnerability Discovery
Recent high-profile collaborations underscore AI’s critical role in both exposing vulnerabilities and fortifying defenses:
- Mozilla-Anthropic Collaboration: Leveraging Anthropic’s Claude AI, Mozilla uncovered over 100 security vulnerabilities in Firefox—including 22 critical flaws—significantly accelerating patch deployment and browser hardening cycles. Mozilla’s Security Chief remarked, “Harnessing Anthropic’s advanced AI enables us to stay ahead of emerging threats, fortifying Firefox at a pace previously unattainable.” Anthropic’s CEO characterized this partnership as a “blueprint for industry-wide adoption” of AI-powered vulnerability research.
- OpenAI Codex Security’s Large-Scale Vulnerability Discovery: Building on these advancements, OpenAI Codex Security’s sweeping analysis of 1.2 million commits across key open source projects—such as GnuPG, GnuTLS, GOGS, PHP, and Chromium—has identified numerous critical vulnerabilities. Their findings highlight AI’s transformative capacity for proactive vulnerability identification at scale, emphasizing the necessity for continuous AI-assisted security auditing in open source ecosystems.
These initiatives illustrate the dual-use nature of agentic AI: while AI can amplify attackers’ capabilities, it also empowers defenders to detect and remediate vulnerabilities with unprecedented speed and scale.
Conclusion: Securing the Future of Agentic AI Through Identity-First Zero Trust and AI-Augmented Governance
The evolving threat landscape—spanning weaponized OAuth redirection, calendar/chat invocation exploits, persistent RCEs, autonomous AI botnets targeting CI/CD pipelines, and platform-level AI-assisted intrusions—demands a fundamental shift in security strategy.
Identity-first Zero Trust architectures, anchored in continuous cryptographic attestation, ephemeral privilege management, hardened runtime isolation, and living SBOM/AIBOM provenance, form the essential defense foundation. Organizations that adopt identity-centric principles, integrate AI-augmented detection, automate governance workflows, and foster human-AI collaboration will be optimally positioned to safeguard agentic AI ecosystems, comply with emerging regulations, and drive innovation in the AI-native era.
By relentlessly validating identity, minimizing privilege exposure, enforcing runtime isolation, and automating governance, the cybersecurity community is forging resilient, compliant, and innovation-empowered AI-native environments prepared to meet the challenges of 2026 and beyond.
Selected References and Resources
- Week in review: Weaponized OAuth redirection logic delivers malware, Patch Tuesday forecast
- Black Hat USA 2025 | Invoking Gemini for Workspace Agents with a Simple Google Calendar Invite (Video)
- Black Hat USA 2026 | LLMs-Driven Automated YARA Rules Generation with Explainable File Features & DNAHash (Video)
- Delinea Completes StrongDM Acquisition to Secure AI Agents with Continuous Identity Authorization
- ContextCrush Flaw Exposes AI Development Tools to Attacks
- Anthropic's Claude AI Uncovers Over 100 Security Vulnerabilities in Firefox
- OpenAI Codex Security’s Discovery of Critical Vulnerabilities in Major OSS Projects
- Australia Cyber Security Threats 2026: AI, Cloud, and Insider Risk Analysis | Lean Security
- The Quiet Lifetime Of A Cyber Weapon (Cyber Defense Magazine)
- T1497.003 Time Based Checks in MITRE ATT&CK Explained
- Project 8: Automate Security Compliance on AWS with Lambda & Python (Video)
- AI-Powered Penetration Test with Cryptographic Proof — Live Demo on Wazuh SIEM (Video)
- NIS2 in Croatia: Cybersecurity Law, Regulation, Controls, and Documents (Video)
- AI Agent Sandboxes: Securing Memory, GPUs, and Model Access (Video)
- The AI Exploit Engine Behind 500+ FortiGate Breaches Is Quietly Going Global Now (Video)
The dynamic interplay between agentic AI’s transformative potential and its security challenges makes identity-first Zero Trust and AI-augmented governance not just strategic options but imperatives for the future of secure, innovative AI-native enterprises.