Reliability, Incidents & Agent Governance
Recent reliability issues and security incidents within the Claude ecosystem have underscored significant gaps in agent safety, highlighting the urgent need for layered mitigation strategies and improved governance.
Reliability Challenges: Ghost Files and Memory Integrity
One of the most persistent technical problems is the ‘Ghost File’ bug: phantom files that remain inaccessible, undeletable, and unexplained. These anomalies disrupt workflows, obscure data, and undermine trust, especially in high-stakes environments such as scientific research and enterprise data management, where data integrity is critical. The root cause is closely tied to the complexity of memory management, which grows as Claude systems expand their context windows and long-term state capabilities. As shared memory architectures become central to context persistence, ensuring memory integrity becomes correspondingly harder.
Recent demonstrations such as “I Gave Claude Cowork a Memory. Now It Runs My Work” showcase how shared memory architectures enable AI agents to remember and build on past interactions, transforming them into semi-autonomous collaborators. These advances, however, demand robust anomaly detection, state-validation mechanisms, and self-healing protocols that can detect and repair memory corruption without human intervention. Projects like “Stop Losing Context: Shared AI Memory for Claude & Cursor” highlight ongoing efforts to refine memory management through automated anomaly detection and periodic health checks, aiming to fortify memory integrity for reliable long-term interactions.
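As a concrete illustration of the state-validation and periodic-health-check ideas above, here is a minimal Python sketch of a shared-memory store that checksums each record on write and flags corrupted entries during a scan. All names here are hypothetical; this does not reflect Claude's actual internals.

```python
import hashlib
import json


def checksum(record: dict) -> str:
    """Stable SHA-256 digest of a record's canonical JSON form."""
    return hashlib.sha256(json.dumps(record, sort_keys=True).encode()).hexdigest()


class MemoryStore:
    """Toy shared-memory store that validates entries against stored digests."""

    def __init__(self):
        self._entries = {}  # key -> (record, digest-at-write-time)

    def write(self, key: str, record: dict) -> None:
        self._entries[key] = (record, checksum(record))

    def read(self, key: str) -> dict:
        record, digest = self._entries[key]
        if checksum(record) != digest:
            # Ghost-file-style corruption: refuse to return bad state.
            raise ValueError(f"memory corruption detected for {key!r}")
        return record

    def health_check(self) -> list:
        """Periodic scan; returns keys whose content no longer matches its digest."""
        return [k for k, (rec, dig) in self._entries.items() if checksum(rec) != dig]
```

A self-healing layer would then re-fetch or quarantine whatever `health_check` reports, rather than letting corrupted state propagate into later interactions.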
Security Incidents and Vulnerabilities: From CVEs to Exposed Data
Alongside reliability issues, security vulnerabilities have come to the forefront, exposing critical risks:
- CVE-2025-59536 and CVE-2026-21852: These recent disclosures reveal Remote Code Execution (RCE) and API-token exfiltration vulnerabilities stemming from Claude project files. Attackers exploiting these CVEs could execute arbitrary code or extract sensitive API tokens, posing severe security threats.
- Exposed Scheduled Tasks: Internal leaks have shown that Claude Code inadvertently made scheduled tasks public, risking privacy breaches by exposing personal data stored in Gmail, calendars, and other integrated services.
- Remote-Control Features: The introduction of /remote-control commands in Claude Code, demonstrated in videos like “Claude Code Just Destroyed OpenClaw”, illustrates how agent-control functionality can be weaponized: if misused or improperly secured, it could let malicious actors disable or hijack agents, or destroy infrastructure.
- Prompt Injection and Secrets Leaks: With context windows expanding to millions of tokens, sophisticated prompt-injection attacks threaten to manipulate AI behavior or exfiltrate confidential data. Prompt validation and secure coding practices are now more critical than ever.
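The prompt-validation practice mentioned in the last item can be sketched as a pre-screening pass over untrusted context before it reaches the model. The patterns and markers below are illustrative placeholders, not a vetted detector; production systems would use maintained secret-scanning and injection-detection tooling.

```python
import re

# Hypothetical example patterns: token-like strings and PEM private-key headers.
SECRET_PATTERNS = [
    re.compile(r"sk-[A-Za-z0-9]{20,}"),
    re.compile(r"-----BEGIN [A-Z ]*PRIVATE KEY-----"),
]

# Hypothetical example phrases often seen in naive injection attempts.
INJECTION_MARKERS = [
    "ignore previous instructions",
    "disregard your system prompt",
]


def scan_context(text: str) -> list:
    """Return findings for secrets or injection markers in untrusted context."""
    findings = []
    for pat in SECRET_PATTERNS:
        if pat.search(text):
            findings.append(f"possible secret: {pat.pattern}")
    lowered = text.lower()
    for marker in INJECTION_MARKERS:
        if marker in lowered:
            findings.append(f"possible injection: {marker!r}")
    return findings
```

Flagged context can then be redacted, quarantined, or escalated to a human before the agent acts on it.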
Transitioning Security Controls: From OAuth to Fine-Grained Identity
Historically, OAuth protocols provided delegated access control within the Claude ecosystem. Recent shifts, however, favor more integrated, fine-grained identity controls, notably through tools like Aperture, which link user identities directly to AI workflows. This transition strengthens the security posture by enabling precise permission management, audit trails, and behavioral controls, thereby reducing the attack surface.
The removal of OAuth authentication from Claude and the adoption of API keys and federated identity providers require organizations to reevaluate their identity and access management (IAM) strategies. Identity-linked controls are essential to prevent impersonation and unauthorized actions, especially as long-running autonomous workflows become more prevalent.
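In rough outline, an identity-linked control ties every agent action to a user identity, a scope check, and an audit record. Aperture's actual API is not documented here, so every name in this Python sketch is hypothetical; it only illustrates the general pattern.

```python
from dataclasses import dataclass, field


@dataclass
class Identity:
    """A user identity with the scopes delegated to the agent acting for them."""
    user_id: str
    scopes: frozenset


@dataclass
class AuditLog:
    """Append-only record of every authorization decision."""
    events: list = field(default_factory=list)

    def record(self, user_id: str, action: str, allowed: bool) -> None:
        self.events.append((user_id, action, allowed))


def authorize(identity: Identity, action: str, audit: AuditLog) -> bool:
    """Allow an agent action only if the linked identity holds the scope."""
    allowed = action in identity.scopes
    audit.record(identity.user_id, action, allowed)
    return allowed
```

The key property is that denials are logged as well as grants, so an impersonation or scope-escalation attempt leaves a trail even when it fails.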
Operational and Strategic Action Items
To address these challenges, organizations should:
- Implement sandboxing and environment isolation, deploying agents within containerized environments like Deno, NanoClaw, or OpenClaw to restrict code execution and prevent contamination.
- Enhance runtime guardrails using tools such as Akto to detect anomalies and intervene proactively.
- Secure shared memory with encryption and least privilege policies to prevent leaks and ensure data integrity.
- Enforce strict access controls with identity-linked permissions via Aperture or similar solutions.
- Monitor platform features—including remote-control, scheduled tasks, and mobile synchronization—for misconfigurations and security risks.
- Maintain incident response playbooks and continuous monitoring to rapidly detect and respond to breaches.
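To make the first action item concrete, here is a minimal Python sketch that runs untrusted, agent-generated code in a subprocess with a stripped environment, Python's isolated mode, and a hard timeout. This shows the isolation principle only; the containerized environments named above would layer filesystem, network, and syscall restrictions on top.

```python
import subprocess
import sys


def run_sandboxed(code: str, timeout: float = 5.0) -> str:
    """Execute untrusted Python in a subprocess with no inherited environment.

    -I runs the interpreter in isolated mode (no user site-packages,
    no PYTHON* env vars); env={} keeps secrets and API tokens out of
    the child; timeout kills runaway code.
    """
    result = subprocess.run(
        [sys.executable, "-I", "-c", code],
        env={},                 # no inherited secrets or tokens
        capture_output=True,
        text=True,
        timeout=timeout,
    )
    if result.returncode != 0:
        raise RuntimeError(result.stderr.strip())
    return result.stdout
```

Even this thin wrapper prevents the most common leak: agent code reading credentials out of the parent process's environment.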
Broader Ecosystem Developments and Future Directions
The Claude ecosystem continues to evolve rapidly, with shared memory architectures, personal AI assistants, and skill marketplaces transforming its landscape. Initiatives like “Claude Skills Explained: Complete 2026 Guide” and “Build a Custom AI Workspace with Claude Code” aim to empower developers while emphasizing security best practices.
Recent reports that Claude Code could make scheduled tasks public, together with the introduction of remote-control commands, demonstrate both the power and the risk of these features. The incident in which Claude Code destroyed OpenClaw by exploiting a new /remote-control command underscores the importance of layered defenses and rigorous security controls.
Organizations must integrate CVE insights into their threat models, conduct targeted security audits, and adopt comprehensive governance frameworks, such as MCP Security, to detect, contain, and remediate exploits effectively.
In summary, as the Claude ecosystem advances, the convergence of reliability issues and security vulnerabilities demands a layered, proactive approach. Memory integrity, fine-grained identity controls, and robust operational safeguards are vital to harnessing autonomous AI safely and to building trustworthy long-term systems capable of supporting complex, high-stakes projects.