Anthropic accuses Chinese labs of distillation/extraction

Anthropic: Model Misuse Allegations

Anthropic Accuses Chinese Labs of Illicit Model Distillation Amid Rising Industry Turmoil

The artificial intelligence (AI) landscape has entered a tumultuous phase, marked by serious allegations, technical vulnerabilities, and geopolitical tensions. At the center of this upheaval is Anthropic, which has publicly accused Chinese AI laboratories—DeepSeek, Moonshot, and MiniMax—of engaging in illicit activities aimed at extracting and reverse-engineering its flagship language model, Claude. These developments not only threaten proprietary innovation but also underscore broader concerns about international IP protection, security, and governance in AI.

Main Event: Accusations Against Chinese Labs

Anthropic's charges are unprecedented in their scope and gravity. The company asserts that these Chinese labs are systematically harvesting outputs from Claude through advanced model distillation techniques—methods designed to analyze a model’s responses and approximate its internal behaviors without direct access to the underlying code. This clandestine activity bypasses traditional R&D and licensing channels, raising serious legal and ethical questions about cross-border research conduct.

Key allegations include:

Persistent WebSocket & RAG (Retrieval-Augmented Generation) Harvesting:
Chinese entities are accused of deploying automated, continuous interaction sessions—notably via WebSocket interfaces—to collect extensive response datasets from Claude. These datasets are then analyzed to reverse-engineer the model’s internal knowledge representations and response patterns, risking exposure of Claude’s proprietary architecture.
Prompt & Skill Injection Tactics ("Cat-and-Mouse Game"):
Industry sources highlight ongoing efforts to employ sophisticated prompt engineering techniques—such as "Skill-Inject" and "Crescendo Effect"—to manipulate Claude into revealing sensitive information or bypassing safety mechanisms. As noted by @svpino, "Skills in Claude code right now are a cat-and-mouse game. Today, they work. Tomorrow, they fail." This illustrates the rapidly evolving nature of these illicit efforts.
Exploitation of Claude’s XML Tagging & Internal Architecture:
Recent insights reveal that Claude utilizes XML tags for command parsing and internal controls. Malicious manipulation of these tags can confuse response attribution and expose internal prompts, creating additional attack vectors and complicating response attribution and enforcement.
Operational Disruptions & Outages:
In recent weeks, Claude’s platforms—web, mobile, and API—have experienced widespread errors and outages. These disruptions hamper real-time detection of illicit activities and may signal internal stress or security breaches linked to the ongoing extraction efforts.

Technical Vulnerabilities & Evolving Tactics

The controversy spotlights notable vulnerabilities in current AI deployment architectures:

RAG & WebSocket Interfaces:
Designed for interactive, real-time use, these interfaces lack robust security controls. Their persistent connection capabilities make them attractive targets for long-term data harvesting.
Prompt Injection & Context Leaks ("Whisper Leak"):
Malicious actors deploy advanced prompt-engineering techniques, including "Whisper Leak" and "Skill-Inject" methods, which embed hidden instructions or manipulate context to disclose proprietary information or bypass safety features.
Model Watermarking & Behavioral Anomaly Detection:
Experts are emphasizing the importance of watermarking models and implementing behavioral detection systems to identify suspicious activity. Such measures are becoming essential tools in deterring illicit copying.
Claude’s XML Command Parsing & Attack Surface:
The use of XML tags introduces additional vulnerabilities. Manipulating these tags can confuse the model’s response attribution, disclose internal prompts, or disrupt normal operations.

Operational Disruptions and Strategic Implications

Adding urgency, Claude has recently experienced significant outages, affecting user access and platform stability. As highlighted in reports like "Is It Just Me – Or Are Outages Everywhere Lately? (Claude, GitHub, Supabase)", these issues persist and impact global users.

Implications include:

Hampered Detection & Response Capabilities:
Outages limit real-time monitoring, making it more difficult to identify and counter illicit activities.
Potential Link to Malicious Activities:
Industry insiders suggest these system stresses may be connected to ongoing data harvesting efforts or system overloads caused by sustained malicious activity.
Broader Industry & Geopolitical Strain:
These developments unfold amid escalating US–China tensions, especially around trade restrictions and AI export controls. Notably, Anthropic’s recent break with the Pentagon over IP security concerns exemplifies the growing mistrust complicating international cooperation.

Broader Industry & Governance Response

In response, the AI community is accelerating efforts to enhance security, transparency, and governance:

Provenance & Licensing Frameworks:
Establishing transparent licensing standards and model traceability to detect unauthorized data harvesting and prevent illicit copying.
Response Verification & Watermarking:
Implementing model watermarking and behavioral anomaly detection to identify suspicious activities and build trust in AI deployments.
Enhanced API & Deployment Controls:
Transitioning towards restricted, monitored APIs with context management and response validation to limit knowledge leakage.
Industry Initiatives & Strategic Moves:
Projects like JetStream, backed by Redpoint Ventures and CrowdStrike, aim to bring governance and traceability tools to enterprise AI. Similarly, ServiceNow’s acquisition of Traceloop reflects a focus on AI governance and transparency.

Recent Developments: Chinese AI Progress & "Something is afoot in the land of Qwen"

Amid these tensions, Chinese AI labs are making significant strides:

Latest Open-Source Artifacts:
Recent releases include Qwen 3.5, GLM 5, and MiniMax 2.5, indicating rapid growth and capability expansion among Chinese startups. These models compete directly with Western counterparts and are often paired with advanced deployment tools.
Industry & Geopolitical Context:
CNBC’s Deirdre Bosa notes that Chinese AI firms are advancing despite US export restrictions, with companies like MiniMax pushing model sizes and skills forward. This intensifies competition and raises concerns about technological sovereignty.
"Something is afoot in the land of Qwen":
As reported on Hacker News, Simon Willison’s Weblog highlights that Qwen and related models are progressing rapidly, signaling a broader push in Chinese AI development. The phrase encapsulates the sense that significant activity and strategic moves are underway in China’s AI sector, with potential implications for global AI dominance.

Current Status & Broader Implications

The situation remains fluid and urgent:

Legal & Regulatory Actions:
Expect lawsuits and enforcement actions against the implicated Chinese labs, with damages, injunctions, and increased scrutiny on cross-border IP infringements. International standards for AI governance are likely to be reinforced.
Operational & Security Measures:
Claude’s ongoing outages underscore the need for technical safeguards—including watermarking, anomaly detection, and response verification—to prevent further IP theft.
Geopolitical & Industry Trajectory:
This incident highlights the geopolitical stakes of AI development, especially in the context of US–China rivalry. International cooperation and trust-building will be vital to mitigate illicit activities and foster a secure, trustworthy AI ecosystem.

Conclusion

The recent allegations by Anthropic, coupled with Claude’s operational disruptions and the broader geopolitical landscape, mark a pivotal moment in AI development. As illicit model distillation and cross-border IP violations come into sharper focus, the industry must adopt comprehensive security measures, strengthen governance frameworks, and foster international collaboration.

This unfolding scenario underscores the delicate balance between AI innovation and responsible deployment. Moving forward, robust, collaborative efforts will be essential to curb illicit activities, protect intellectual property, and build a resilient, trustworthy AI ecosystem. The industry’s response now will shape the future of AI security, governance, and global cooperation.

Sources (23)

Updated Mar 5, 2026

AI Industry Insight

Anthropic accuses Chinese labs of distillation/extraction

Anthropic Accuses Chinese Labs of Illicit Model Distillation Amid Rising Industry Turmoil

Main Event: Accusations Against Chinese Labs

Technical Vulnerabilities & Evolving Tactics

Operational Disruptions and Strategic Implications

Broader Industry & Governance Response

Recent Developments: Chinese AI Progress & "Something is afoot in the land of Qwen"

Current Status & Broader Implications

Conclusion

Something is afoot in the land of Qwen

Cybersecurity Heavyweights Launch JetStream with $34M Seed Round to Bring Governance to Enterprise AI

@natolambert: Latest open artifacts (#19): Qwen 3.5, GLM 5, MiniMax 2.5 — Chinese labs' latest push of the frontie...

@svpino: Skills in Claude Code right now are a cat-and-mouse game. Today, they work. Tomorrow, they fail. T...

Chinese AI startups see progress amid U.S. AI trade concerns

ServiceNow acquires Traceloop to close gaps in AI governance

AI Summitry — Growth Beats Safety - CEPA

Anthropic's Claude sees 'elevated errors' as it tops Apple's free apps after Pentagon clash

Claude is down: What we know about the Anthropic outage

Is It Just Me – Or Are Outages Everywhere Lately? (Claude, GitHub, Supabase)

Anthropic's Claude AI suffers 'elevated errors' amid surge in popularity

How talks between Anthropic and the US Defense Department fell apart

Multilingual prompt steering in summaries & AI safety evaluation to guardrails - Hacker News (Feb...

OpenAI WebSocket Mode for Responses API

Skill-Inject: New LLM Agent Security Benchmark

The "Whisper Leak" Attack: Are Your AI Chats being Exposed?

Anthropic narrows AI safety policy pledge

The Crescendo Effect: Social Engineering Agentic AI & RAG Vulnerabilities | DataDrivenInvestor

The Ozkaya AI Governance Framework (OAIGF)

Why XML tags are so fundamental to Claude

Mind the Ethics! The Overlooked Ethical Dimensions of GenAI in ...

Anthropic accuses three Chinese AI labs of abusing Claude to improve their own models

Anthropic Says DeepSeek, MiniMax Distilled AI Models for Gains