Frontier Model Watch

7h ago

Claude Code Security Sparks Cybersecurity Stock Plunge

Anthropic's Claude Code Security launch on Feb 20 triggered violent fluctuations and falls in major US cybersecurity stocks, as its reasoning-based vuln detection—simulating human researchers beyond rule-based SAST—threatens traditional tools.

Insights into Claude Code Security: A New Pattern of Intelligent Attack and Defense

securityboulevard.com

Insights into Claude Code Security: A New Pattern of Intelligent Attack and Defense

7h ago

Anthropic Acquires Vercept to Boost Claude's Agentic Edge

Strategic acquisition accelerates Claude’s computer-use capabilities for human-level software proficiency.

Vercept expertise: Advanced AI...

Claude AI maker Anthropic acquires Vercept

7h ago·

yourstory.com

7h ago

Anthropic-Pentagon Clash Exposes AI Warfare Governance Risks

Key tensions in DoD's Claude procurement:

Anthropic bars Claude from mass surveillance of US citizens and fully autonomous weapons; DoD deems...

The Pentagon/Anthropic Clash Over Military AI Guardrails

opiniojuris.org

The Pentagon/Anthropic Clash Over Military AI Guardrails

7h ago

13h ago

Frontier Model Watch · Feb 26 Daily Digest

Anthropic Safety Policy Shifts

🔥 Dropped Training Halt Pledge: Anthropic has dropped a pledge from its Responsible Scaling Policy to halt...

16h ago

Token Games: AI Duel Benchmark Exposes Frontier Model Limits

Token Games introduces puzzle duels where LLM pairs alternate proposing and solving Python puzzles for boolean-true verification.

Tackles high...

16h ago

Claude Code Remote Control Boosts Mobile Coding Edge Over Gemini

Remote control launch: Claude Code now controllable from phone; 4-step setup demoed, ideal for on-the-go projects.
Coding showdown: Claude Opus...

16h ago

Anthropic Dials Back Safety Pledges as Claude Opus 4 Sabotage Risks Stay Low

Policy Shift: Anthropic dropped pledge to halt training without full safeguards, citing competitive pressures from rivals.
Technical Contrast:...

16h ago

MIT RLMs Enable 10M Token Processing via Recursion, Not Bigger Windows

Paradigm shift: MIT CSAIL's Recursive Language Models (RLMs) treat data as external files, letting LLMs write code to search surgically—instead of...

16h ago

Claude Models Hit Vertex AI for Expert Agentic Power

Anthropic's Claude models now live on Google Vertex AI, enabling KI-Agents to tackle complex, multi-stage tasks with precision.

Key strengths:
-...

Claude-Modelle von Anthropic | Generative AI on Vertex AI

16h ago·

docs.cloud.google.com

16h ago

In-Context Probing Cracks Fine-Tuned LLMs Like LLaMA-3

New vulnerability in fine-tuned LLMs: In-Context Probing (ICP) extracts private training data with 94% accuracy using simple prompts—no reference...

1d ago

Anthropic Softens Safety Halt Pledge, Eroding Frontier AI Trust

Key shift in Anthropic's RSP undermines industry safety standards:

From hard commitments to flexibility: No longer automatic halts on catastrophic...

Anthropic Quietly Abandons Its Most Important Safety Promise — And the AI Industry Is Watching

webpronews.com

Anthropic Quietly Abandons Its Most Important Safety Promise — And the AI Industry Is Watching

1d ago

LLM Firewalls: Essential Safeguards for Frontier Model Deployments

Practical protections for operational LLMs: monitor/filter inputs, manage interactions, block data leaks.

Three layers: Prompt firewall blocks...

LLM firewalls emerge as a new AI security layer | TechTarget

techtarget.com

LLM firewalls emerge as a new AI security layer | TechTarget

1d ago

Claude Code Remote Control: Cross-Device Terminal Handoffs Boost Dev Productivity

Anthropic's Claude Code Remote Control lets developers start terminal tasks on one machine and seamlessly continue across devices, unlocking true coding continuity.

Claude Code Remote Control Launch: Seamless Terminal Handoffs Across Devices [2026 Analysis]

1d ago·

blockchain.news

1d ago

Frontier Model Watch · Feb 25 Daily Digest

Policy and Governance Updates

🔥 Responsible Scaling Policy 3.0: Anthropic is releasing the third version of its Responsible Scaling Policy, a...

1d ago

EBMs Crush Frontier LLMs on Single-GPU Efficiency

Kona EBM beats ChatGPT, Claude, DeepSeek on complex reasoning task for $4 compute vs. $15,000 on frontier LLMs
Reasons in abstract latent...

1d ago

Anthropic Exposes Chinese Labs' Massive Claude Theft Operation

Industrial-scale copying: Anthropic accuses DeepSeek, Kimi/Moonshot AI, and MiniMax of using ~24,000 fake accounts for 16M+ Claude interactions to...

1d ago

Pentagon Ultimatum: Drop Claude Weapon Limits or Lose Contract

Friday deadline: Defense Sec. Pete Hegseth demands Anthropic remove Claude restrictions on autonomous weapons and surveillance, or risk losing...

Hegseth Demands Anthropic Drop AI Weapon Limits or Lose Pentagon Contract

substack.com

Hegseth Demands Anthropic Drop AI Weapon Limits or Lose Pentagon Contract

1d ago

Claude AI Tools Target IBM's COBOL Legacy, Stock Slips

Claude disrupts IBM: New AI-powered tools target old COBOL systems, causing IBM stock to slip.

Legacy modernization at risk: Impacts deals for...

2d ago

Anthropic's Claude Blitzes Enterprise with Office Integrations and Plugins

Anthropic escalates enterprise push, embedding Claude in Excel, PowerPoint, Slack for seamless workflows—challenging Microsoft Copilot and OpenAI.

-...