Enterprise agents: Glasswing/Mythos cyber/visual/Claude Code/MCP/pricing/outage/OpenClaw/Kaggle evals, GLM-5.1/CORAL/Gemini/Copilot/Nutanix/Agentic-MME

Key Questions

What cybersecurity capabilities does Anthropic's Glasswing offer?

Glasswing detects zero-days in OS/FFmpeg with 72% chain detection and partners for fixes. It's a response to Mythos' dangers. It redefines AI's role in cybersecurity.

What is Claude Mythos and why limited?

Claude Mythos Preview excels at identifying thousands of zero-days but is limited due to cyberattack risks. Available in private preview on Vertex AI. Anthropic prioritizes safety.

What are Claude Code features?

Claude Code supports apps, terminal use, MCP, and PDF import via Weaviate Agent Skills. It faced an outage affecting thousands. Pricing adjustments impact access.

What benchmarks does GLM-5.1 lead?

GLM-5.1 scores 58.4% on SWE-Bench for long-horizon agentic coding. It's optimized for 600+ iterations. Tops open source leaderboards.

What is Kaggle's new initiative?

Kaggle launched Benchmarks Resource Grants for AI evals, providing compute and SDK. This supports rigorous evaluations. It signals focus on agent benchmarks.

What agentic platforms were announced?

Nutanix offers agentic hybrid multicloud; QoderWork is a desktop AI agent for real work. Karpathy's CORAL enables multi-agent discovery. Gemini MAIA and Copilot advance enterprise use.

What is the ROI for AI infrastructure per I&O?

Only 28% of AI projects fully pay off, per surveys. This underscores TCO and uptime challenges. Security and evals remain key concerns.

What are AgentHazard and Agentic-MME?

AgentHazard scores 73%; Agentic-MME evaluates multi-modal agents. They highlight enterprise agent progress. Ongoing focus on security and benchmarks.

Glasswing cyber zero-days (OS/FFmpeg,72% chains/partners)/visual reasoning; Claude Code apps/terminal/MCP/outage/pricing/OpenClaw; Kaggle benchmarks grants (evals compute/SDK); GLM-5.1 SWE-Bench 58.4%; Nutanix Agentic hybrid; QoderWork/Weaviate/Karpathy CORAL/Qwen SAM3/Gemini MAIA/Copilot/AgentHazard 73%/Agentic-MME/I&O 28% ROI. Ongoing TCO/security/evals/uptime/GLM/Kaggle.

Sources (108)

Updated Apr 8, 2026

Enterprise agents: Glasswing/Mythos cyber/visual/Claude Code/MCP/pricing/outage/OpenClaw/Kaggle evals, GLM-5.1/CORAL/Gemini/Copilot/Nutanix/Agentic-MME

Key Questions

What cybersecurity capabilities does Anthropic's Glasswing offer?

What is Claude Mythos and why limited?

What are Claude Code features?

What benchmarks does GLM-5.1 lead?

What is Kaggle's new initiative?

What agentic platforms were announced?

What is the ROI for AI infrastructure per I&O?

What are AgentHazard and Agentic-MME?

Anthropic’s Glasswing: The Quiet Bet That Could Redefine How AI Models Actually See the World

Kaggle launches Benchmarks Resource Grants for AI evaluation

@weaviate_io: PDF import just landed in Weaviate Agent Skills! Point Claude Code (or any agent) at a PDF, and it ...

@Scobleizer: Meet QoderWork — a desktop AI agent that doesn't just chat, it actually does your work. Evan leads ...

Anthropic limits Mythos AI rollout over fears hackers could use model for cyberattacks

GLM-5.1 Developer Guide: Long-Horizon Agentic Coding | Lushbinary

Anthropic's latest AI model identifies 'thousands of zero-day vulnerabilities' in 'every major operating system and every major web browser' — Claude Mythos Preview sparks race to fix critical bugs, some unpatched for decades

Anthropic says its most powerful AI cyber model is too dangerous to release publicly — so it built Project Glasswing

@Scobleizer reposted: 🚀The era of autonomous multi-agent discovery is arriving! @karpathy 🪸Excited t...

Anthropic debuts preview of powerful new AI model Mythos in new cybersecurity initiative

Claude Mythos Preview on Vertex AI | Google Cloud Blog

Getting Started with Claude Code - Dometrain

Microsoft TOS: Copilot is for 'entertainment purposes only,' not 'important advice'

Claude AI Goes Down for Thousands of Users, Downdetector Reports

@danshipper: gpt-5.4 up 8.9% in usage this week after OpenClaw gets banned in Claude subscriptions https://t.co/5...

@ClementDelangue: We keep saying we want open-source frontier agents. Fine. Then let’s build the dataset. @badlogicg...

Switching to Claude? Transfer Your ChatGPT History With This Easy Trick

Chinese OTAs deploy AI for global push

Google study finds LLMs are embedded at every stage of abuse detection

@rubenhassid: Everyone is talking about Claude in 2026. But almost no one has set it up like this: Set-up 1. Cr...

How to use the new ChatGPT app integrations, including DoorDash, Spotify, Uber, and others

Anthropic Puts a Price Tag on OpenClaw: What Claude’s New Paywall Means for AI Model Evaluation

Stop using Claude as just a chatbot—MCP changes everything

AgentHazard Benchmark Finds Computer-Use Agents Fail Safety Tests at High Rates – MegaOne AI

Claude MCP Explained: The Future of AI System Integration

Modder Uses Claude AI to Rewrite BIOS, Boot Bartlett Lake CPU

Clinical AI Launches MAIA™ Prescreening on Google Cloud Marketplace to Accelerate Clinical Trials

Anthropic tightens Claude access for third-party AI tools like OpenClaw

@_akhaliq: Agentic-MME What Agentic Capability Really Brings to Multimodal Intelligence? paper: https://t.co/...

Kairos Agent - Claude CLeak EXPOSES Secret Always ON AI Agent…

I asked Claude AI to Create a FULLY AUTOMATED BOT for me and it is ACTUALLY PROFITABLE

I Asked Claude Cowork About My Email. It Became a Running System

Axonis’ Partner Ecosystem to Accelerate Trusted AI Enterprise Adoption

Nanocode: The best Claude Code that $200 can buy in pure JAX on TPUs

DigitalOcean Deepens AI Agent Focus With Katanemo And Plano Acquisition

Claude Code Automated My Entire SEO Content Pipeline

How I Built an AI Agent That Runs My Entire Business | Claude Code Full System

Connect Claude To Top Thinkers (1 Book = 5 Skills)

I hit Claude’s new usage limits — and It changed how I use AI forever

Microsoft Builds Its Own AI Brain — and That Changes Everything for Google, OpenAI, and the Rest of Silicon Valley

Anthropic says Claude Code subscribers will need to pay extra for OpenClaw usage

Microsoft Is Going Multi-Model with Copilot. Does the Enterprise King ...

Claude's OpenClaw TERMINATION Explained! Time for Fully Local Ai?

OpenClaw + FREE Local AI | Run Qwen 3.5 27B on ANY PC 🤯

OpenClaw Without Claude? My New Plan

14 Things Anthropic Tells Claude Code NOT to Do | by Marco Kotrotsos | Apr, 2026 | Medium

Moonbounce Launches with $12M - VC News Daily

Google ADK Review: The Agent Framework for Gemini

OpenRouter Model Fusion

Sandbox Strategy Game for AI

AI Training Data Giant Mercor Is Reportedly Looking to Buy the Work You Did at Your Old Job

Mercor, a $10 billion AI startup that works with companies including OpenAI and Anthropic, confirms major data breach

Meta Halts Mercor Partnership After AI Training Data Breach

Meta Pauses Work With Mercor After Data Breach Puts AI Industry Secrets at Risk

A company that makes AI training data has been hit by a security breach. | The Verge

AI Startup Mercor, Which Works With Open AI and Anthropic, Confirms Data Breach

Moonbounce Raises $12M to Give Organizations Real-Time Control Over AI Behavior

Former Meta safety lead raises $12M to steer AI in real time

KKR’s AI Push And Record Fund Raise Contrast With Valuation Concerns

I Built a Production Instagram AI Agent Using Claude

Anthropic’s next model could be a ‘watershed moment’ for cybersecurity. Experts say that could also be a concern

Anthropic essentially bans OpenClaw from Claude by making subscribers pay extra

Anthropic leak sparks warnings over AI-driven cyber threats

Anthropic’s Claude Mythos Leak Is Bigger Than You Think

How to USE Claude Code for FREE with Ollama ( Local AI FULL Tutorial)

I Built a Tool That Runs Claude Code with ANY AI Model — GPT, Codex, Gemini, DeepSeek, Free Models

How to Build Real-World AI Agents with Qwen3.6-Plus | by Sebastian Buzdugan | Apr, 2026 | Medium

Everything you need to know about Moltbook, the 'Reddit for OpenClaw agents' that got acquired by Meta

Claude Leaks its Source Code… then Files Copyright Claim

Stop Paying $200 for Claude Code! 🛑 Get it for FREE with Ollama + Qwen + VS Code (The Truth)!

@omarsar0 reposted: NEW Stanford & MIT paper on Model Harnesses. Changing the harness around a ...