Burr and AgentScope 2.0 for Reliable AI Agents
- Apache Burr targets building reliable AI agents and applications
- AgentScope 2.0 provides APIs for LLM-powered agent development from Alibaba
-...

Created by Justin Hubbard
Latest AI research and applied updates on foundation models, multimodal vision, and safety
Explore the latest content tracked by AI Breakthrough Radar
The gap between public AI safety rhetoric and internal corporate responses is widening:
New research spotlights core transformer weaknesses:
BostonGene is deploying next-generation AI foundation models with advanced architectures to integrate multimodal oncology and immunology data, as highlighted in their upcoming presentation.
Apple waives cloud API costs for smaller developers to make AI experimentation cheaper, positioning this as a potential tactic to acquire developers and expand its AI ecosystem amid questions about hidden catches.
NVIDIA's Nemotron 3 marks a shift from stitched multimodal pipelines to native integration via a 30B-A3B MoE backbone handling text, image, video, and...
Anthropic withheld Mythos 5 due to biological and cybersecurity risks, releasing only Fable 5 with strict guardrails around the shared core model.
-...
Two new systems highlight the move from task-specific training to zero-shot, affordance-aware embodied models.
Google's DiffusionGemma stands out as an open experimental model extending text diffusion research to Gemma 4.
Three complementary techniques are emerging to remove bottlenecks in unified multimodal systems:
The 2026 International AI Safety Report concludes that reliable pre-deployment safety testing faces fundamental limits, underscoring the need for novel post-training evaluation designs in practice.
New frameworks reveal a clear shift toward self-organizing and delegating AI agents that tackle extended professional and research workflows without...
Two fresh arXiv papers target core RL bottlenecks in LLMs, promising more precise and stable training.
Three new methods mark the shift from frozen agent designs to systems that adapt their own harnesses and prompts at runtime.
AWS Bedrock now requires 30-day data retention for Mythos-class models like Fable 5, sending all traffic outside AWS boundaries to Anthropic. This...
Apple's third-generation foundation models center on privacy via Private Cloud Compute, with data never stored or shared.
A new Data Journalist Agent (Data2Story) acts as a virtual newsroom, turning raw data into evidence-grounded multimedia stories.
Google's Gemini 3.5 Live Translate delivers near real-time speech-to-speech translation across 70+ languages while preserving intonation, pacing, and...
Robotics systems must be secure, trusted, and resilient as they integrate into the economy. Miles Brundage's repost amplifies this governance priority for safe real-world deployment.
Decart's Oasis 3 generates hours of photorealistic, multi-camera driving environments in real time via API, targeting autonomous vehicle testing and...