DeepSeek V4's Efficiency Breakthroughs
- 1.6T-parameter open-source model rivals Opus and GPT-5 while activating just 49B parameters via a 256-expert MoE (9 experts active per token)
- Trained on Huawei...

Created by YiYi Jin
Real-world AI product announcements, models, and tools from industry leaders
Explore the latest content tracked by AI Launch Radar
EvoDS tackles static skills and context limits in LLM agents with Autonomous Skill Acquisition and Adaptive Context Compression, enabling...
Superfact solves the friction of exporting real work from AI chats by letting users type one command like "superfact me into a summary" in Claude,...
Microsoft positioned itself as an AI agents company at Build 2026 with three major moves reshaping enterprise tooling.
Zapier open-sourced its GTM agents as a GitHub repo of agent skills and pre-built automations, built on its platform.
Enterprise AI is moving away from flat-rate subscriptions toward usage-based pricing that mirrors cloud compute bills.
Claude now writes 80%+ of merged production code at Anthropic, driving 8x as many daily merges as in 2024.
Google's Gemma 4 12B delivers near-26B performance in a compact package that runs locally on 16GB RAM devices.
MMPO introduces Belief Entropy to deliver fine-grained supervision for memory policies in LLM agents, explicitly penalizing summaries that increase...
Glean now supports NVIDIA Nemotron 3 Ultra, giving enterprises an open-weight model option that delivers 91% of frontier LLM performance at lower cost...
Google's Gemma 4 12B processes images and text in one unified transformer stream, ditching separate vision encoders entirely.
Suno raised $400M to launch a new AI music model developed in direct partnership with the music industry establishment. This approach points to creative tools designed for collaboration with labels and rights holders rather than conflict.
A wave of capable open-weight releases now runs fully on single consumer GPUs, cutting cloud dependency for local AI builds.
NVIDIA's Nemotron 3 Ultra targets the shift from chatbots to long-running agents that plan, use tools, code, and maintain context across complex...
Enterprises must abandon single-cloud strategies for connected ecosystems as AI workloads spread across clouds, data centers, and the edge, making...