LLM Release Radar · Mar 27 Daily Digest
New Open-Source Models
- 🔥 Chroma Context-1: Chroma introduced Context-1, a 20B parameter agentic search model with retrieval performance...

Created by Book Cover Non Judge
Latest LLM releases, context windows, pricing, benchmarks, and licensing details
Explore the latest content tracked by LLM Release Radar
TurboQuant breakthrough for local AI:
Google launched Gemini 3.1 Flash Live, designed for real-time voice and vision agents with ultra-low latency and more natural conversations.
Key...
Japan's pure full-scratch LLM breakthrough:
Google Research's TurboQuant slashes LLM KV cache memory by at least 6x with zero accuracy loss.
Practical wins for model testers:
Qwen3.5 leads open models for practical local agents:
IBM and partners have contributed the llm-d project to CNCF, boosting open-source AI infrastructure for scalable LLM ops.
MiniCPM delivers giant-level performance on everyday hardware. Key practical details for local runs:
Ai2 launched MolmoWeb, an open-source visual AI agent on Molmo 2 (4B/8B params) for browser control—free weights, data, code (soon) for local/cloud...
Chroma study: 18 models show bigger context windows degrade performance every time.
WISC strategies boost agents:
Rising actionable guides simplify local open-source LLMs:
Yandex's Alice AI LLM, Russia's next-gen proprietary model, completed a full training cycle on its own data and infrastructure—initialized from custom base. Key peek into Russia's advanced, self-reliant AI stack.