Open-Weight Wave: Gemma, Sonic-3.5, Kimi K2.6 Advance Language, Speech & Coding
Three fresh open-weight releases highlight a clear trend toward specialized, high-efficiency models across modalities.
- Gemma 4 spans 2B–31B sizes...

Created by Theo
Latest open-source AI models, tools, benchmarks, and commercial updates from startups
Explore the latest content tracked by Open Source AI Digest
Q-ARVD introduces frame-weighting and outlier-aware dual-scale quantization to handle ARVD-specific issues like exponentially skewed frame sensitivity...
LM Studio 0.4.14 now includes MTP availability, a direct update for users running compatible models.
TerminalWorld introduces a scalable engine that auto-generates 1,530 tasks from 80,870 real recordings across 18 categories, exposing frontier agents'...
DeepSeek V4 rethinks frontier model design from scratch rather than scaling V3.
SCRL decomposes reference reasoning chains into verifiable subproblems, fixing the final one as the original task to convert partial progress into...
GenEvolve introduces a self-evolving framework that models image generation as tool-orchestrated trajectories, comparing multiple attempts to distill...
Together AI has expanded its on-demand GPU clusters and Dedicated Endpoints with a thousand H100s and H200s, adding capacity to meet rising AI workload demand.
TrueFoundry AI Gateway tests show smart routing between GLM-5.1 and Claude Opus 4.7 outperforms single-model strategies.
A repost by Jeremy Howard flags that Gated DeltaNet-2 replicates RWKV-7's DPLR recurrence almost exactly, without credit, raising academic integrity concerns for open-source AI norms.
Two new tokenization methods based on linear programming just appeared on arXiv, both reporting SOTA results. Researchers optimizing tokenizers may want to evaluate them for potential efficiency gains in NLP workflows.
DeepSeek-V4-Pro now costs 1/4 of its original price, unlocking impressive agentic workflows that builders can prototype affordably. This drops...
PyTorch 2.12 now ships the mark_kernels context manager for labeling CUDA graph kernels in profiles, moving this capability out of nightlies. The...
DeepSeek positions its new Thinking with Visual Primitives method as a potential game changer for multimodal reasoning, with the associated paper now available for technical exploration.
GitHub Copilot for Eclipse is now open source under the MIT license, giving developers direct insight into how the AI assistant integrates with one of the longest-established IDEs.
Hugging Face's Agents Course shows developers how to equip LLMs with tools through precise system prompt engineering.
Gated DeltaNet-2 decouples erase and write operations in linear attention, delivering superior performance over KDA and Mamba-3 at the 1.3B scale....
Commenters indicate the rumored model is likely a Qwen Max-class release, and Qwen has not historically released Max-series models as open weights. This points to continued closed development for advanced variants.
Hugging Face just released a fully open-source humanoid robot you can build for $2,500, a move aimed at lowering the cost barrier in robotics. Advocates argue the field is too complex for any one group to solve alone, which makes open-source collaboration key to progress.