AI Model Release Tracker

2h ago

AI Model Release Tracker · Jul 17 Daily Digest

Flagship Open Model Releases

🔥 Kimi K3: Moonshot AI released Kimi K3, a 2.8T-parameter open-weights multimodal model with 1M context that leads...

gigazine.net

GPT-5.6 Solに匹敵する中国製AIモデル「Kimi K3」が登場

3h ago

Kimi K3 Challenges US Leaders with Open Weights and Strong Benchmarks

Moonshot AI released Kimi K3, a 2.8-trillion-parameter open model with 1M context and native multimodal support, featuring architectural advances for...

Chinese startup Moonshot AI unveils Kimi model it says rivals OpenAI, Anthropic

cnbc.com

Chinese startup Moonshot AI unveils Kimi model it says rivals OpenAI, Anthropic

3h ago

Gemini 3.5 Flash Delivers 1M Context and Sub-Second Latency

Gemini 3.5 Flash launches as a high-speed multimodal model optimized for agentic workflows and complex coding.

1M-token context processes hour-long...

Gemini 3.5 Flash: 1M Context & Sub-Second Latency

automatio.ai

Gemini 3.5 Flash: 1M Context & Sub-Second Latency

3h ago

6h ago

WanSong v1.0: Pure Diffusion for Controllable 5-Minute Songs

WanSong v1.0 delivers a pure diffusion-based model that generates high-fidelity, multilingual songs up to 5 minutes while outputting dual stems in a...

arxiv.org

WanSong v1.0 Technical Report

6h ago

New Benchmark Targets Gaps in MLLM-as-Judge Evaluation

MLLMs are emerging as judges for precise multimodal evaluations, yet existing benchmarks organized by task types overlook core judgment capabilities...

Advancing Multimodal Judge Models through a Capability ...

6h ago·

arxiv.org

6h ago

VideoChat3: Fully Open 4B Video MLLM Released

VideoChat3 delivers a fully open 4B video MLLM that outperforms prior open-source models on general, long-form, and streaming benchmarks.
-...

VideoChat3:Fully Open Video MLLM for Efficient and ...

6h ago·

arxiv.org

6h ago

Kimi K3 Signals Open Models Have Caught Frontier Proprietary Rivals

Moonshot AI's 2.8T-parameter Kimi K3 shows open-weight models have reached parity with US frontier systems, topping benchmarks in coding and browsing...

Kimi K3 shows open AI models have finally caught up with proprietary US-based rivals

neowin.net

Kimi K3 shows open AI models have finally caught up with proprietary US-based rivals

6h ago

VideoChat3: Fully Open 4B Video MLLM Outperforms Larger Models

VideoChat3 introduces a fully open video-centric MLLM with 4B parameters that delivers strong generalization across general, long-form, and streaming...

VideoChat3: Fully Open Video MLLM for Efficient and Generalist Video Understanding

arxiv.org

VideoChat3: Fully Open Video MLLM for Efficient and Generalist Video Understanding

6h ago

MCPEvol-Bench Exposes Need for Dynamic Tool Evolution Benchmarks

Existing MCP-based benchmarks overlook continuous tool interface evolution, producing flawed evaluations of LLM agent adaptability. MCPEvol-Bench...

MCPEvol-Bench: Benchmarking LLM Agent Performance ...

6h ago·

arxiv.org

11h ago

DeepSeek V4 Pro (Max) vs GPT-5.6 Luna: Math and Cost Wins

GPT-5.6 Luna leads the overall aggregate (81 vs 76) across 20 shared benchmarks, with big edges in knowledge (92.3 vs 60.4) and agentic tasks like...

benchlm.ai

DeepSeek V4 Pro (Max) vs GPT-5.6 Luna

11h ago

Sol's Vision Gains Unlock Practical UI Agents and 3D Viz

Sol's major gains in object detection and counting turn OpenAI's highlighted UI agents and detailed 3D visualizations from demos into practical tools....

GPT 5.6 Sol is the best "vision" model OpenAI ever released

blog.roboflow.com

GPT 5.6 Sol is the best "vision" model OpenAI ever released

11h ago

Inkling: Unhyped US Contender for Enterprise Fine-Tuning

Thinking Machines Lab's Inkling arrives as a 975B MoE open-weights model optimized for enterprise customization rather than raw leaderboard...

U.S. Unhyped New Open-Source SOTA Model Inkling

eu.36kr.com

U.S. Unhyped New Open-Source SOTA Model Inkling

11h ago

Kimi K3 Shatters Scale Records, Upends US Lead

Moonshot's Kimi K3 arrives as the largest open-weights model at 2.8 trillion parameters, set for public release July 27. Its benchmarks place it just...

China’s Moonshot throws down the gauntlet with Kimi K3, the world’s largest open-weights model

siliconangle.com

China’s Moonshot throws down the gauntlet with Kimi K3, the world’s largest open-weights model

11h ago

Boogu-Image-0.1 Matches Closed-Source Image Models at Low Cost

Boogu-Image-0.1 is a competitive Apache-2.0 open-source model family (Base, Turbo, Edit) for unified image generation and editing that matches closed-source systems using only 208M images and ~$400K compute.

15h ago

GPT-Red Automated Red-Teaming Powers GPT-5.6 Sol Safety

GPT-Red's self-play RL training generated diverse prompt injection attacks that trained GPT-5.6 Sol, slashing its vulnerability rates to ~3.8% for...

OpenAI has announced 'GPT-Red,' an AI model that ...

gigazine.net

OpenAI has announced 'GPT-Red,' an AI model that ...

15h ago

Inkling Debuts as Top US Open-Weight Multimodal Challenger

Inkling (975B total/41B active) is Thinking Machines' first open-weights multimodal MoE, pretrained on 45T tokens across text/images/audio/video...

[AINews] Thinky's Inkling: 975B-A41B multimodal, new best American Apache 2.0 open model (with Inkling-Small, 276B-A12B)

latent.space

[AINews] Thinky's Inkling: 975B-A41B multimodal, new best American Apache 2.0 open model (with Inkling-Small, 276B-A12B)

15h ago

19h ago

Gemini 3.5 Pro: Google's Rebuilt Flagship Awaits Launch

Google has confirmed Gemini 3.5 Pro is internally deployed as the successor to Gemini 3.1 Pro, positioned above the May 2026 Gemini 3.5 Flash release...

What Is Gemini 3.5 Pro? Google's 2M-Context Flagship

kie.ai

What Is Gemini 3.5 Pro? Google's 2M-Context Flagship

19h ago

US Open-Weight Frontier Model Challenges Chinese Lead

Thinking Machines Lab's Inkling release supplies Western enterprises a US-developed alternative to dominant Chinese open-weight models like DeepSeek...

Inkling Ships: Murati's Lab Puts Largest US Open-Weight AI on Hugging Face

techtimes.com

Inkling Ships: Murati's Lab Puts Largest US Open-Weight AI on Hugging Face

19h ago

Xiaomi, Nvidia Push Robotics World Models Forward

Two major world models for physical AI emerged this week, signaling accelerating convergence of foundation models and robotics.

Xiaomi...

Xiaomi Open-Sources Robotics World Model Behind an 82× Data Generation Speedup

techtimes.com

Xiaomi Open-Sources Robotics World Model Behind an 82× Data Generation Speedup

19h ago

Community Applauds Inkling's Low-Hype Style and US Open-Weight Push

The AI community is highlighting Thinking Machines' restrained, detail-rich launch of Inkling as a refreshing contrast to typical hype cycles.

-...

Google Gemini 3.5 Flash + Omni multimodal/agentic; new Flash checkpoint on LM Arena; Pro benchmark leak

Digest Calendar

Recent Posts