Google Gemma 4 open multimodal/agentic family + edge SOTAs + INT4 quants

Key Questions

What is Google Gemma 4?

Google Gemma 4 is a family of open-weight multimodal and agentic AI models released by Google DeepMind under the Apache 2.0 license. The family comprises the E2B and E4B edge models, a 26B MoE model, and a 31B dense model, which accept text, image, and audio inputs. Demos are available on Hugging Face, on mobile, and on YouTube.

What sizes are available in the Gemma 4 family?

The Gemma 4 family consists of four variants: two edge models sized by effective parameter count (E2B and E4B), a 26B Mixture-of-Experts (MoE) model with about 4B active parameters, and a 31B dense model. They target a range of hardware, including consumer GPUs such as the RTX 4090.
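To make the "26B total, about 4B active" figure concrete, here is a minimal sketch of the arithmetic. It assumes only the two rounded parameter counts quoted above. The function name is illustrative and is not taken from any Gemma tooling.

```python
def active_fraction(active_billions: float, total_billions: float) -> float:
    """Share of a model's parameters that are used for each token."""
    return active_billions / total_billions


# Rounded figures quoted above: about 4B active out of 26B total.
moe_share = active_fraction(4, 26)
print(f"MoE parameters used per token: {moe_share:.1%}")  # 15.4%
```

In an MoE model all 26B weights still have to be stored, but only the routed experts run for each token, which is why per-token compute is closer to that of a 4B model than a 26B one.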

What license does Gemma 4 use?

Gemma 4 models are released in full under the Apache 2.0 license, which permits broad commercial and research use. They are available on Hugging Face, where they are currently trending.

What are the key benchmark scores for Gemma 4?

Gemma 4 scores 85.7% on GPQA and 89% on AIME. Running as an 8-bit MLX build, it scores 76.9% on MMMU-Pro and 85.6% on MATH-Vision. The listed sources describe these results as outperforming larger proprietary models.

Can Gemma 4 run on consumer hardware?

Yes. The 26B MoE model runs on a single RTX 4090 at 162 tokens/second decode and 8,400 tokens/second prompt processing. The 31B model requires 24 GB of VRAM with Q4 quantization.
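The figures above can be sanity-checked with back-of-the-envelope arithmetic. The sketch below makes two assumptions: that Q4 means exactly 4 bits per weight (real Q4 formats usually store a little more for scales), and that the quoted RTX 4090 throughput stays constant over the whole request. The function names are illustrative only.

```python
def weight_memory_gb(params_billions: float, bits_per_weight: float) -> float:
    """Approximate memory for the weights alone, in decimal gigabytes."""
    total_bytes = params_billions * 1e9 * bits_per_weight / 8
    return total_bytes / 1e9


def response_seconds(prompt_tokens: int, output_tokens: int,
                     prefill_tps: float = 8400.0,
                     decode_tps: float = 162.0) -> float:
    """Prompt-processing time plus decode time at the quoted throughput."""
    return prompt_tokens / prefill_tps + output_tokens / decode_tps


# 31B dense model at an assumed 4 bits per weight.
print(weight_memory_gb(31, 4))        # 15.5

# An 8,400-token prompt followed by 162 generated tokens.
print(response_seconds(8400, 162))    # 2.0
```

Under these assumptions the 31B weights take roughly 15.5 GB, which leaves part of a 24 GB card free for the KV cache and activations. The estimate ignores those runtime costs, so treat it as a lower bound.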

What is Google AI Edge Eloquent?

Google AI Edge Eloquent is a free on-device voice recognition tool quietly released for iOS. It enables voice processing without cloud dependency.

Is Gemma 4 multimodal?

Yes. Gemma 4 handles text and image inputs, and its model card also details audio and text inference. One of the listed sources describes it as "frontier multimodal intelligence on device."

How popular is Gemma 4?

Gemma 4 is #1 on Hugging Face trending and downloads. It has numerous YouTube demos and developer guides highlighting its performance.

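Gemma 4 ships in E2B, E4B, 26B, and 31B sizes under Apache 2.0, with demos on Hugging Face, mobile, and YouTube. Reported scores are GPQA 85.7%, AIME 89%, and MMMU-Pro 76.9%, and the family is #1 on Hugging Face trending. INT4 quantized models are now on Hugging Face via GoogleAI/IntelAI for edge inference, alongside the AI Edge Eloquent voice recognition app for iOS.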
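For readers unfamiliar with INT4 quantization, the following is a minimal sketch of the general idea: symmetric round-to-nearest quantization with a single scale. This is an assumption made for illustration and not the scheme used for the published Gemma 4 checkpoints, which is not described here. Production INT4 formats typically use per-group scales and pack two values per byte.

```python
def quantize_int4(weights: list[float]) -> tuple[list[int], float]:
    """Map floats to integers in [-8, 7] using one shared scale."""
    max_abs = max(abs(w) for w in weights)
    scale = max_abs / 7 if max_abs > 0 else 1.0
    # Round to the nearest step, then clamp to the signed 4-bit range.
    q = [max(-8, min(7, round(w / scale))) for w in weights]
    return q, scale


def dequantize_int4(q: list[int], scale: float) -> list[float]:
    """Recover approximate float weights from the 4-bit integers."""
    return [v * scale for v in q]


weights = [0.7, -0.3, 0.12, 0.0]      # toy values, not real model weights
q, scale = quantize_int4(weights)
restored = dequantize_int4(q, scale)
print(q)                              # [7, -3, 1, 0]
print(max(abs(a - b) for a, b in zip(weights, restored)))
```

Each weight is stored in 4 bits instead of 16 or 32, at the cost of a rounding error of at most half a quantization step per weight.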

Sources (27)

Updated Apr 8, 2026

AI Model Release Tracker

Google quietly released 'Google AI Edge Eloquent,' a free voice ...

@_akhaliq: Agentic-MME What Agentic Capability Really Brings to Multimodal Intelligence? paper: https://t.co/...

From Google Blog - Google Just Dropped Gemini 3… It’s Insane

Google Gemma 4 Explained: Why Google's Apache 2.0 Open Model ...

Google Just Dropped Gemma 4 + Veo 3.1 Lite And Quietly Killed the Cloud ...

Google Gemma 4 Just Dropped — Math Score Went From 20% to 89%

Google Gemma 4: The Open-Source AI Model Changing the Game

[AINews] Gemma 4: The best small Multimodal Open Models ...

Google Gemma 4 Developer Guide: Benchmarks & Local Setup

Google Gemma 4 Explained 🚀 | Features, Benchmarks & Use Cases

Google Gemma 4 Deep Dive: Architecture, MoE & Benchmarks

Welcome Gemma 4: Frontier multimodal intelligence on device

Gemma 4 31B - Intelligence, Performance & Price Analysis

Google Gemma 4: Open Models That Beat Giants 20x Their Size

@ClementDelangue reposted: Gemma 4 26B MoE (4B active) on a single RTX 4090: - 162 t/s decode - 8,400 t...

Google releases Gemma 4 open models

Bring state-of-the-art agentic skills to the edge with Gemma 4

@ClementDelangue reposted: Meet Gemma 4! Purpose-built for advanced reasoning and agentic workflows on the...

@Scobleizer reposted: Exciting news for Jetson developers 🎉 Gemma 4 is now on Jetson. @GoogleGemma’s ...

@jeremyphoward reposted: Google Deep Mind's impressive fully-open Gemma 4 is live day-zero on Modular Clo...

How to build on-device AI with Gemma 4

Google launches open-source model Gemma 4: How to try it | Mashable

Google launches Gemma 4: four open-weight models from smartphones to workstations

Google releases Gemma 4 under Apache 2.0 — and that license change may matter more than benchmarks

Google’s bold Gemma 4 bet targets Meta’s hold on developers

Google Jumps Back Into the Open Source AI Race With Gemma 4

Google releases its most powerful open-source AI models yet, that's free to use commercially
