Cognitive Engineering Frontier

Mamba series & DUET/M2RNN/PAREformer/gated cond injection — SSM-hybrid resurgence

Mamba series & DUET/M2RNN/PAREformer/gated cond injection — SSM-hybrid resurgence

Key Questions

What is the Mamba series resurgence?

The Mamba series resurgence includes Mamba-3, DUET, M2RNN, PAREformer, TurboQuant, TAPS, and LongCat-Next. These SSM-hybrid models support 16K+ context lengths with gated conditional injection.

What are the key features of Gemma 4?

Gemma 4 is a 26B MoE model (4B active) achieving 162 tokens/s decode on RTX 4090 and 262K multimodal/edge SOTA. It runs frontier AI on single GPUs with Apache 2.0 licensing.

What is MegaTrain?

MegaTrain enables full-precision training of 100B+ parameter LLMs on a single GPU. It advances efficient training for SSM-hybrid architectures.

How does Gemma 4 perform on edge devices?

Gemma 4 delivers high performance on single Nvidia GPUs like RTX 4090, with 162 t/s decode and 8,400 t/s prefetch. It supports data centers and edge devices multimodally.

What is gated condition injection?

Gated condition injection enables controllable linear-attention transformers without multimodal attention. It supports long contexts in SSM-hybrid models like those in the Mamba series.

What benchmarks are ongoing for these models?

Ongoing benchmarks and local demos evaluate dynamical agent scaling and test-time performance. Models like Gemma 4 set edge SOTA in multimodal tasks.

What is the significance of DUET and M2RNN?

DUET and M2RNN are part of the SSM-hybrid resurgence, improving efficiency over transformers. They contribute to scalable, long-context processing.

How does PAREformer fit into this trend?

PAREformer advances SSM architectures with efficient representations. It aligns with the explosion in hybrid models supporting agentic and high-throughput applications.

Mamba-3/DUET/M2RNN/PAREformer/TurboQuant/TAPS/LongCat-Next/gated cond 16K+ ctx; Gemma4 26B MoE 162t/s/RTX 262K multimodal/edge SOTA; MegaTrain single-GPU 100B+ full-precision training; dynamical agent/test-time scaling. Ongoing benchmarks/local demos.

Sources (9)
Updated Apr 8, 2026