UB-SMoE Fixes Expert Imbalance in Heterogeneous Federated Tuning
UB-SMoE resolves client heterogeneity-induced load imbalance in federated fine-tuning via Dynamic Modulated Routing for expert rebalancing and...

Created by Jaime S
Latest AI models, benchmarks, algorithms, and applications across robotics, healthcare, coding
Explore the latest content tracked by AI Innovation Radar
UB-SMoE resolves client heterogeneity-induced load imbalance in federated fine-tuning via Dynamic Modulated Routing for expert rebalancing and...
Traditional benchmarks measure model smarts on exams, but GAIA checks if agents can actually complete work.
AdaJEPA turns frozen world models into adaptive ones by closing the loop between planning and learning. After each MPC step, the model executes its...
GPT-5.6 Sol scored 88.8% on TerminalBench 2.1, nearly ten points ahead of Claude Opus 4.8, with the Ultra variant reaching 91.9% via parallel...
The article investigates whether LLMs reason equally well across 43 languages.
PAW reframes LLMs as compilers that generate reusable 23MB adapters instead of answering queries repeatedly.
OpenAI previewed its GPT-5.6 family of three vision-language models (Sol, Terra, Luna) with tiered pricing and performance, currently restricted to...
Large-scale evaluations reveal simpler ML often matches expensive tabular foundation models for routine clinical predictions.
A developer reports switching completely to open models, using GLM-5.2 daily in Claude Code via Hugging Face Inference Providers and hf-claude. Open models are becoming easier to plug directly into real developer workflows.
Two recent advances highlight efficient inference without heavy retraining or custom hardware:
High exam scores hide critical failures in clinical LLMs. Models hitting 92% on licensing tests plummet to 44.8% on real EHR benchmarks like BRIDGE,...
New benchmarks target real agent weaknesses instead of final scores.
Diffusion language models are shifting from experimental releases to practical tools, showing clear speed and flexibility edges over autoregressive...
Meta's Watermelon reportedly matches GPT-5.5 on undisclosed benchmarks, marking its next frontier push after Muse Spark while Zuckerberg admits slower-than-expected AI progress.
WorldDirector decouples semantic motion from pixel rendering via LLM-orchestrated 3D trajectories, delivering strict physical consistency and...