Applied AI Research Digest

225 posts

Updated 11h ago

84 scanned

SeaCache proposes a spectral-evolution-aware cache designed for accelerating diffusion models. Ideal for compute savings in generative AI products—join the discussion.

Dynamic routing adapts cognitive depth per step—from intuitive to reflective—grounded in ACT-R theory
Two-stage training: CoSFT for stable...

GUI-Libra trains native GUI agents to reason and act using action-aware supervision and partially verifiable RL – a practical recipe for deployable, reliable navigation in desktop/mobile products.

Midtraining Techniques

🔥 Bridging Pre- and Posttraining: New preprint uses controlled experiments to show midtraining helps most when it...

Practical lever for agent reliability:

Trace-Free+ uses curriculum learning on execution traces to rewrite human-centric tool descriptions for...

Midtraining helps most when it bridges pretraining and posttraining, mitigating forgetting—per controlled experiments in new preprint on training pipelines.

Key RL recipes to combat interaction collapse in open-weight multimodal agents:

Oversampling-filtering-ranking rollouts + accumulative tool rewards...

New Paper Pages

🔥 One-step Language Modeling via Continuous Denoising: Join the discussion on this paper page.
K-Search: LLM Kernel...

One-step language modeling via continuous denoising – a promising efficiency leap for low-latency LM inference in products.

Key technique for scarce paired data: UML trains a single model alternately on unpaired inputs from modalities like text/audio/images, sharing...

Unified framework standardizes Vision-Language-Action (VLA) design for robotics
Ablates SigLIP2 vision encoder and LLaMA-3.2 backbone, revealing...

COW CORPUS introduces a dataset of 400 real-user web navigation trajectories to model human intervention.

Key highlights:

Identifies four distinct...

K-Search advances LLM kernel generation using co-evolving intrinsic world models, offering practical techniques for agentic search and planning.

Mobile-O introduces unified multimodal understanding and generation optimized for mobile devices, enabling efficient on-device deployment in product apps.

Core ML Generation & Optimization

🔥 FMLM: One-Step LLM via Continuous Denoising: Paper introduces flow-based language model (FLM) using...

Flow-based FMLM delivers one-step generation rivaling prior 8-step quality, via continuous denoising on one-hot tokens for superior stability.
-...

Adam improves Muon by integrating adaptive moment estimation with orthogonalized momentum, targeting efficiency in adaptive optimizers for industry-scale ML.

DeepVision-103K launches as a visually diverse, broad-coverage, and verifiable mathematical dataset for multimodal reasoning.

Key strengths for...

FAMOSE uses ReAct LLM agents to automate feature discovery and selection for tabular data:

Explores, generates, refines features with real-time...

Key findings from RCT on viral reverse genetics workflows:

153 novices compared LLM assistance vs. internet search; no sig. difference in overall...

Reinforcement learning applications to energy systems, microgrids, powertrains, humanoid walking, and industrial scheduling

Benchmarks, evaluation suites, and collaborative agent frameworks

Reinforcement learning algorithms, reward modeling, and adaptive reasoning depth for language and multimodal reasoning models

World-model-based agents, vision-language-action models, and embodied control using RL

Runtime memory architectures, agent benchmarks, and evaluation of skills and robustness

Stabilizing RL fine‑tuning, trust regions, reward modeling, and optimization methods for LLMs and diffusion models

Interactive world models, vision‑language‑action architectures, and robotic control with RL

Efficient test‑time compute scaling, discrete diffusion, and few‑step generation for text and multimodal models

RL frameworks, data selection, and post‑training pipelines for agentic models

Quantization, compression, and optimizer innovations for efficient large models

Recent Posts

SeaCache: Spectral-Evolution-Aware Caching for Faster Diffusion Models

SeaCache: Spectral-Evolution-Aware Cache for Accelerating Diffusion Models

CogRouter: Step-Level Adaptive Cognition Slashes LLM Agent Compute Waste

GUI-Libra: Action-Aware Framework for Reliable Native GUI Agents

GUI-Libra: Training Native GUI Agents to Reason and Act with Action-aware Supervision and Partially Verifiable RL