Robotics Agent Advances
Key Questions
How many VLA models were presented at ICLR 2026?
VLA model submissions surged from 1 to 164 at ICLR 2026. PRTS VLA achieved a 95.9% success rate in real-world robot tasks.
What humanoid robots are entering production scale?
Figure 02 is deploying at BMW, while Agility Digit and Unitree models priced at $25k are scaling production. The ROS versus NVIDIA platform competition continues in this space.
How does PRISM support on-device robot planning?
PRISM distills SLMs to reach 93% of GPT-4o performance for robot planning while running fully on-device. It minimizes human intervention during the distillation process.
What is SUGAR and how does it advance humanoid control?
SUGAR is a pipeline that learns humanoid loco-manipulation directly from human videos with zero-shot sim-to-real transfer on the Unitree G1. It enables scalable learning without task-specific engineering.
What simulation advances does Genesis World 1.0 offer?
Genesis World 1.0 provides robotics simulation that is 10x faster with a reduced sim-to-real gap. NVIDIA Gamma-World extends this to multi-agent scenarios at 24 FPS with zero-shot generalization.
How does GEM improve embodied AI performance?
GEM uses generative supervision to achieve state-of-the-art results across multiple embodied benchmarks. It enhances learning efficiency for physical agents and robots.
What efficiency improvements exist for excavator control?
Efficient model-based RL enables excavator control policies to be learned in just 2.5 hours. High-frequency action chunks in latent space further boost control precision.
What is the focus of the ROS versus NVIDIA platform debate?
The debate centers on open ROS frameworks versus proprietary NVIDIA physical AI stacks for humanoid and robot deployments. Production-scale robots are intensifying this competition.
VLA models surge at ICLR 2026 (1 to 164). PRTS VLA achieves 95.9% real-world success. PRISM distills SLMs for robot planning (93% GPT-4o on-device). Humanoid production scale: Figure 02 at BMW, Agility Digit, Unitree $25k. New: SUGAR pipeline for humanoid loco-manipulation from human videos, zero-shot sim-to-real on Unitree G1. GEM generative supervision for embodied AI (SOTA on multiple benchmarks). Genesis World 1.0 robotics simulation (10x faster, low sim-to-real gap). NVIDIA Gamma-World multi-agent world model (4-player 24 FPS, zero-shot generalization). Efficient model-based RL for excavator control (2.5h learning). High-frequency action chunks in latent space. ROS vs NVIDIA platform battle continues.