In a deep dive on the Robo Papers podcast, 1X Director of Evaluations Daniel Ho explains how "imagination" via video generation is allowing humanoids to perform zero-shot tasks with minimal robot-specific training data.
NVIDIA GEAR Lab has unveiled DreamZero, a 14-billion-parameter World Action Model (WAM) that uses video diffusion to grant robots physical "imagination," enabling zero-shot task completion and rapid adaptation across different robotic embodiments.
In a new interview, Google DeepMind's Carolina Parada discusses the "agentic" two-part brain of Gemini Robotics 1.5, its "surprising" ability to transfer skills between different robots, and why she believes the home will be "one of the last frontiers" for the technology.
In a new technical deep dive, Tesla's VP of AI and Optimus lead Ashok Elluswamy details the end-to-end neural network strategy for FSD, confirming that the same "neural world simulator" and architecture will "seamlessly transfer" to its humanoid robot.
Speaking at MIT, Yann LeCun argued the "big secret" of the humanoid industry is that companies lack the fundamental AI breakthroughs—specifically "world models"—needed to make robots "generally useful" in domestic settings.