Sumo: Dynamic and Generalizable Whole-Body Loco-Manipulation
John Z. Zhang1,2, Maks Sorokin2*, Jan Brüdigam2*, Brandon Hung2*, Stephen Phillips2, Dmitry Yershov2,
Farzad Niroui2, Tong Zhao2, Leonor Fermoselle2, Xinghao Zhu2, Chao Cao2, Duy Ta2,
Tao Pang2, Jiuguang Wang2, Preston Culbertson2,3, Zachary Manchester1, and Simon Le Cléac'h2
1MIT   2RAI Institute   3Cornell
*Equal Contribution
This work was done in part during an internship at the RAI Institute.
Corresponding Email: jzhang3@mit.edu
Abstract

This paper presents a sim-to-real approach that enables legged robots to dynamically manipulate large and heavy objects with whole-body dexterity. Our key insight is that by performing test-time steering of a pre-trained whole-body control policy with a sample-based planner, we can enable these robots to solve a variety of dynamic loco-manipulation tasks. Interestingly, we find our method generalizes to a diverse set of objects and tasks with no additional tuning or training, and can be further enhanced by flexibly adjusting the cost function at test time. We demonstrate the capabilities of our approach through a variety of challenging loco-manipulations tasks on a Spot quadruped robot in the real world, including uprighting a tire heavier than the robot's nominal lifting capacity and dragging a crowd-control barrier larger and taller than the robot itself. Additionally, we show that the same approach can be generalized to humanoid loco-manipulations tasks, such as opening a door and pushing a table, in simulation.

Methods
System Overview Dynamics Comparison

System Overview: (left) Sumo takes an hirarchical approach that combines a pre-trained whole-body control policy (purple) with a high-level sample-based MPC (yellow). The whole-body control policy takes in the current state and desired torso, arm, and leg commands and outputs the joint-level commands for the quadruped or humanoid robots at $50$Hz. The high-level sample-based MPC aims to minimize a task-specific cost function by performing dyanmics rollouts with the policy-in-the-loop (right).

Spot Loco-Manipulation

Note: the Spot robot has a peak lift capacity of 11kg and a continuous load capacity of 5kg.

G1 Loco-Manipulation
Experimental Analysis
BibTeX
      
      @article{2025sumo,
        title={Sumo: Dynamic and Generalizable Whole-Body Loco-Manipulation},
        author={[Authors to be added]},
        journal={[Journal/Conference to be added]},
        year={2025}
        }