Whole-Body Impedance MPC for Safe Human-Robot Contact

A new whole-body control architecture lets floating-base humanoids absorb sustained, constant human contact forces with zero steady-state error, using a Kalman-augmented receding-horizon QP that runs in the robot's residual null space at 1 kHz or faster.

Ask a humanoid robot to stand on two feet and then lean on it, and you expose one of the hardest problems in the field. The robot is a floating-base system: unlike an industrial arm bolted to the floor, nothing holds it up except the contact forces at its feet, and those forces are constrained, because the feet can only push on the ground, never pull. It must keep its balance against gravity and its own motion while a human applies an external force that the robot did not plan for. Existing whole-body control frameworks tend to handle this badly. Either they commit the robot's entire joint space to the locomotion-and-balance problem, leaving nothing to gracefully accommodate the human, or they fall back on fixed-gain impedance, a spring-and-damper response that feels compliant but carries a steady-state error under sustained load: lean on it and the robot settles away from its target posture and stays there for as long as the push lasts.

A new paper, “Whole-Body Impedance Model Predictive Control for Safe Physical Human-Robot Interaction on Floating-Base Platforms” (arXiv:2606.14617) by Yongyan Cao, attacks exactly this gap. It extends a previously published fixed-base, two-layer Impedance MPC, designed for a robot anchored to the ground, to the much harder floating-base case, and it does so with an architecture engineered to be both provably correct under sustained contact and fast enough to run on real hardware.

“Floating-base robots must balance under rigid contact constraints while interacting safely with humans.” (arXiv:2606.14617)

Three levels, each doing one job

The design is a three-level stack, and the separation is the point. At the top, a centroidal MPC plans the robot's contact forces over a 500 millisecond horizon. Centroidal models reduce the full-body dynamics to the motion of the robot's center of mass and overall momentum, which is the right abstraction for balance: it captures whether the robot will tip without drowning in the detail of every joint. Planning contact forces ahead over half a second lets the controller anticipate rather than merely react.
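To make the centroidal abstraction concrete, here is a minimal sketch of its linear part: the center-of-mass acceleration follows from the summed contact forces plus gravity, and each contact force must push on the ground and stay inside a friction cone. The function names, the 10 ms planning step, and the friction coefficient used in the example are illustrative assumptions, not values from the paper, and angular momentum is left out entirely.

```python
GRAVITY = (0.0, 0.0, -9.81)  # m/s^2, world frame with z pointing up


def horizon_steps(horizon_s=0.5, dt_s=0.01):
    """Number of planning steps in the horizon (dt_s is an assumed value)."""
    return round(horizon_s / dt_s)


def com_acceleration(contact_forces, mass):
    """Linear centroidal dynamics: m * a = sum(f_i) + m * g."""
    total = [sum(force[k] for force in contact_forces) for k in range(3)]
    return tuple(total[k] / mass + GRAVITY[k] for k in range(3))


def force_is_feasible(force, mu):
    """Unilateral contact (push only) plus a Coulomb friction cone."""
    fx, fy, fz = force
    return fz >= 0.0 and (fx * fx + fy * fy) ** 0.5 <= mu * fz


# Hypothetical 35 kg robot standing still with its weight split over two feet.
half_weight = (0.0, 0.0, 35.0 * 9.81 / 2.0)
standing_accel = com_acceleration([half_weight, half_weight], mass=35.0)
```

A planner built on this model would choose one feasible force per contact at each of the `horizon_steps()` steps; any plan that needs a negative vertical force is asking the foot to pull on the floor and has to be rejected.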

In the middle sits a priority-driven whole-body control layer that resolves the balance objective into actual joint torques. It does this through contact-consistent null-space projection, a technique that enforces the high-priority balance task first and then expresses everything else in the mathematical space of motions that do not disturb that task. This is what keeps balance sacrosanct: lower-priority objectives are only allowed to use the degrees of freedom that remain after balance has taken what it needs.
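The core idea of null-space projection fits in a few lines. The sketch below uses the plain kinematic projector N = I - J⁺J for a single-row task Jacobian. That is a simplification: a contact-consistent projector of the kind the paper describes would normally also account for the contact constraints and the robot's inertia. The Jacobian row and the secondary command are made-up numbers.

```python
def nullspace_projector(task_row):
    """N = I - J^+ J for a one-row task Jacobian J, where J^+ = J^T / (J J^T)."""
    norm_sq = sum(v * v for v in task_row)
    n = len(task_row)
    return [[(1.0 if r == c else 0.0) - task_row[r] * task_row[c] / norm_sq
             for c in range(n)] for r in range(n)]


def apply_matrix(matrix, vector):
    """Matrix-vector product on plain lists."""
    return [sum(m * v for m, v in zip(row, vector)) for row in matrix]


def dot(a, b):
    return sum(x * y for x, y in zip(a, b))


# Hypothetical high-priority task row (for example, one balance direction).
balance_row = [1.0, 2.0, 0.0, -1.0]
# Hypothetical lower-priority joint command (for example, yielding to a push).
secondary_cmd = [0.3, -0.1, 0.5, 0.2]

projector = nullspace_projector(balance_row)
safe_cmd = apply_matrix(projector, secondary_cmd)
# dot(balance_row, safe_cmd) is zero up to rounding: the projected command
# cannot move the high-priority task.
```

Projecting twice changes nothing, which is the algebraic way of saying that the lower-priority layer only ever sees the leftover degrees of freedom.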

The bottom level is where the human comes in. The residual null space, the spare degrees of freedom left over after balance is satisfied, is governed by a receding-horizon quadratic program (QP) that predicts and rejects physical human-robot interaction (pHRI) disturbances. Crucially, it estimates those disturbances using a Kalman-augmented state. Rather than passively springing back against a push, this layer forms a running estimate of the external force a human is applying and actively plans to counter it, all within the freedom that balance has left available. The architecture's elegance is that human accommodation never competes with staying upright; it lives strictly in the leftover space.
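One common way to build a Kalman-augmented state is to append the unknown push to the state vector and let the filter estimate it alongside position and velocity. The sketch below does this for a one-dimensional, unit-mass double integrator with a position-only measurement. The noise values, the time step, and the choice of measurement are assumptions made for illustration; the paper's estimator may differ in all of them.

```python
def kalman_step(x, p, z, a, q_diag, r_meas):
    """One predict/update cycle for the state x = [position, velocity, push]."""
    n = len(x)
    # Predict: x <- A x and P <- A P A^T + Q (no control input in this toy).
    x_pred = [sum(a[i][k] * x[k] for k in range(n)) for i in range(n)]
    ap = [[sum(a[i][k] * p[k][j] for k in range(n)) for j in range(n)]
          for i in range(n)]
    p_pred = [[sum(ap[i][k] * a[j][k] for k in range(n))
               + (q_diag[i] if i == j else 0.0)
               for j in range(n)] for i in range(n)]
    # Update with a scalar position measurement, H = [1, 0, 0].
    innovation_var = p_pred[0][0] + r_meas
    gain = [p_pred[i][0] / innovation_var for i in range(n)]
    residual = z - x_pred[0]
    x_new = [x_pred[i] + gain[i] * residual for i in range(n)]
    p_new = [[p_pred[i][j] - gain[i] * p_pred[0][j] for j in range(n)]
             for i in range(n)]
    return x_new, p_new


def estimate_push(true_push=2.0, dt=0.001, steps=2000):
    """Run the filter against a simulated constant push; return the estimate."""
    # Augmented model: the push acts as an acceleration and is modeled as constant.
    a = [[1.0, dt, 0.5 * dt * dt], [0.0, 1.0, dt], [0.0, 0.0, 1.0]]
    truth = [0.0, 0.0, true_push]
    x = [0.0, 0.0, 0.0]                        # the filter starts with no push
    p = [[1.0, 0.0, 0.0], [0.0, 1.0, 0.0], [0.0, 0.0, 10.0]]
    for _ in range(steps):
        truth = [sum(a[i][k] * truth[k] for k in range(3)) for i in range(3)]
        x, p = kalman_step(x, p, truth[0], a, (0.0, 0.0, 1e-6), 1e-4)
    return x[2]
```

The small process noise on the third state is what allows the estimate to follow a push that changes slowly; setting it to zero would model the push as perfectly constant.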

The engineering that makes it real-time

A predictive QP in the loop sounds expensive, and on a humanoid the control rate is non-negotiable: balance controllers need to run at roughly a kilohertz to remain stable. The paper's answer is a structural simplification. Through a contact-consistent feedback linearization, the arm's end-effector dynamics are reduced to a double integrator, a clean point-mass-like model, with a constant state matrix within each contact mode. Because that matrix does not change while the robot's feet stay planted the same way, the cost terms of the QP can be precomputed offline. The expensive math is done once, ahead of time, and the online loop becomes cheap enough to run at 1 kHz or faster. This is the kind of detail that separates a controller that works in a paper from one that works on a robot: the theory is shaped specifically so the real-time budget is met.
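The precomputation argument can be sketched for a scalar double integrator. With cost J = 0.5 Σ (x_kᵀ Q x_k + r u_k²) and dynamics x_{k+1} = A x_k + B u_k, eliminating the states gives a QP in the input sequence U whose Hessian depends only on A, B, Q and r, while only the linear term depends on the current state. The weights, the 20-step horizon, and all names below are illustrative assumptions, and the sketch stops short of calling a QP solver.

```python
def double_integrator(dt):
    """Discrete double integrator: state [position error, velocity error]."""
    return [[1.0, dt], [0.0, 1.0]], [0.5 * dt * dt, dt]


def mat2_mul(m, n):
    return [[sum(m[r][k] * n[k][c] for k in range(2)) for c in range(2)]
            for r in range(2)]


def mat2_vec(m, v):
    return [m[0][0] * v[0] + m[0][1] * v[1], m[1][0] * v[0] + m[1][1] * v[1]]


def precompute_qp_terms(dt, horizon, q_diag, r_u):
    """Offline step: Hessian H and map F such that the gradient is g = F x0."""
    a, b = double_integrator(dt)
    powers = [[[1.0, 0.0], [0.0, 1.0]]]          # powers[k] = A^k
    for _ in range(horizon):
        powers.append(mat2_mul(a, powers[-1]))

    def gamma(k, j):
        """Effect of input u_j on state x_k (zero when j >= k)."""
        return mat2_vec(powers[k - 1 - j], b) if j < k else [0.0, 0.0]

    hessian = [[0.0] * horizon for _ in range(horizon)]
    grad_map = [[0.0, 0.0] for _ in range(horizon)]
    for k in range(1, horizon + 1):
        for i in range(horizon):
            gi = gamma(k, i)
            wi = [q_diag[0] * gi[0], q_diag[1] * gi[1]]   # Q * gamma(k, i)
            for j in range(horizon):
                gj = gamma(k, j)
                hessian[i][j] += wi[0] * gj[0] + wi[1] * gj[1]
            for c in range(2):
                grad_map[i][c] += wi[0] * powers[k][0][c] + wi[1] * powers[k][1][c]
    for i in range(horizon):
        hessian[i][i] += r_u
    return hessian, grad_map


def online_gradient(grad_map, x0):
    """Online step: the only part that depends on the measured state."""
    return [row[0] * x0[0] + row[1] * x0[1] for row in grad_map]


# Done once per contact mode (assumed weights and horizon).
HESSIAN, GRAD_MAP = precompute_qp_terms(0.001, 20, (1000.0, 10.0), 1e-3)
# Done every control tick, here with a hypothetical tracking error.
gradient = online_gradient(GRAD_MAP, [0.02, -0.1])
```

Each tick then costs one small matrix-vector product plus the QP solve itself, and a solver that can reuse a factorization of the fixed Hessian benefits from the same structure.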

The contact-switch problem, and a guarantee

The genuinely subtle issue with floating-base robots is contact-mode switching. A walking humanoid is constantly changing which feet are on the ground, and each switch changes the dynamics and the null-space structure. A disturbance estimator that resets at every switch would lose its hard-won estimate of the human's force precisely when continuity matters most. The paper introduces a covariance-inflation protocol that preserves the disturbance estimate across contact-mode switches, carrying the Kalman estimator's knowledge through the discontinuity by widening its uncertainty rather than discarding it.
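The difference between resetting and inflating is easy to show in code. The sketch below is only the general idea of covariance inflation, assuming a single scalar inflation factor; the article does not spell out the paper's actual protocol, and the function names and the factor of 10 are hypothetical.

```python
def reset_on_switch(estimate, initial_variance):
    """Naive handling: throw away the estimate and start over."""
    n = len(estimate)
    mean = [0.0] * n
    covariance = [[initial_variance if r == c else 0.0 for c in range(n)]
                  for r in range(n)]
    return mean, covariance


def inflate_on_switch(estimate, covariance, factor=10.0):
    """Keep the estimated push, but widen its uncertainty.

    Scaling the whole matrix by a factor above 1 keeps it symmetric and
    positive definite, so the filter can continue from it directly.
    """
    n = len(covariance)
    inflated = [[covariance[r][c] * factor for c in range(n)] for r in range(n)]
    return list(estimate), inflated


# Hypothetical estimate of a push just before a foot leaves the ground.
push_estimate = [1.5, -0.2]
push_covariance = [[0.04, 0.01], [0.01, 0.09]]

kept_mean, wider_covariance = inflate_on_switch(push_estimate, push_covariance)
lost_mean, fresh_covariance = reset_on_switch(push_estimate, 1.0)
```

After inflation the next few measurements are weighted more heavily, so the filter can correct quickly if the switch really did change things, while still starting from the value it had already learned.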

That preservation is what enables the central theoretical claim: the controller guarantees zero steady-state error under bounded, constant pHRI loads. In plain terms, if a human applies a steady push, the robot holds its target posture exactly, with none of the residual offset that plagues fixed-gain impedance. The paper also proves an Impedance Equivalence Theorem showing that in the infinite-horizon limit the predictive controller recovers a classical task-space impedance law, but with a twist: the effective mass, damping, and stiffness adapt to the robot's posture and contact configuration rather than staying fixed. This matters because it situates the method within decades of well-understood impedance control theory while explaining precisely how it improves on it: the predictive structure yields the familiar spring-damper feel, plus disturbance rejection, plus posture-aware gains, plus the steady-state guarantee.
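A one-dimensional toy makes the steady-state argument tangible. A unit mass under a fixed-gain spring-damper law and a constant push F settles at an offset of F/K, while the same law with the estimated push subtracted settles at zero. The sketch below uses a simple first-order disturbance observer as a stand-in for the paper's Kalman-augmented predictive controller and assumes acceleration is measurable; the gains, the push, and the observer are assumptions chosen only to show the effect.

```python
def settle_position(stiffness, damping, push, compensate,
                    dt=0.001, steps=5000, observer_gain=20.0):
    """Simulate a unit mass under a constant push; return the final position.

    The target posture is position 0, so the return value is the tracking error.
    """
    pos, vel, push_hat = 0.0, 0.0, 0.0
    for _ in range(steps):
        command = -stiffness * pos - damping * vel    # fixed-gain impedance
        if compensate:
            command -= push_hat                       # cancel the estimated push
        accel = command + push                        # unit mass: a = u + F_ext
        # Stand-in estimator: low-pass filter on the unexplained acceleration.
        push_hat += observer_gain * ((accel - command) - push_hat) * dt
        vel += accel * dt
        pos += vel * dt
    return pos


# Hypothetical gains and push: K = 100 N/m, D = 20 N s/m, F = 5 N.
offset_fixed_gain = settle_position(100.0, 20.0, 5.0, compensate=False)
offset_compensated = settle_position(100.0, 20.0, 5.0, compensate=True)
```

With these numbers the fixed-gain case ends near F/K = 0.05 m and the compensated case ends near zero. The toy says nothing about the posture-dependent gains of the equivalence theorem; it only illustrates why estimating the load removes the offset.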

Why it matters

The trajectory of humanoid robotics is moving from carefully choreographed demos toward robots that share space with people and are expected to be touched, guided, leaned on, and occasionally jostled. Safe physical human-robot interaction is the gating capability for that future, and it is fundamentally a control problem: the robot must be compliant enough to be safe and accommodating, yet stable enough never to fall, and it must hold its ground under sustained, not just momentary, contact. The steady-state drift of fixed-gain impedance is exactly the kind of flaw that is invisible in a quick demo and disqualifying in deployment.

The validation, in simulation on a 17-DOF biped and on the Unitree G1 (a commercially available humanoid), signals that the method targets realistic platforms rather than an idealized abstraction. The honest caveat is that the results are in simulation, and the steady-state guarantee is established for bounded, constant contact loads; real human interaction is messier, with time-varying and impulsive forces that sit outside that proven envelope. Still, the contribution is the architecture's shape. By cleanly separating balance, torque resolution, and predictive disturbance rejection, and by engineering each layer so the whole stack runs at hardware speed with a provable steady-state property, the paper offers a template for how compliant humanoids can be made both safe to touch and reliably upright, which is the combination the next phase of the field actually needs.
