EquiBim: Learning Symmetry-Equivariant Policy for Bimanual Manipulation

This paper introduces EquiBim, a model-agnostic framework for bimanual robot imitation learning that enforces bilateral symmetry equivariance between observations and actions, thereby improving performance and robustness across diverse observation modalities and action representations in both simulation and real-world dual-arm systems.

Zhiyuan Zhang, Aditya Mohan, Seungho Han, Wan Shou, Dongyi Wang, Yu She

Published 2026-03-10
📖 4 min read☕ Coffee break read

Imagine you are teaching a pair of identical twins how to bake a cake together. You show them a video of the process: the left twin holds the bowl while the right twin stirs.

Now, imagine you flip the video horizontally. Suddenly, the right twin is holding the bowl and the left twin is stirring. Because they are identical twins in a symmetrical kitchen, this flipped video shows a perfectly valid way to bake the cake, too.

The Problem:
Most current robot learning systems are like students who memorize the video frame-by-frame without understanding the logic. If you show them the "flipped" video (where the right twin holds the bowl), the robot might get confused. It might try to have the left arm hold the bowl and the right arm hold the bowl at the same time, or it might freeze because it doesn't recognize the new setup. It lacks the "common sense" that the two arms are interchangeable and the task is symmetrical.

The Solution: EquiBim
The paper introduces EquiBim, a new training method that teaches robots this "common sense" explicitly. Think of EquiBim as a strict but helpful coach who says to the robot: "Hey, if you see a task where the left arm does X and the right arm does Y, and then you see a mirrored version where the arms swap places, your brain must tell the arms to swap their actions too. If the scene flips, your plan must flip with it."

How It Works (The Analogy)

  1. The Mirror Test:
    Imagine the robot is looking at a scene through a mirror. EquiBim takes the robot's view, creates a perfect mirror image of it, and asks the robot: "Okay, if the world looks like this mirror image, what should the arms do?"

    • The Old Way: The robot guesses based on what it saw before, often getting it wrong because it didn't expect the mirror.
    • The EquiBim Way: The robot is forced to learn that if the world flips left-to-right, the action plan must also flip left-to-right. It's like a dance instructor telling a partner, "If I step left, you must step right. If I turn this way, you turn that way."
  2. No New Hardware, Just New Rules:
    Usually, to make a robot smarter, you have to build a new, more complex brain (a new neural network architecture). EquiBim is different. It's like adding a rulebook to an existing brain. You don't need to rebuild the robot's head; you just add a constraint during training that says, "Your answers must be consistent with the mirror image." This makes it easy to plug into any existing robot learning system.

  3. Why It Matters:

    • Robustness: In the real world, things aren't always perfect. Objects might be placed slightly differently, or the lighting might change. Because EquiBim teaches the robot to understand the symmetry of the task, it doesn't panic when the situation changes slightly. It knows, "Oh, the object moved to the left, so I just need to swap my arm roles."
    • Learning Faster: By understanding that the left and right sides are interchangeable, the robot effectively doubles its learning data. Every time it learns a move with the left arm, it automatically learns the mirrored move for the right arm.

The Results

The researchers tested this on a dual-armed robot (two arms working together) in both computer simulations and the real world.

  • In the simulation: The robots trained with EquiBim were much better at tasks like stacking blocks, passing objects, or pressing staplers. They were less likely to crash or get confused.
  • In the real world: When they tried tasks like handing a banana from one arm to the other, or hanging a toy chicken on a hook, the EquiBim-trained robots were much more reliable, especially when the objects were placed in new or unexpected positions.

The Bottom Line

EquiBim is a simple but powerful trick. It tells robots: "You have two identical arms, and the world is often symmetrical. Don't just memorize specific moves; learn the rule that if the world flips, your plan should flip too." This makes robots smarter, more adaptable, and much better at working together with two hands.