Sim2Sea: Sim-to-Real Policy Transfer for Maritime Vessel Navigation in Congested Waters

The paper proposes Sim2Sea, a comprehensive framework featuring a GPU-accelerated simulator, a dual-stream spatiotemporal policy with safety-guided action masking, and targeted domain randomization, which successfully enables zero-shot sim-to-real transfer of autonomous navigation for a 17-ton unmanned vessel in congested maritime waters.

Xinyu Cui, Xuanfa Jin, Xue Yan, Yongcheng Zeng, Luoyang Sun, Siying Wei, Ruizhi Zhang, Jian Zhao, Haifeng Zhang, Jun Wang

Published 2026-03-05
📖 5 min read🧠 Deep dive

Here is an explanation of the Sim2Sea paper, translated into simple, everyday language with creative analogies.

🚢 The Big Picture: Teaching a Boat to Drive Itself

Imagine you want to teach a toddler how to drive a massive, heavy cruise ship through a crowded harbor filled with other boats, fishing nets, and narrow channels.

If you tried to teach them by throwing them into the real ocean, they would likely crash immediately. It's too dangerous, too expensive, and the real ocean is unpredictable.

Sim2Sea is a new "training school" that solves this problem. It allows a computer brain (an AI) to learn how to navigate a ship entirely inside a video game, and then—magically—step out of the game and drive a real 17-ton boat in the real world without crashing.

The authors call this "Sim-to-Real" transfer. Think of it like a flight simulator for pilots, but for ships, and it actually works perfectly on the first try.


🛠️ The Three Secret Ingredients

The researchers built a three-part system to make this magic happen. Here is how they did it:

1. The "Super-Fast" Video Game (The Simulator)

Most video games for ships are slow or fake. They don't feel like real boats. Real boats are heavy, they slide when they turn, and they are affected by wind and currents.

  • The Analogy: Imagine trying to learn to ride a bike on a tricycle that doesn't wobble. It's easy, but you'll crash when you get on a real bike.
  • The Fix: The team built a GPU-accelerated parallel simulator.
    • "Parallel" means they can run 65,000 different training sessions at the exact same time (like having 65,000 virtual boats learning simultaneously).
    • "High-Fidelity" means the physics are real. The virtual boat slides, turns, and reacts to water exactly like a real one.
    • Result: The AI learns faster than ever before because it gets millions of hours of practice in a single day.

2. The "Super-Eyes" and "Safety Guard" (The Brain)

The AI needs to see the world and make safe choices.

  • The Eyes (Dual-Stream Policy):
    • Stream A (Time): The AI looks at a video of the last few seconds to understand momentum. Analogy: A real driver knows that if they are going fast, they can't stop instantly. The AI learns this "inertia."
    • Stream B (Space): The AI looks at a "Bird's Eye View" map (like Google Maps) showing where other boats and land are.
    • The Mix: It combines the "feeling" of movement with the "picture" of the map to make a decision.
  • The Safety Guard (VO-Guided Action Masking):
    • The Problem: Sometimes, the AI gets excited and tries to do something crazy, like driving straight into a wall, just to see what happens.
    • The Fix: They installed a "Safety Guard" based on Velocity Obstacles (VO).
    • Analogy: Imagine playing a video game where the game automatically grays out (disables) all the buttons that would make you fall off a cliff. The AI can still choose from the safe buttons, but it cannot even think about the dangerous ones. This stops the AI from learning bad habits.

3. The "Chaos Training" (Domain Randomization)

This is the most important part for the "Sim-to-Real" jump.

  • The Problem: Even the best video game isn't perfect. The real ocean has weird currents, sensor noise, and delays. If the AI trains in a "perfect" game, it will fail in the messy real world.
  • The Fix: The researchers intentionally broke their simulator during training.
    • Analogy: Imagine a chef practicing a recipe. If they only cook with perfect ingredients in a quiet kitchen, they might fail when the stove is broken or the ingredients are slightly off. So, Sim2Sea trains the AI by randomly changing the water currents, adding fake sensor glitches, and making the boat heavier or lighter every single time.
    • Result: The AI becomes a "champion of chaos." When it finally meets the real ocean, it thinks, "Oh, the current is weird? I've seen weirder in training!" It adapts instantly.

🏆 The Results: From Game to Ocean

The team tested this on a 17-ton unmanned boat in real, crowded waters.

  • The Test: They took the AI that was only trained in the computer game and put it on the real boat. They did zero extra training on the real boat. This is called "Zero-Shot Transfer."
  • The Outcome:
    • The boat navigated smoothly.
    • It avoided collisions with other boats and land.
    • It reached its destination.
    • Crucially: When they tested versions without the "Chaos Training" or without the "Time-Sense," the boats crashed or drove erratically. This proved that both the safety guard and the chaos training were essential.

💡 The Takeaway

Sim2Sea is like a revolutionary driving school. It doesn't just teach a robot how to steer; it teaches it how to feel the water, how to see the future, and how to handle the unexpected. By training in a chaotic, high-speed virtual world, the robot is ready to handle the messy, real world immediately.

This is a huge step forward for autonomous ships, promising safer ports, fewer accidents, and more efficient shipping in the future.