Uncertainty Mitigation and Intent Inference: A Dual-Mode Human-Machine Joint Planning System

This paper proposes a dual-mode human-robot joint planning system that combines an LLM-assisted active elicitation mechanism with real-time intent inference to effectively mitigate task-relevant knowledge gaps and latent human intent, significantly reducing interaction costs and execution time in open-world environments.

Zeyu Fang, Yuxin Lin, Cheng Liu, Beomyeol Yu, Zeyuan Yang, Rongqian Chen, Taeyoung Lee, Mahdi Imani, Tian Lan

Published Tue, 10 Ma
📖 4 min read☕ Coffee break read

Imagine you are a rescue worker trying to navigate a chaotic, smoke-filled building with a robot drone. In the old days, the robot was like a very obedient but slightly confused dog: you had to give it very specific commands ("Go to the red box, then turn left"), and if you didn't, it would either get stuck or guess wrong. It couldn't really "think" about what you meant or ask for help.

This paper introduces a new kind of robot teammate that acts more like a smart, proactive human partner. It solves two big problems that happen when humans and robots work together in the real world: not knowing what to do and not knowing what the human is thinking.

Here is how their system works, broken down into two main "modes" using simple analogies:

Mode 1: The "Clarifying Detective" (Uncertainty Mitigation)

The Problem: You tell the robot, "Go get the medicine and bring it to the injured person." But there are three boxes in the room, and you didn't say which one has the medicine. Also, there's a net and some smoke blocking the path, and you aren't sure if the robot can fly through them.

The Old Way:

  • The "Guessing" Robot: Just picks a box (maybe the wrong one) and flies into the smoke (crashing).
  • The "Over-Questioning" Robot: Asks you about every single thing ("Is the blue box safe? Is the red box safe? Is the net safe?"). This wastes time and annoys you.

The New Way (This Paper's Solution):
The robot acts like a smart detective.

  1. It thinks first: It uses a "brain" (an AI) to guess which box is most likely the medicine box.
  2. It asks the right question: Instead of asking about everything, it calculates the most efficient path. It realizes, "If I ask about the fire, I might not need to ask about the net later." So, it only asks you: "Is the fire blocking the path?"
  3. The Result: It solves the puzzle with half the questions compared to other methods. It saves time and energy by only asking for the specific information it needs to make a safe plan.

Mode 2: The "Mind-Reading Partner" (Intent Inference)

The Problem: You and the robot are working together to save a person. You start walking toward the injured person, but you haven't said a word. The robot needs to decide: Should I follow you? Should I go get the bandages? Should I clear the path?

The Old Way:

  • The "Clueless" Robot: It just waits for you to speak. If you don't speak, it stands still or does something random. It doesn't realize you are already moving toward the patient.

The New Way (This Paper's Solution):
The robot acts like a mind-reading partner who watches your body language.

  1. It watches your moves: It tracks where you are walking and which way you are facing.
  2. It guesses your goal: It calculates, "You are walking toward the injured person. You probably want me to help there."
  3. It acts without being told:
    • If the task is cooperative (both need to be there), the robot rushes to help you immediately.
    • If the task is independent (you are doing one thing, it can do another), it doesn't get in your way. Instead, it grabs the bandages from the other side of the room so you don't have to.
  4. The Result: You don't have to stop and give orders. The robot just knows what to do, making the whole team move faster and smoother.

The Real-World Test

The researchers didn't just write this on paper; they built a real drone and tested it in a simulated building and a real room with obstacles like nets and smoke.

  • The "Detective" Mode: The robot asked 52% fewer questions than the "over-questioning" robot but still got the job done 100% of the time.
  • The "Mind-Reader" Mode: The team finished their mission 25% faster because the robot stopped waiting for orders and started helping immediately.

The Big Picture

Think of this system as upgrading a robot from a remote-controlled car (which needs constant, perfect instructions) to a co-pilot (which can read the map, ask smart questions when confused, and anticipate your next move).

By combining smart questioning (to fix confusion) and body-language reading (to guess intent), this system creates a robot that feels less like a machine and more like a true teammate you can trust in a crisis.