ConEQsA: Concurrent and Asynchronous Embodied Questions Scheduling and Answering

This paper introduces ConEQsA, a novel agentic framework and benchmark designed to address the challenges of concurrent and asynchronous embodied question answering by leveraging shared memory and urgency-aware scheduling to outperform traditional sequential approaches in realistic multi-question scenarios.

Haisheng Wang, Dong Liu, Weiming Zhi

Published 2026-03-04
📖 5 min read🧠 Deep dive

Imagine you are a personal assistant for a busy family. In the old days, your job was simple: someone would ask, "Where are the car keys?" You would walk around the house, find them, and bring them back. Then, you would wait for the next question. This is how current AI robots work today; they answer one question at a time, one by one.

But in real life, life doesn't wait. While you are looking for the keys, someone might yell, "Is the stove on?" and another person might ask, "Where is the dog?" These questions arrive at different times, and some are much more urgent than others. If you keep doing things one by one, the person worried about the stove might have to wait too long, and you might waste time walking back and forth to the kitchen twice.

This paper introduces a new way for robots to handle this chaos. They call it ConEQsA (Concurrent and Asynchronous Embodied Questions Scheduling and Answering).

Here is the breakdown of their solution using simple analogies:

1. The Problem: The "One-Task-at-a-Time" Robot

Current robots are like a single-lane road. If a car (a question) is waiting, the next car has to wait in line. Even if the second car is an ambulance (an urgent question), it still has to wait for the first car to finish its slow journey. This is inefficient and dangerous in real-world scenarios.

2. The Solution: The "Super-Organized Chef"

The authors propose a new system called ConEQsA. Imagine a master chef in a busy kitchen.

  • The Menu (The Questions): Instead of cooking one dish, finishing it, and then starting the next, the chef receives a list of orders that keep coming in.
  • The Shared Memory (The Pantry): The chef doesn't just look at one order in isolation. They remember everything they've seen in the pantry. If they walk to the fridge to get milk for a smoothie, they also notice the eggs are missing. They remember that for the next order that needs eggs. They don't need to walk to the fridge again later; they already know the eggs are gone.
  • The Priority System (The Urgency Bell): The chef has a smart system that rings a bell for urgent orders. If a customer says, "My food is on fire!" (High Urgency), the chef drops everything to handle that, even if they were halfway through chopping vegetables for a salad.

3. How It Works (The Magic Ingredients)

The paper describes three main tricks the robot uses to be this efficient:

  • Shared Group Memory: Instead of forgetting everything after answering one question, the robot keeps a "notebook" of everything it sees. If Question A asks "Is the TV on?" and the robot walks past the TV to answer it, it writes down "TV is off" in the notebook. If Question B later asks "Is the TV on?", the robot checks the notebook and answers immediately without walking anywhere!
  • Smart Scheduling (The Priority Planner): The robot doesn't just pick questions randomly. It calculates a "score" for every question based on:
    • Urgency: Is this a fire alarm or a casual chat?
    • Scope: Is the answer nearby (local) or far away (global)?
    • Rewards: If I go to the kitchen, will I find answers for three different questions at once?
    • Dependencies: Do I need to find the key before I can open the door?
  • Targeted Exploration: The robot doesn't just wander aimlessly. It plans a path that is most likely to answer the most important questions while gathering info for the other questions along the way.

4. The New Test: The "CAEQs" Benchmark

To prove this works, the authors created a new test called CAEQs.

  • Imagine a video game with 40 different rooms (like a house, an office, a factory).
  • In each room, the robot gets 5 questions.
  • Some questions appear at the start, and others pop up later (asynchronously).
  • Some questions are marked "High Priority" (like a safety alert), and others are "Low Priority" (like "what color is the sofa?").
  • The robot is graded not just on getting the right answer, but on how fast it answers the urgent ones and how little walking it does to get all the answers.

5. The Results

When they tested their new "Super-Organized Chef" robot against the old "One-Task-at-a-Time" robots:

  • Faster: It answered urgent questions much faster.
  • Smarter: It wasted less time walking around because it reused information it already found.
  • More Efficient: It answered about 9% of questions just by looking at its memory, without taking a single step!

The Bottom Line

This paper is about teaching robots to stop being rigid and start being flexible. Instead of treating every question as a separate, isolated task, they teach the robot to see the "big picture," prioritize what matters most, and use its memory to save time and energy. It's the difference between a robot that is a slow, single-lane worker and a robot that is a fast, efficient, and responsive team leader.