Telogenesis: Goal Is All U Need

This paper proposes and validates "Telogenesis," an endogenous attention mechanism that generates adaptive priorities from internal epistemic gaps (ignorance, surprise, and staleness) without external rewards, demonstrating superior coverage and detection latency compared to fixed strategies while spontaneously recovering latent environmental volatility structures.

Zhuoran Deng, Yizhi Zhang, Ziyi Zhang, Wan Shen

Published Wed, 11 Ma
📖 5 min read🧠 Deep dive

Imagine you are a detective trying to solve a mystery in a giant, dark warehouse filled with 50 different rooms. You only have enough flashlight battery power to check one room at a time.

Your goal isn't just to "look around"; it's to figure out which room has changed the most recently. Maybe someone moved a box in Room 4, or a light flickered in Room 12. If you miss these changes, you fail.

The paper you shared, titled "Telogenesis: Goal Is All U Need," is about how a robot (or an AI) can decide which room to check next without being told what to do.

Here is the simple breakdown of their discovery:

1. The Old Way vs. The New Way

The Old Way (The "Coverage" Strategy):
Imagine a security guard who walks in a perfect circle, checking Room 1, then Room 2, then Room 3, and so on, no matter what.

  • Pros: Eventually, they see everything.
  • Cons: If a fire starts in Room 4, the guard might not get there for a long time if they are currently in Room 40. They are slow to react to changes.

The New Way (The "Telogenesis" Strategy):
Imagine a detective who doesn't have a list. Instead, they have an internal "hunch meter" that tells them where to look based on three feelings:

  1. Ignorance (The "I don't know" feeling): "I haven't looked at Room 12 in a while. I'm totally blind to it. I should check it."
  2. Surprise (The "Wait, what?" feeling): "I looked at Room 5 yesterday and it was blue. Today it's red! Something changed! I need to look at it again immediately."
  3. Staleness (The "It's been too long" feeling): "I haven't looked at Room 9 since breakfast. Even if I didn't see a change yesterday, it's been so long that it might have changed by now. I should check it."

The AI combines these three feelings into a single score. The room with the highest score gets the flashlight. This is Telogenesis: creating your own goals (where to look) from the inside out, rather than waiting for a boss to tell you.

2. The Big Surprise: It Depends on How You Measure Success

The researchers found something weird and fascinating about how we judge these detectives.

  • Scenario A: The "God's Eye View" Judge
    If a judge looks at the entire warehouse and says, "You failed because you didn't know the color of the walls in 40 rooms," the Old Way (circular walking) wins. It's better at knowing the average state of everything.
  • Scenario B: The "Real World" Judge
    But in the real world, the detective doesn't know what's in the rooms they haven't visited. The only thing that matters is: "How fast did you notice the fire?"
    When the researchers measured success by speed of detection, the New Way (the hunch-based detective) crushed the Old Way.
    • As the warehouse got bigger (more rooms), the Old Way got slower and slower.
    • The New Way stayed fast, no matter how big the warehouse got.

The Lesson: If you are limited in attention (like a human or a robot with limited battery), you shouldn't try to know everything. You should try to notice changes as fast as possible.

3. The "Magic" Discovery: Learning Without a Teacher

In the final experiment, the researchers made the detective even smarter. They let the detective learn which rooms change often.

  • The Setup: Some rooms were "chaotic" (changing every few minutes). Other rooms were "boring" (staying the same for hours). The detective was not told which was which.
  • The Result: The detective started paying more attention to the chaotic rooms and less to the boring ones.
  • How? The "Staleness" feeling (the "it's been too long" timer) automatically sped up for the chaotic rooms because they kept surprising the detective. The timer slowed down for the boring rooms.

The system taught itself the structure of the environment just by paying attention to its own confusion and surprise. It didn't need a teacher, a reward, or a manual. It just needed to notice what it didn't know.

The Takeaway

The title "Goal Is All U Need" is a play on a famous AI paper title ("Data is all you need").

The authors are saying: You don't need an external boss to tell you what to do. If you have a brain that tracks what it doesn't know (ignorance), what surprises it (error), and what it's ignored for too long (staleness), you can generate your own goals.

In a world where we can't look at everything at once, the best strategy isn't to try to know everything. It's to listen to your own curiosity and rush toward the things that are most likely to have changed.

In short: Don't wait for a goal to be given to you. Your own confusion and curiosity are the goals you need to survive.