Riemannian Dueling Optimization

This paper introduces Riemannian Dueling Normalized Gradient Descent (RDNGD) and Riemannian Dueling Frank-Wolfe (RDFW) algorithms to extend dueling optimization from Euclidean spaces to Riemannian manifolds, providing theoretical complexity guarantees and demonstrating effectiveness through numerical experiments on both synthetic and real-world applications.

Yuxuan Ren, Abhishek Roy, Shiqian Ma

Published 2026-03-03
📖 5 min read🧠 Deep dive

Imagine you are trying to find the lowest point in a vast, foggy valley. In the world of standard math and machine learning, you usually have a map or a compass that tells you exactly which way is "down" (the gradient). You take a step, check the map, take another step, and eventually, you find the bottom.

But what if you don't have a map? What if you can't even see the height of the ground? All you have is a friend who can only answer one question: "Is point A lower than point B?"

This is the challenge of Dueling Optimization. You don't know the numbers; you only know the preferences.

Now, imagine that valley isn't flat like a park. It's a curved surface, like the skin of a basketball or the inside of a saddle. This is a Riemannian Manifold. In this curved world, "straight lines" don't exist; you have to walk along curves called geodesics.

This paper introduces a new way to find the bottom of that curved valley when you only have your "preference friend" and no map.

The Core Problem: The Curved, Blind Hiker

The authors, Yuxuan Ren, Abhishek Roy, and Shiqian Ma, realized that many modern AI problems (like training robots or recommending movies) happen on these curved surfaces.

  • Robots: A robot arm moves in 3D space. Its possible positions form a curved shape (like a sphere).
  • Recommendations: To understand complex user tastes, computers often use "hyperbolic space" (a weird, expanding geometry).

In these places, standard math tools break. If you try to walk "straight" on a sphere, you fall off. And if you can't see the height (the loss function), you can't use standard gradient descent. You only have duels: "Is this robot pose better than that one?"

The Solution: Two New Algorithms

The paper proposes two main strategies to solve this "Curved, Blind Hiker" problem.

1. The "Gentle Nudge" Method (RDNGD)

Imagine you are standing on the curved hill. You can't see down, so you ask your friend: "If I take a tiny step in this random direction, is it better than taking a tiny step in the exact opposite direction?"

  • The Trick: You pick a random direction, take a tiny step forward and a tiny step backward. You ask the friend: "Which one was better?"
  • The Result: If the "forward" step was better, you know the "downhill" direction is roughly in that direction.
  • The Innovation: The authors figured out how to do this on a curved surface. They use a mathematical tool called the Exponential Map to take steps that stay on the surface (like walking along the Earth's surface rather than drilling through the core).
  • Why it's special: They proved that even with this "blind" guessing, you can mathematically guarantee that you will eventually find the bottom, and they calculated exactly how many steps it would take.

2. The "No-Projection" Method (RDFW)

Sometimes, the rules of the game are strict. Imagine you are on a sphere, and you are only allowed to stay on the surface. If you take a step that goes slightly inside the sphere, you have to be "projected" (pushed) back onto the surface. This "pushing back" is computationally expensive and slow.

The authors created a second method, Riemannian Dueling Frank-Wolfe, which is like a "projection-free" hiker.

  • The Analogy: Instead of walking in a direction and then getting pushed back to the surface, this method asks: "If I could walk in a straight line from here, where would I hit the boundary of the allowed area?"
  • It finds the best "boundary point" based on the preference duel and walks directly toward it. This is much faster for complex shapes where "pushing back" is hard to calculate.

Real-World Examples

The paper shows these methods work in two very different scenarios:

  1. Hacking Neural Networks (The "Adversarial Attack"):
    Imagine a hacker trying to trick an AI into misidentifying a stop sign as a speed limit sign. The hacker can't see the AI's internal "score" (the loss function), they can only ask: "Does this slightly altered image fool the AI more than that one?"

    • The Result: The new algorithm found the "perfect" trick image much faster than previous methods, using fewer questions to the AI.
  2. Leveling a Horizon (The "Camera Fix"):
    Imagine taking a photo of a sunset, but the camera was tilted. You want to rotate the image until the horizon is perfectly straight. You don't have a "tilt meter." You just have a human (or a simple algorithm) who looks at two rotated versions and says, "This one looks straighter."

    • The Result: The algorithm rotated the image on a mathematical "sphere" of possible rotations and found the perfect straight line using only these "which looks better?" comparisons.

Why This Matters

Before this paper, if you wanted to optimize something on a curved surface (like a robot arm or a complex recommendation system) and you didn't have exact numbers (only preferences), you were stuck. You had to use slow, inefficient guesswork.

This paper provides the first reliable toolkit for these situations. It bridges the gap between:

  1. Curved Geometry (Riemannian manifolds).
  2. Blind Optimization (Dueling/Preference-based feedback).

It's like giving a hiker a compass that works even when they are blindfolded and walking on a curved planet. This opens the door for smarter robots, better AI recommendations, and more efficient machine learning in the real world.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →