OPTIAGENT: A Physics-Driven Agentic Framework for Automated Optical Design

This paper introduces OPTIAGENT, a physics-driven agentic framework that leverages Large Language Models enhanced with a specialized dataset, hybrid training objectives, and a physics-guided reward system to automate the design of functional lens systems, effectively bridging the gap between human expertise and automated optical engineering.

Yuyu Geng, Lei Sun, Yao Gao, Xinxin Hu, Zhonghua Yi, Xiaolong Qian, Weijian Hu, Jian Bai, Kaiwei Wang

Published 2026-03-02
📖 4 min read☕ Coffee break read

Imagine you are trying to build a custom camera lens for a new smartphone. You need it to be sharp, capture a wide view, and let in just the right amount of light.

In the real world, this is a nightmare for computers. It's like trying to solve a 3D puzzle where every piece is made of glass, and if you move one piece by a fraction of a millimeter, the whole picture blurs. Traditionally, only highly trained human experts with years of experience could do this, and even then, it took them weeks of trial and error.

Recently, we got very smart computers called Large Language Models (LLMs) (like the AI behind ChatGPT). These AIs read millions of books and know the definitions of lenses. They can tell you, "A lens is made of glass and bends light." But if you ask them to design a working lens, they fail. They might give you a list of numbers that looks like a lens, but physically, it's impossible to build. It's like an AI writing a recipe for a cake that says "add 500 cups of flour"—theoretically it's a recipe, but in reality, it's a disaster.

Enter OPTIAGENT.

The authors of this paper created a new system called OPTIAGENT. Think of it as taking that smart AI and putting it through a rigorous "boot camp" specifically for physics. Here is how they did it, using some simple analogies:

1. The "Fill-in-the-Blanks" Training (Optical Prescription Completion)

Imagine you are teaching a student to be a master architect. Instead of just asking them to build a house from scratch, you give them a half-built house with missing bricks and ask them to figure out what goes in the empty spots.

  • What they did: They created a massive dataset (called OptiDesignQA) filled with real, working lens designs. They hid some numbers (like the curve of the glass or the thickness) and forced the AI to guess the missing pieces based on the rest of the design.
  • The Result: This forced the AI to stop just "guessing words" and start understanding the hidden rules of how glass pieces fit together.

2. The "Strict Coach" (Physics-Driven Rewards)

In normal AI training, the computer gets a "gold star" if it writes a sentence that sounds good. But in lens design, sounding good isn't enough; it has to work.

  • The Problem: If the AI makes a mistake, a normal AI might just keep going. OPTIAGENT has a "Strict Coach" (a reward system) that checks the design at every step.
    • Level 1 (Format): Did you write the numbers in the right order? If not, zero points.
    • Level 2 (Structure): Did you accidentally make the glass pieces overlap or have negative thickness? If yes, zero points.
    • Level 3 (Physics): Does the light actually focus where it's supposed to? If not, zero points.
  • The Analogy: It's like a video game where you can't move to the next level until you perfectly solve the physics puzzle. The AI learns that "looking smart" doesn't matter; "working correctly" is the only way to win.

3. The "Human-in-the-Loop" (Zemax Integration)

Even after the AI gets really good, it's still an AI. It might get the design 95% right.

  • The Strategy: OPTIAGENT doesn't try to be perfect on its own. Instead, it acts as a super-fast sketch artist. It generates a "good enough" starting point in seconds.
  • The Finish: It then hands this sketch to a professional, high-powered software (called Zemax) that does the final, tiny, precise adjustments.
  • The Result: You get a professional-grade lens design in minutes instead of weeks.

Why is this a big deal?

Before this, if you wanted a custom lens, you needed a PhD in optics and a team of engineers. Now, with OPTIAGENT, a regular person can say, "I need a lens that is this big, sees this wide, and is this bright," and the AI will generate a working blueprint that a machine can actually build.

In summary:
The paper teaches a smart AI to stop just "talking" about lenses and start "thinking" like a physicist. By forcing it to learn the rules of light and glass through a strict reward system, they turned a text-generating robot into a capable optical engineer.

Get papers like this in your inbox

Personalized daily or weekly digests matching your interests. Gists or technical summaries, in your language.

Try Digest →