Photometric Redshift PDFs via Neural Network Classification for DESI Legacy Imaging Surveys and Pan-STARRS

This paper introduces a neural network classification method that generates well-calibrated, multi-modal photometric redshift probability density functions for the DESI Legacy Imaging Surveys and Pan-STARRS, achieving superior accuracy and uncertainty quantification compared to traditional regression approaches by leveraging extensive spectroscopic training data and multi-wavelength photometry.

Original authors: Da-Chuan Tian, Zhong-Lue Wen, Jun-Qing Xia

Published 2026-05-07
📖 5 min read🧠 Deep dive

Original authors: Da-Chuan Tian, Zhong-Lue Wen, Jun-Qing Xia

Original paper licensed under CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). This is an AI-generated explanation of the paper below. It is not written or endorsed by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

The Big Picture: Guessing How Far Away Stars Are

Imagine you are looking at a crowd of people from a great distance. You can see their clothes and how bright they are, but you can't see their faces clearly. You want to know how far away each person is.

In astronomy, this is the problem of photometric redshift. Astronomers take pictures of billions of galaxies using different colored filters (like taking photos through red, blue, and green glasses). They want to know how far away each galaxy is based only on these colors and brightness levels.

The problem is that a galaxy can look "red" because it is very far away (its light has stretched), or because it is actually close but just happens to be a red, dusty galaxy. This is called a "degeneracy"—two different things looking the same.

The New Tool: A "Smart Sorter" Instead of a "Calculator"

Traditionally, computers tried to guess the exact distance of a galaxy, like a calculator giving you a single number (e.g., "500 million light-years"). But if the computer is wrong, it doesn't tell you how wrong it might be.

The authors of this paper built a new method called Neural Network Classification (NNC). Instead of acting like a calculator, their computer acts like a smart sorter.

  1. The Bins: Imagine a long shelf with 400 small boxes lined up, representing different distances (redshifts).
  2. The Job: Instead of picking one box, the computer looks at a galaxy and says, "I think there is a 60% chance it belongs in Box 100, a 30% chance in Box 101, and a 10% chance in Box 99."
  3. The Result: This gives a Probability Density Function (PDF). It's like a weather forecast that says, "There's a 60% chance of rain, 30% chance of clouds, 10% chance of sun," rather than just saying "It will rain." This tells astronomers not just the best guess, but how confident they should be.

The Secret Sauce: A Better Training Class

To teach this computer, you need a "training class" of galaxies where we already know the exact distance (measured by powerful spectroscopes).

  • The Old Class: Before this paper, the training class was mostly made of galaxies from the SDSS survey. It was like a class full of elementary school students. It was great for teaching about nearby things, but it had very few "high schoolers" (distant galaxies).
  • The New Class: The authors used data from DESI DR1, a massive new survey. This added millions of new "high schoolers" to the training class.
  • The Result: Because the computer was trained on a much wider variety of galaxies (including very distant ones), it became much better at guessing distances for the whole universe, especially for things far away.

The Two Surveys: Deep vs. Wide

The team tested their method on two different "cameras":

  1. LSDR10 (The Deep Camera): This camera takes very sharp, deep pictures of a specific area. It sees faint, distant objects clearly.
    • Result: The computer was incredibly accurate here. It was like using a high-end microscope.
  2. Pan-STARRS (The Wide Camera): This camera sees a much larger area of the sky, but the pictures are a bit shallower (less detailed).
    • The Fix: To help the computer with the Wide Camera, the authors added infrared data (heat signatures) from the unWISE survey.
    • The Analogy: Imagine trying to identify a fruit by color alone. A red apple and a red tomato look the same. But if you can also feel the temperature (infrared), you can tell them apart. Adding this "heat" data helped the computer distinguish between different types of galaxies much better, reducing errors by about 22%.

Why This Matters

The paper shows that this new "Smart Sorter" method is better than older methods (like Random Forests or standard neural networks) for two main reasons:

  • It handles confusion: When a galaxy looks like two different things at once (a common problem), the computer doesn't just guess one wrong answer. It shows a "double peak" in its probability, telling the astronomer, "It could be here OR there, I'm not sure."
  • It knows its limits: The computer is very good at telling you when it is confident and when it is guessing.

The Final Product: A Unified Map

The authors didn't just write a paper; they built a massive catalog. They combined the data from both cameras into one giant map of over 550 million galaxies.

They used a "hierarchical strategy" (a priority list):

  • If a galaxy is in the "Deep Camera" area, they use the most detailed model.
  • If it's only in the "Wide Camera" area, they use the model with the infrared help.
  • If it's in both, they pick the best one.

Summary

The authors created a new AI tool that sorts galaxies into distance "bins" instead of guessing a single number. By training it on a massive new dataset of known galaxies (DESI) and adding infrared "heat" data, they made the most accurate distance map of the universe to date for these specific surveys. This map is now available for other scientists to use in studying how the universe is expanding and evolving.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →