← Latest papers
⚛️ quantum physics

Training single-electron and single-photon stochastic physical neural networks

This paper proposes and demonstrates the training of novel single-electron and single-photon stochastic physical neural networks, showing that using empirical outputs in the backward pass enables these noise-resilient architectures to achieve over 97% accuracy on MNIST digit classification despite high stochasticity and model uncertainty.

Original authors: Tong Dou, Shiro Kumara, Josh Burns, Ethan Sigler, Parth Girdhar, David Petty, Gerard Milburn, Jo Plested, Matt Woolley

Published 2026-04-14
📖 6 min read🧠 Deep dive

Original authors: Tong Dou, Shiro Kumara, Josh Burns, Ethan Sigler, Parth Girdhar, David Petty, Gerard Milburn, Jo Plested, Matt Woolley

Original paper licensed under CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). This is an AI-generated explanation of the paper below. It is not written or endorsed by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

The Big Idea: Building Brains Out of Dice and Light

Imagine you are trying to build a super-smart computer brain (a neural network) to recognize handwritten numbers, like the digits on a mail envelope.

Usually, we build these brains using silicon chips that act like perfect, deterministic calculators. If you give them the same input twice, they give the exact same answer twice. But these chips are hungry; they eat a lot of electricity and generate a lot of heat.

This paper asks a bold question: What if we built these brains using the actual, messy, random nature of the physical world? What if, instead of a perfect calculator, our "neurons" were like rolling dice or flipping a coin?

The authors propose a new type of computer called a Physical Neural Network (PNN). In these networks, the "neurons" aren't software code; they are real physical devices (like tiny electronic dots or beams of light) that behave randomly.

The Problem: How Do You Train a Random Brain?

Here is the tricky part: In a normal computer, if you make a mistake, the computer knows exactly why and how to fix it. It's like a student who gets a math problem wrong, looks at the steps, and says, "Ah, I added 2+2 wrong."

But in a Stochastic PNN, the "neuron" is a coin flip.

  • Input: "Is this a 7?"
  • Neuron: Flips a coin.
  • Output: "Heads (Yes)!" or "Tails (No)!"

If the coin lands on "No" and the answer was actually "Yes," the computer can't just look at the math and say, "I should have added differently." The coin is random! You can't calculate the gradient (the direction to fix the mistake) easily because the output is just a random 0 or 1.

It's like trying to teach a student who only answers questions by rolling dice. How do you tell them to study harder if they just got lucky?

The Solution: Three New Types of "Dice Rollers"

The authors designed three specific physical devices to act as these random neurons:

  1. The Single-Electron Transistor (The Tiny Electron Gate):
    Imagine a tiny room (a quantum dot) where electrons (tiny charged particles) want to get in. The "pre-activation" (the input signal) is like a gate that controls how likely an electron is to tunnel through.

    • The Randomness: Sometimes an electron sneaks in; sometimes it doesn't. It's a game of chance.
    • The Output: If an electron is inside, the neuron says "1" (Active). If the room is empty, it says "0" (Inactive).
  2. The Single-Photon Detector (The Light Catcher):
    Imagine a very sensitive camera that can detect just one photon (a particle of light).

    • The Randomness: You shine a dim light at it. Sometimes the camera clicks (detects a photon), sometimes it doesn't. It's based on the random nature of light.
    • The Output: Click = "1", No Click = "0".
  3. The True Single-Photon Neuron (The Quantum Switch):
    This is the most futuristic one. Imagine a machine that shoots exactly one photon at a time. This photon hits a special mirror (a beam splitter) that can either let it pass or reflect it.

    • The Randomness: The photon has a 50/50 (or adjustable) chance of going one way or the other.
    • The Output: If it goes to detector A, the neuron is "1". If it goes to detector B, it's "0".

The Training Strategy: How to Teach the Dice

Since the neurons are random, the authors had to invent new ways to "teach" the network. They tested three main strategies:

1. The "Perfect Knowledge" Method (True Probability)

Imagine you are the teacher, and you know the exact odds of the coin landing on heads (e.g., 70%). Even though the student (the computer) only sees the result of one flip, you, the teacher, know the underlying probability.

  • How it works: During training, the computer ignores the single random flip and instead uses the mathematical average (the 70% chance) to figure out how to adjust the weights.
  • Result: This works great, but it requires knowing the exact physics of the device perfectly, which is hard in the real world.

2. The "Empirical" Method (Learning from Samples)

Now, imagine you don't know the odds. You only see the results of the coin flips.

  • How it works: The computer looks at the actual results (e.g., "It flipped heads 3 times out of 10"). It uses these real-world samples to guess the direction to move. It's like a student who learns by trial and error, keeping a tally of what happened.
  • Result: This is very practical because you don't need to know the perfect physics. The paper shows that even with very few samples (just a few coin flips), the network can still learn to recognize digits with over 97% accuracy.

3. The "Straight-Through" Method (The Magic Trick)

This is a clever shortcut.

  • How it works: During the "forward pass" (making a guess), the computer lets the coin flip happen (it's random). But during the "backward pass" (learning from mistakes), the computer pretends the coin flip was smooth and predictable, as if it were a normal calculator.
  • Result: It's a bit of a lie, but it works surprisingly well. It allows the network to learn quickly without getting stuck on the randomness.

The Results: Why This Matters

The authors tested these ideas on a classic task: recognizing handwritten numbers (the MNIST dataset).

  • High Accuracy: Even though the neurons were "noisy" and random, the network achieved 97%+ accuracy.
  • Robustness: The network didn't break when the "coins" were biased or the "light" was dim. It handled the noise naturally.
  • Few Trials Needed: You don't need to flip the coin a million times to get a good answer. Even flipping it just a few times per layer was enough to train the brain effectively.

The Takeaway: Embracing the Chaos

The main message of this paper is that we don't need to fight against the randomness of the physical world to build smart computers. In fact, we can embrace it.

Think of it like this:

  • Old Way: Build a perfect, rigid, energy-hungry robot that tries to eliminate all mistakes.
  • New Way: Build a flexible, low-energy system that uses the natural "fuzziness" of electrons and photons as a feature, not a bug.

By treating the randomness as a built-in part of the brain (like a probabilistic switch), we can create computers that are potentially much faster and use a fraction of the energy of today's supercomputers. This opens the door to "green" AI that runs on the fundamental laws of physics rather than brute-force digital calculation.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →