← Latest papers
⚛️ quantum physics

Planted-solution SAT and Ising benchmarks from integer factorization

This paper introduces a scalable and verifiable family of planted-solution benchmarks for SAT solvers and Ising optimization, derived from integer factorization constraints, which exhibit exponential runtime growth relative to the bit-length of the factors.

Original authors: Itay Hen

Published 2026-04-14
📖 4 min read🧠 Deep dive

Original authors: Itay Hen

Original paper licensed under CC BY 4.0 (http://creativecommons.org/licenses/by/4.0/). This is an AI-generated explanation of the paper below. It is not written or endorsed by the authors. For technical accuracy, refer to the original paper. Read full disclaimer

Imagine you are trying to build a massive, intricate puzzle. Usually, when researchers test how good a computer is at solving puzzles, they either give it a pile of random junk (which is hard to verify) or a puzzle they made up that looks nice but doesn't get harder in a predictable way.

This paper introduces a new, super-organized puzzle based on something we all know: multiplication.

Here is the breakdown of what the authors did, using simple analogies:

1. The Core Idea: The "Reverse Multiplication" Puzzle

Think of multiplication like a factory assembly line. You take two numbers (let's call them Prime A and Prime B), run them through a machine, and out comes a big number (Product N).

  • The Normal Way: You know A and B, you press "Go," and the machine spits out N. Easy.
  • The Puzzle (Factorization): You are given only N. You have to figure out what A and B were. This is the "hard" problem that protects credit card numbers on the internet.

The authors built a puzzle where the computer has to act like a detective. It has to find the two secret numbers (A and B) that, when multiplied, create N. But here's the trick: The authors know the answer. They planted the solution (A and B) inside the puzzle. This means they can check if the computer is right or wrong instantly.

2. How They Built the Puzzle: The "Carry-Over" Chain Reaction

To turn multiplication into a puzzle a computer can solve, they broke it down into tiny logical steps (like "Is this bit 0 or 1?").

Imagine you are doing long multiplication by hand on a piece of paper. When you multiply two columns, sometimes the result is too big for that column, so you have to "carry over" a number to the next column.

  • The Magic: In this puzzle, that "carry-over" isn't just a small note; it's a domino effect.
  • A tiny change in the first column can send a ripple of carries all the way across the page, affecting columns far away.
  • The authors realized that these "ripples" create a massive, complex web of connections. It's like a game of telephone where a whisper at the start of the line gets amplified and distorted as it travels to the end.

3. Why It's a Great Test (The "Stress Test")

The authors wanted to see how fast modern computers (called SAT solvers) could solve these puzzles.

  • The Growth: They found that as they made the numbers slightly bigger (adding just one or two digits), the puzzle didn't just get a little harder; it got exponentially harder.
  • The Analogy: Imagine climbing a ladder. In a normal puzzle, every rung is the same height. In this puzzle, every time you add a rung, the ladder doubles in height.
  • The Result: When they tested it, the computers took roughly twice as long to solve the puzzle for every single extra digit they added. This is exactly the kind of difficulty researchers need to test if new quantum computers or super-advanced AI can actually break encryption.

4. Two Ways to Look at the Puzzle

The paper is special because it translates the same puzzle into two different languages:

  1. SAT (Logic Language): Like a giant "True/False" checklist.
  2. Ising (Physics Language): Like a magnetic puzzle where you have to align tiny magnets (spins) to find the lowest energy state.

This is like giving a mechanic the same car engine problem, but describing it once in English and once in Spanish. It allows researchers to test different types of computers (logic-based vs. physics-based) on the exact same problem to see which one is better.

5. The "Blueprint"

The authors didn't just solve one puzzle; they built a machine that generates infinite puzzles.

  • You tell the machine: "I want a puzzle with 20-digit numbers."
  • It instantly builds a unique, verifiable puzzle with a known answer.
  • It's scalable: You can make it easy (small numbers) or impossible (huge numbers) just by turning a dial.

Summary

Think of this paper as the invention of a perfectly calibrated stress test for computers.

Before this, testing computers on factorization was like trying to guess how strong a bridge is by throwing random rocks at it. Now, the authors have built a crane that drops weights of exact, known sizes onto the bridge. They can see exactly when the bridge bends, how much it bends, and if it breaks, all while knowing exactly what the "correct" answer should be.

This helps scientists understand the limits of current technology and prepares us for the future, where we might need to know if our digital locks are truly safe against super-powerful new computers.

Drowning in papers in your field?

Get daily digests of the most novel papers matching your research keywords — with technical summaries, in your language.

Try Digest →