Learning to Decode Quantum LDPC Codes Via Belief Propagation
This paper proposes a reinforcement learning-based decoder for quantum LDPC codes that formulates decoding as a Markov decision process with local syndrome-driven states and second-order neighborhood updates to overcome convergence issues caused by quantum degeneracy and short cycles, achieving superior performance and faster convergence than traditional schedules while maintaining competitive complexity.