Reward Under Attack: Analyzing the Robustness and Hackability of Process Reward Models

This paper reveals that state-of-the-art Process Reward Models (PRMs) are systematically exploitable by adversarial optimization, functioning primarily as fluency detectors rather than reasoning verifiers due to a critical dissociation between stylistic changes and ground-truth accuracy, prompting the release of a diagnostic framework and benchmark to address these vulnerabilities.

Rishabh Tiwari, Aditya Tomar, Udbhav Bamba, Monishwaran Maheswaran, Heng Yang, Michael W. Mahoney, Kurt Keutzer, Amir Gholami2026-03-10🤖 cs.LG

From ARIMA to Attention: Power Load Forecasting Using Temporal Deep Learning

This paper empirically demonstrates that a Transformer model utilizing self-attention mechanisms outperforms traditional ARIMA and recurrent neural network approaches (LSTM, BiLSTM) in short-term power load forecasting on PJM data, achieving a superior 3.8% MAPE and highlighting the effectiveness of attention-based architectures for capturing complex temporal patterns.

Suhasnadh Reddy Veluru, Sai Teja Erukude, Viswa Chaitanya Marella2026-03-10🤖 cs.LG

T-REX: Transformer-Based Category Sequence Generation for Grocery Basket Recommendation

The paper proposes T-REX, a novel transformer-based architecture that addresses the unique challenges of online grocery shopping by generating personalized category-level basket recommendations through dynamic sequence splitting, adaptive positional encoding, and causal masking to effectively capture both short-term dependencies and long-term user preferences.

Soroush Mokhtari, Muhammad Tayyab Asif, Sergiy Zubatiy2026-03-10🤖 cs.LG

A new Uncertainty Principle in Machine Learning

This paper proposes a new "Uncertainty Principle" in machine learning, asserting that the sharpness of a minimum in polynomial-based problems is inversely related to the smoothness of the optimization landscape, a phenomenon caused by the degeneracy of Heaviside and sigmoid expansions that traps gradient descent and necessitates a physics-based rather than purely computational approach to solving these scientific problems.

V. Dolotin, A. Morozov2026-03-10🤖 cs.LG

HEARTS: Benchmarking LLM Reasoning on Health Time Series

The paper introduces HEARTS, a comprehensive benchmark comprising 16 real-world health datasets and 110 tasks across four reasoning capabilities, which reveals that current large language models significantly underperform specialized models in health time series analysis due to struggles with multi-step temporal reasoning and reliance on simple heuristics.

Sirui Li, Shuhan Xiao, Mihir Joshi, Ahmed Metwally, Daniel McDuff, Wei Wang, Yuzhe Yang2026-03-10🤖 cs.LG

Roots Beneath the Cut: Uncovering the Risk of Concept Revival in Pruning-Based Unlearning for Diffusion Models

This paper reveals that pruning-based unlearning in diffusion models is inherently insecure because the locations of pruned weights act as side-channel signals that enable a novel, data-free, and training-free attack to fully revive erased concepts, prompting a call for safer pruning mechanisms that conceal these locations.

Ci Zhang, Zhaojun Ding, Chence Yang, Jun Liu, Xiaoming Zhai, Shaoyi Huang, Beiwen Li, Xiaolong Ma, Jin Lu, Geng Yuan2026-03-10🤖 cs.LG

Quantum Deep Learning: A Comprehensive Review

This comprehensive review defines Quantum Deep Learning (QDL) through a four-paradigm taxonomy, critically assesses its theoretical foundations and experimental implementations across various hardware systems, and outlines a verification-aware roadmap for transitioning from near-term demonstrations to scalable, fault-tolerant applications.

Yanjun Ji, Zhao-Yun Chen, Marco Roth, David A. Kreplin, Christian Schiffer, Martin King, Oliver Anton, M. Sahnawaz Alam, Markus Krutzik, Dennis Willsch, Ludwig Mathey, Frank K. Wilhelm, Guo-Ping Guo2026-03-10⚛️ quant-ph

Trust Aware Federated Learning for Secure Bone Healing Stage Interpretation in e-Health

This paper proposes a trust-aware federated learning framework that utilizes an Adaptive Trust Score Scaling and Filtering mechanism to secure bone healing stage interpretation in e-Health by mitigating the impact of unreliable or adversarial participants while maintaining model integrity and predictive performance.

Paul Shepherd, Tasos Dagiuklas, Bugra Alkan, Joaquim Bastos, Jonathan Rodriguez2026-03-10🤖 cs.LG

HURRI-GAN: A Novel Approach for Hurricane Bias-Correction Beyond Gauge Stations using Generative Adversarial Networks

The paper introduces HURRI-GAN, a novel TimeGAN-based framework that corrects systemic biases in high-resolution hurricane simulation models like ADCIRC, enabling accurate, near real-time storm surge forecasting and bias extrapolation beyond gauge station locations while significantly reducing computational runtime.

Noujoud Nadera, Hadi Majed, Stefanos Giaremis, Rola El Osta, Clint Dawson, Carola Kaiser, Hartmut Kaiser2026-03-10🤖 cs.LG