cs.LG papers | Gist.Science

Beyond Cross-Validation: Adaptive Parameter Selection for Kernel-Based Gradient Descents

This paper proposes a novel, implementable adaptive parameter selection strategy for kernel-based gradient descent that integrates bias-variance analysis with the splitting method and empirical effective dimension to achieve optimal generalization error bounds across diverse kernels, target functions, and error metrics.

Xiaotong Liu, Yunwen Lei, Xiangyu Chang + 1 more2026-03-05🤖 cs.LG

Heterogeneous Time Constants Improve Stability in Equilibrium Propagation

This paper introduces heterogeneous time steps (HTS) to Equilibrium Propagation, demonstrating that assigning neuron-specific time constants drawn from biologically motivated distributions improves training stability and robustness while maintaining competitive performance.

Yoshimasa Kubo, Suhani Pragnesh Modi, Smit Patel2026-03-05🤖 cs.AI

Surprisal-Rényi Free Energy

This paper introduces the Surprisal-Rényi Free Energy (SRFE), a novel log-moment-based functional that bridges forward and reverse Kullback-Leibler divergences by revealing a mean-variance tradeoff and providing a variational characterization that controls large deviations in code-length, thereby clarifying the geometric and statistical structure underlying these distinct learning objectives.

Shion Matsumoto, Raul Castillo, Benjamin Prada + 1 more2026-03-05🤖 cs.LG

A Short Note on a Variant of the Squint Algorithm

This paper introduces a simple variant of the Squint algorithm for the classic expert problem and proves, via a straightforward modification of the original proof, that it achieves a regret bound similar to that of a recent variant of the NormalHedge algorithm.

Haipeng Luo2026-03-05🤖 cs.LG

Scalable Contrastive Causal Discovery under Unknown Soft Interventions

This paper proposes a scalable, contrastive causal discovery model that leverages paired observational and single-regime soft interventional data to construct globally consistent causal structures, theoretically proving its ability to recover identifiable edges and outperform non-contrastive methods in both in-distribution and out-of-distribution scenarios.

Mingxuan Zhang, Khushi Desai, Sopho Kevlishvili + 1 more2026-03-05🤖 cs.LG

[Re] FairDICE: A Gap Between Theory And Practice

This replication study of FairDICE, a multi-objective offline reinforcement learning algorithm, reveals that while its theoretical claims hold, a critical code error initially reduced it to standard behavior cloning and underspecified hyperparameters hindered reproducibility, though corrected experiments demonstrate its potential to scale to complex environments despite a reliance on online tuning.

Peter Adema, Karim Galliamov, Aleksey Evstratovskiy + 1 more2026-03-05🤖 cs.LG

Half the Nonlinearity Is Wasted: Measuring and Reallocating the Transformer's MLP Budget

This paper demonstrates that a significant portion of transformer MLP nonlinearity is redundant and context-dependent, showing that a lightweight gating mechanism can dynamically replace these computations with linear surrogates to reduce computational waste or, when applied strategically with full retraining, actively improve model performance by eliminating harmful nonlinearities.

Peter Balogh2026-03-05🤖 cs.LG

Graph Hopfield Networks: Energy-Based Node Classification with Associative Memory

This paper introduces Graph Hopfield Networks, an energy-based framework that unifies associative memory retrieval with graph Laplacian smoothing to achieve state-of-the-art node classification performance and enhanced robustness across diverse graph benchmarks.

Abinav Rao, Alex Wa, Rishi Athavale2026-03-05🤖 cs.AI

Biased Generalization in Diffusion Models

This paper challenges the conventional practice of stopping diffusion model training at the minimum test loss by identifying a "biased generalization" phase where models continue to lower loss while overfitting to training data, a phenomenon driven by the sequential nature of feature learning that poses risks for privacy-critical applications.

Jerome Garnier-Brun, Luca Biggio, Davide Beltrame + 2 more2026-03-05🤖 cs.LG

When Shallow Wins: Silent Failures and the Depth-Accuracy Paradox in Latent Reasoning

This paper reveals that state-of-the-art mathematical reasoning models often achieve high benchmark accuracy through computationally unstable and unfaithful pathways, masking significant rates of silent failures and demonstrating that increased model scale does not necessarily improve reliability or correctness.

Subramanyam Sahoo, Aman Chadha, Vinija Jain + 1 more2026-03-05🤖 cs.AI

Minimax Optimal Strategy for Delayed Observations in Online Reinforcement Learning

This paper proposes a minimax optimal algorithm for tabular reinforcement learning with delayed state observations that combines augmentation and upper confidence bound methods to achieve a regret bound of $\tilde{\mathcal{O}}(H \sqrt{D_{\max} SAK})$ , which is proven to be optimal up to logarithmic factors.

Harin Lee, Kevin Jamieson2026-03-05🤖 cs.LG

Beyond Pixel Histories: World Models with Persistent 3D State

The paper introduces PERSIST, a novel world model paradigm that simulates the evolution of a latent 3D scene to overcome the spatial memory and consistency limitations of existing video generation methods, thereby enabling coherent, long-horizon interactive experiences with persistent 3D state and geometry-aware control.

Samuel Garcin, Thomas Walker, Steven McDonagh + 5 more2026-03-05🤖 cs.AI

Optimal trajectory-guided stochastic co-optimization for e-fuel system design and real-time operation

This paper introduces MasCOR, a machine-learning-assisted framework that overcomes the computational limitations of traditional mathematical programming to enable rapid, near-optimal co-optimization of e-fuel system design and real-time operation under renewable uncertainty, demonstrating site-specific strategies for cost-effective carbon-neutral methanol production across diverse European locations.

Jeongdong Kim, Minsu Kim, Jonggeol Na + 1 more2026-03-05🤖 cs.AI

When Small Variations Become Big Failures: Reliability Challenges in Compute-in-Memory Neural Accelerators

This paper addresses the critical reliability challenges in Compute-in-Memory neural accelerators caused by device non-idealities by demonstrating the disproportionate impact of small variations on safety-critical workloads and proposing cross-layer solutions, including a selective write-verify mechanism (SWIM) and noise-aware training, to ensure robust and efficient deployment.

Yifan Qin, Jiahao Zheng, Zheyu Yan + 3 more2026-03-05🤖 cs.LG

Quantifying Ranking Instability Across Evaluation Protocol Axes in Gene Regulatory Network Benchmarking

This paper introduces a diagnostic framework demonstrating that rankings of gene regulatory network inference methods exhibit significant instability across evaluation protocol axes, driven primarily by shifts in relative discrimination ability rather than base rate effects, thereby challenging the assumption of ranking invariance in current benchmarking practices.

Ihor Kendiukhov2026-03-05🤖 cs.LG

Geographically-Weighted Weakly Supervised Bayesian High-Resolution Transformer for 200m Resolution Pan-Arctic Sea Ice Concentration Mapping and Uncertainty Estimation using Sentinel-1, RCM, and AMSR2 Data

This study proposes a novel Geographically-Weighted Weakly Supervised Bayesian High-Resolution Transformer that fuses Sentinel-1, RCM, and AMSR2 data to generate 200m resolution pan-Arctic sea ice concentration maps with reliable uncertainty estimates, effectively overcoming challenges related to subtle feature extraction, inexact labels, and data heterogeneity.

Mabel Heffring, Lincoln Linlin Xu2026-03-05🤖 cs.LG

Solving adversarial examples requires solving exponential misalignment

This paper argues that adversarial examples arise from an exponential misalignment between the high-dimensional perceptual manifolds of neural networks and human concepts, suggesting that achieving robustness requires aligning these dimensions to match human perception.

Alessandro Salvatore, Stanislav Fort, Surya Ganguli2026-03-05🤖 cs.LG

Orbital Transformers for Predicting Wavefunctions in Time-Dependent Density Functional Theory

This paper introduces OrbEvo, an equivariant graph transformer model that efficiently predicts time-dependent electronic wavefunctions and related physical properties under external fields by learning to evolve orbital coefficients through autoregressive rollout, thereby overcoming the computational bottlenecks of conventional real-time time-dependent density functional theory.

Xuan Zhang, Haiyang Yu, Chengdong Wang + 3 more2026-03-05🔬 cond-mat.mtrl-sci

MMAI Gym for Science: Training Liquid Foundation Models for Drug Discovery

The paper introduces the MMAI Gym for Science, a comprehensive framework for training efficient, purpose-built Liquid Foundation Models that outperform larger general-purpose and specialist models on critical drug discovery tasks by mastering the specific "language of molecules."

Maksim Kuznetsov, Zulfat Miftahutdinov, Rim Shayakhmetov + 17 more2026-03-05🤖 cs.AI

Q-Measure-Learning for Continuous State RL: Efficient Implementation and Convergence

This paper proposes Q-Measure-Learning, an efficient online reinforcement learning algorithm for continuous state spaces that represents the action-value function as a signed empirical measure updated via coupled stochastic approximation, achieving almost sure convergence to a kernel-smoothed Bellman fixed point with linear memory and computational complexity.

Shengbo Wang2026-03-05🤖 cs.LG

← Previous Next →