stat.ML papers | Gist.Science

Invariance-Based Dynamic Regret Minimization

The paper proposes ISD-linUCB, an algorithm for stochastic non-stationary linear bandits that leverages the decomposition of reward models into stationary and non-stationary components to learn invariances from historical data, thereby reducing problem dimensionality and significantly improving dynamic regret in fast-changing environments.

Margherita Lazzaretto, Jonas Peters, Niklas Pfister2026-03-05🤖 cs.LG

Hierarchical Inference and Closure Learning via Adaptive Surrogates for ODEs and PDEs

This paper proposes a hierarchical Bayesian framework that integrates adaptive surrogate models, such as Fourier Neural Operators and parametric PINNs, with ensemble MALA sampling to simultaneously infer individual system parameters and learn shared unknown dynamics via ML-based closure models for ODEs and PDEs.

Pengyu Zhang, Arnaud Vadeboncoeur, Alex Glyn-Davies + 1 more2026-03-05🤖 cs.LG

Fixed-Budget Constrained Best Arm Identification in Grouped Bandits

This paper addresses the fixed-budget best-arm identification problem in grouped bandits with feasibility constraints by establishing a theoretical lower bound on error probability and proposing the Feasibility Constrained Successive Rejects (FCSR) algorithm, which achieves optimal performance guarantees while empirically outperforming natural baselines.

Raunak Mukherjee, Sharayu Moharir2026-03-05🤖 cs.LG

Exploiting Subgradient Sparsity in Max-Plus Neural Networks

This paper proposes a sparse subgradient algorithm that explicitly leverages the inherent sparsity of Max-Plus neural networks' subgradients to enable efficient and theoretically guaranteed training, overcoming the computational inefficiencies of standard backpropagation.

Ikhlas Enaieh, Olivier Fercoq2026-03-05🤖 cs.LG

Stable and Steerable Sparse Autoencoders with Weight Regularization

This paper demonstrates that applying L2 weight regularization to sparse autoencoders significantly enhances the stability and cross-seed consistency of learned features, leading to improved steering success rates and a stronger alignment between automated interpretability scores and functional controllability.

Piotr Jedryszek, Oliver M. Crook2026-03-05🤖 cs.LG

Beyond Mixtures and Products for Ensemble Aggregation: A Likelihood Perspective on Generalized Means

This paper establishes a principled theoretical framework for density aggregation by demonstrating that normalized generalized means with order $r \in [0,1]$ are the only rules guaranteeing systematic improvements in log-likelihood over individual distributions, thereby providing a unified justification for the widespread use of linear and geometric pooling in Deep Ensembles.

Raphaël Razafindralambo, Rémy Sun, Frédéric Precioso + 2 more2026-03-05🤖 cs.LG

Semi-Supervised Generative Learning via Latent Space Distribution Matching

This paper introduces Latent Space Distribution Matching (LSDM), a novel semi-supervised framework that leverages both paired and unpaired data to learn a latent space and perform joint distribution matching, thereby enhancing generation quality and providing theoretical insights into Latent Diffusion Models.

Kwong Yu Chong, Long Feng2026-03-05🤖 cs.LG

PTOPOFL: Privacy-Preserving Personalised Federated Learning via Persistent Homology

PTOPOFL is a privacy-preserving personalized federated learning framework that replaces gradient sharing with compact persistent homology descriptors to simultaneously mitigate data-reconstruction risks and improve aggregation performance on non-IID data through topology-guided clustering and weighted aggregation.

Kelly L Vomo-Donfack, Adryel Hoszu, Grégory Ginot + 1 more2026-03-05🤖 cs.LG

From Reachability to Learnability: Geometric Design Principles for Quantum Neural Networks

This paper reframes Quantum Neural Network design from state reachability to learnability by introducing geometric design principles and the almost Complete Local Selectivity (aCLS) criterion, demonstrating that architectures requiring joint dependence on data and trainable weights enable adaptive feature learning and outperform traditional schemes with greater efficiency.

Vishal S. Ngairangbam, Michael Spannowsky2026-03-03⚛️ quant-ph

Denoising Diffusion Probabilistic Models

This paper presents high-quality image synthesis using denoising diffusion probabilistic models, achieving state-of-the-art results on CIFAR10 and LSUN datasets through a novel training objective connecting diffusion models to denoising score matching with Langevin dynamics.

Jonathan Ho, Ajay Jain, Pieter Abbeel2020-06-19🤖 cs.LG

Scaling Laws for Neural Language Models

This paper establishes that language model performance follows predictable power-law scaling relationships with model size, dataset size, and compute, revealing that optimal training efficiency is achieved by prioritizing very large models trained on modest data amounts rather than training smaller models to convergence.

Jared Kaplan, Sam McCandlish, Tom Henighan + 7 more2020-01-23🤖 cs.LG

Generative Adversarial Networks

This paper proposes a new generative modeling framework based on a minimax two-player game between a generative model that captures data distribution and a discriminative model that distinguishes real from generated samples, which can be trained efficiently using backpropagation without Markov chains.

Ian J. Goodfellow, Jean Pouget-Abadie, Mehdi Mirza + 5 more2014-06-10📊 stat.ML

Auto-Encoding Variational Bayes

This paper introduces the Auto-Encoding Variational Bayes (AEVB) framework, which enables efficient stochastic variational inference and learning in directed probabilistic models with continuous latent variables by utilizing a reparameterized lower bound estimator and an approximate inference model to handle intractable posteriors and scale to large datasets.

Diederik P Kingma, Max Welling2013-12-20📊 stat.ML

← Previous