stat.ML papers | Gist.Science

Universality of General Spiked Tensor Models

This paper establishes the universality of high-dimensional spectral behavior and statistical limits for asymmetric rank-one spiked tensor models with non-Gaussian noise, demonstrating that the maximum-likelihood estimator's performance matches the Gaussian case under finite fourth-moment assumptions.

Yanjin Xiang, Zhihua Zhang | Thu, 12 Ma | 📊 stat

Emergence of Distortions in High-Dimensional Guided Diffusion Models

This paper formalizes the loss of diversity in classifier-free guidance as "generative distortion," characterizes its emergence via a high-dimensional phase transition using statistical physics, and proposes a novel guidance schedule with a negative-guidance window to mitigate variance shrinkage while preserving class separability.

Enrico Ventura, Beatrice Achilli, Luca Ambrogioni, Carlo Lucibello | Thu, 12 Ma | 📊 stat

Singular Bayesian Neural Networks

This paper proposes Singular Bayesian Neural Networks, which parameterize weights as low-rank products to induce a singular posterior that captures structured correlations, thereby achieving competitive predictive performance and improved uncertainty calibration with significantly fewer parameters and tighter generalization bounds compared to standard mean-field approaches.

Mame Diarra Toure, David A. Stephens | Thu, 12 Ma | 📊 stat

Error Analysis of Bayesian Inverse Problems with Generative Priors

This paper presents a theoretical analysis establishing quantitative error bounds for Bayesian inverse problems using generative priors, demonstrating that the posterior error inherits the convergence rate of the prior in Wasserstein distance, and validates these findings through numerical experiments on benchmarks and an elliptic PDE inverse problem.

Bamdad Hosseini, Ziqi Huang | Thu, 12 Ma | 📊 stat

Transfer learning for functional linear regression via control variates

This paper proposes a control-variates-based transfer learning approach for scalar-on-function regression that utilizes dataset-specific summary statistics to preserve privacy, establishes a theoretical equivalence between offset and control-variates methods, and derives convergence rates that account for discretization errors and cross-dataset covariance similarities.

Yuping Yang, Zhiyang Zhou | Thu, 12 Ma | 📊 stat

Rethinking Few-Shot Image Fusion: Granular Ball Priors Enable General-Purpose Deep Fusion

This paper proposes a few-shot image fusion framework that leverages a Granular Ball Pixel Computation algorithm to generate adaptive, confidence-aware incomplete priors, enabling a lightweight neural network to learn effective fusion rules from minimal data without requiring real fused images for supervision.

Minjie Deng, Yan Wei, An Wu, Yuncan Ouyang, Hao Zhai, Qianyao Peng | Thu, 12 Ma | ⚡ eess

Gradient Dynamics of Attention: How Cross-Entropy Sculpts Bayesian Manifolds

This paper provides a first-order analysis demonstrating that cross-entropy training in transformers induces a coupled specialization of attention routing and value updates—functioning as a two-timescale EM procedure—that sculpts low-dimensional Bayesian manifolds, thereby explaining how gradient-based optimization enables precise probabilistic reasoning.

Naman Agarwal, Siddhartha R. Dalal, Vishal Misra | Thu, 12 Ma | 📊 stat

Maximum Risk Minimization with Random Forests

This paper introduces computationally efficient and statistically consistent Random Forest variants that minimize the maximum risk across diverse environments to improve out-of-distribution generalization, offering novel guarantees for mean squared error, negative reward, and regret-based risks.

Francesco Freni, Anya Fries, Linus Kühne, Markus Reichstein, Jonas Peters | Thu, 12 Ma | 📊 stat

EarthquakeNPP: A Benchmark for Earthquake Forecasting with Neural Point Processes

The paper introduces EarthquakeNPP, a rigorous benchmark for earthquake forecasting that reveals current Neural Point Process models fail to outperform the classical ETAS model, highlighting the need for improved collaboration between seismology and machine learning communities.

Samuel Stockman, Daniel Lawson, Maximilian Werner | Thu, 12 Ma | 🔬 physics

Two-sample comparison through additive tree models for density ratios

This paper proposes additive tree models for two-sample density ratio estimation using a novel balancing loss that enables efficient training via supervised learning algorithms and generalized Bayesian inference for uncertainty quantification, with demonstrated effectiveness in high-dimensional settings and generative model assessment.

Naoki Awaya, Yuliang Xu, Li Ma | Thu, 12 Ma | 📊 stat

Offline Dynamic Inventory and Pricing Strategy: Addressing Censored and Dependent Demand

This paper proposes a novel data-driven framework using offline reinforcement learning and survival analysis to estimate optimal pricing and inventory control policies in sequential environments with censored and dependent demand, overcoming challenges like missing profit information and non-stationarity by approximating the problem as a high-order Markov decision process.

Korel Gundem, Zhengling Qi | Thu, 12 Ma | 📊 stat

Pairwise Comparisons without Stochastic Transitivity: Model, Theory and Applications

This paper proposes a general family of statistical models for pairwise comparisons that relaxes the restrictive stochastic transitivity assumption by utilizing low-dimensional skew-symmetric matrices, thereby achieving minimax-rate optimality and superior predictive performance in complex, real-world scenarios where traditional models like Bradley-Terry fail.

Sze Ming Lee, Yunxiao Chen | Thu, 12 Ma | 📊 stat

Conditional Local Importance by Quantile Expectations

The paper introduces CLIQUE, a novel model-agnostic method for calculating local variable importance that overcomes the limitations of existing techniques like LIME and SHAP by capturing locally dependent relationships and interaction behaviors while being directly applicable to multi-class classification problems.

Kelvyn K. Bladen, Adele Cutler, D. Richard Cutler, Kevin R. Moon | Thu, 12 Ma | 📊 stat

Losing dimensions: Geometric memorization in generative diffusion

This paper proposes a geometric memorization theory demonstrating that diffusion models transition from generalization to exact copying through a smooth, gradual collapse of latent dimensionality, where salient features and finer details progressively "freeze out" as data becomes scarce, mirroring physical systems condensing into low-energy configurations.

Beatrice Achilli, Enrico Ventura, Gianluigi Silvestri, Bao Pham, Gabriel Raya, Dmitry Krotov, Carlo Lucibello, Luca Ambrogioni | Thu, 12 Ma | 📊 stat

Learning Robust Treatment Rules for Censored Data

This paper proposes two robust criteria and a corresponding sampling-based difference-of-convex algorithm for learning optimal treatment rules that maximize truncated mean survival time and buffered survival probabilities in the presence of censored survival data, demonstrating superior performance through simulations and an AIDS clinical trial application.

Yifan Cui, Junyi Liu, Tao Shen, Zhengling Qi, Xi Chen | Thu, 12 Ma | 📊 stat

When should we trust the annotation? Selective prediction for molecular structure retrieval from mass spectra

This paper introduces a selective prediction framework for molecular structure retrieval from mass spectra that leverages retrieval-level uncertainty and distribution-free risk control to allow models to abstain from low-confidence predictions, thereby ensuring annotations meet specified error rate constraints in high-stakes applications.

Mira Jürgens, Gaetan De Waele, Morteza Rakhshaninejad, Willem Waegeman | Thu, 12 Ma | 📊 stat

Beyond Accuracy: Reliability and Uncertainty Estimation in Convolutional Neural Networks

This paper compares Monte Carlo Dropout and Conformal Prediction for uncertainty estimation in CNNs trained on Fashion-MNIST, revealing that while H-CNN VGG16 achieves higher accuracy, GoogLeNet offers better calibration and Conformal Prediction provides statistically guaranteed reliability for high-stakes applications.

Sanne Ruijs, Alina Kosiakova, Farrukh Javed | Thu, 12 Ma | 📊 stat

GGMPs: Generalized Gaussian Mixture Processes

This paper introduces Generalized Gaussian Mixture Processes (GGMPs), a scalable and tractable Gaussian process-based framework that enables multimodal conditional density estimation by combining local mixture fitting, cross-input component alignment, and per-component heteroscedastic GP training to overcome the unimodal limitations of standard GP regression.

Vardaan Tekriwal, Mark D. Risser, Hengrui Luo, Marcus M. Noack | Thu, 12 Ma | 🤖 cs.LG

Designing Service Systems from Textual Evidence

This paper introduces PP-LUCB, a cost-efficient algorithm that optimally combines biased LLM-generated proxy scores with selective human audits to identify the best service system configuration while providing statistically valid confidence guarantees and significantly reducing audit costs.

Ruicheng Ao, Hongyu Chen, Siyang Gao, Hanwei Li, David Simchi-Levi | Thu, 12 Ma | 🤖 cs.LG

Bayesian Optimization with Gaussian Processes to Accelerate Stationary Point Searches

This paper presents a unified Bayesian optimization framework using Gaussian processes with derivative observations and advanced extensions like Optimal Transport and random Fourier features to efficiently accelerate the search for minima and saddle points on potential energy surfaces, bridging theoretical formulation with practical implementation through accompanying Rust code.

Rohit Goswami (Institute IMX and Lab-COSMO, École polytechnique fédérale de Lausanne) | Thu, 12 Ma | 📊 stat
