Sampling via Stochastic Interpolants by Langevin-based Velocity and Initialization Estimation in Flow ODEs

This paper proposes a novel sampling method for unnormalized Boltzmann densities that leverages a sequence of Langevin samplers to efficiently generate intermediate samples and robustly estimate the velocity field of a probability flow ODE derived from linear stochastic interpolants, offering theoretical convergence guarantees and demonstrating effectiveness in high-dimensional multimodal distributions and Bayesian inference tasks.

Chenguang Duan, Yuling Jiao, Gabriele Steidl, Christian Wald, Jerry Zhijian Yang, Ruizhe ZhangThu, 12 Ma📊 stat

The Bayesian Geometry of Transformer Attention

This paper introduces "Bayesian wind tunnels" to rigorously demonstrate that small transformers perform exact Bayesian inference through a specific geometric mechanism involving residual streams, feed-forward updates, and attention-based routing, thereby establishing a clear architectural advantage over flat models and providing a mechanistic foundation for understanding reasoning in large language models.

Naman Agarwal, Siddhartha R. Dalal, Vishal MisraThu, 12 Ma📊 stat

Absolute indices for determining compactness, separability and number of clusters

This paper introduces novel absolute cluster indices based on defined compactness functions and neighboring point sets to objectively determine cluster compactness, separability, and the true number of clusters, demonstrating their effectiveness across synthetic and real-world datasets compared to existing relative validity indices.

Adil M. Bagirov, Ramiz M. Aliguliyev, Nargiz Sultanova, Sona TaheriThu, 12 Ma📊 stat

Empirical Orlicz norms

This paper establishes a law of large numbers for empirical Orlicz norms under minimal assumptions and investigates their central limit behavior, revealing that while standard convergence rates hold under specific conditions, canonical cases like the sub-Gaussian norm of normal variables exhibit nonstandard n1/4n^{1/4} rates with stable limits, and the general class of distributions with bounded Orlicz norms admits no uniform rate of convergence.

Fabian MiesThu, 12 Ma📊 stat

Robust evaluation of treatment effects in longitudinal studies with truncation by death or other intercurrent events

This paper proposes the Pairwise Last Observation Time (PLOT) estimand, a novel, assumption-free causal inference method that robustly evaluates treatment effects in longitudinal studies by comparing matched individuals at their last common observation time before intercurrent events, thereby avoiding the unverifiable assumptions and sensitivity issues inherent in existing frameworks.

Georgi Baklicharov, Kelly Van Lancker, Stijn VansteelandtThu, 12 Ma📊 stat