stat.ML 篇论文 | Gist.Science

On-Average Stability of Multipass Preconditioned SGD and Effective Dimension

本文首次建立了多轮次预条件随机梯度下降（PSGD）的平均算法稳定性理论，揭示了人口风险曲率、噪声几何与预条件策略之间的权衡关系，并证明了不当的预条件选择会导致基于有效维度的泛化与优化性能次优。

Simon Vary, Tyler Farghly, Ilja Kuzborskij, Patrick RebeschiniFri, 13 Ma📊 stat

BTZSC: A Benchmark for Zero-Shot Text Classification Across Cross-Encoders, Embedding Models, Rerankers and LLMs

本文提出了 BTZSC 基准，通过涵盖 22 个数据集对跨编码器、嵌入模型、重排序器及大语言模型进行了系统的零样本文本分类评估，发现现代重排序器性能最佳，而传统 NLI 跨编码器则表现停滞。

Ilias AarabFri, 13 Ma💬 cs.CL

Chemical Reaction Networks Learn Better than Spiking Neural Networks

该论文通过数学证明和数值实验表明，无隐藏层的化学反应网络在分类任务（如手写数字识别）上比需要隐藏层的脉冲神经网络具有更高的学习效率和准确性，并提供了相应的理论界限分析。

Sophie Jaffard, Ivo F. SbalzariniFri, 13 Ma📊 stat

Wasserstein Gradient Flows for Batch Bayesian Optimal Experimental Design

本文提出了一种基于 Wasserstein 梯度流的新型批量贝叶斯最优实验设计方法，通过将优化问题提升至概率测度空间并引入熵正则化，利用粒子算法有效解决了高维非凸批量设计中的优化难题。

Louis SharrockFri, 13 Ma📊 stat

A Quantitative Characterization of Forgetting in Post-Training

该论文基于双模态混合抽象，从理论上量化了生成模型持续后训练中的遗忘现象，揭示了前向与反向 KL 散度在质量遗忘和旧分量漂移上的不同机制，并阐明了重放策略及现有近于策略方法如何受散度方向、几何重叠度及采样机制的影响。

Krishnakumar Balasubramanian, Shiva Prasad KasiviswanathanFri, 13 Ma📊 stat

Riemannian Laplace Approximation with the Fisher Metric

本文指出基于 Fisher 度量的黎曼拉普拉斯近似在无限数据极限下仍存在偏差和过窄问题，并提出了两种修正变体，使其在保持计算高效的同时实现无限数据下的精确性，从而在理论和实验上均优于现有方法。

Hanlin Yu, Marcelo Hartmann, Bernardo Williams + 2 more2026-03-12🤖 cs.LG

Logarithmic Regret for Online KL-Regularized Reinforcement Learning

本文提出了一种基于乐观主义的在线 KL 正则化上下文多臂老虎机算法，并通过利用 KL 正则化带来的良性优化景观，证明了该算法在奖励函数类复杂度较低时能达到对数级累积遗憾，且该结论可进一步推广至强化学习场景。

Heyang Zhao, Chenlu Ye, Wei Xiong + 2 more2026-03-12📊 stat

Geopolitics, Geoeconomics, and Sovereign Risk: Different Shocks, Different Channels

该研究基于 2018 至 2025 年 42 个经济体的数据，揭示了地缘政治冲击通过直接渠道重定价主权违约风险，而地缘经济冲击则通过货币政策预期和全球金融周期传导，两者形成“剪刀差”模式，并据此提出流动性供给可缓解金融周期传导的利差扩大，但无法消除地缘政治风险溢价中的持久成分。

Alvaro Ortiz, Tomasa Rodrigo, Pablo Saborido2026-03-12📊 stat

A Bandit-Based Approach to Educational Recommender Systems: Contextual Thompson Sampling for Learner Skill Gain Optimization

该论文提出了一种基于上下文汤普森采样的个性化练习推荐方法，利用学习者数据动态选择最能提升技能水平的题目，从而在大规模在线教育环境中实现高效的学习增益优化。

Lukas De Kerpel, Arthur Thuy, Dries F. Benoit2026-03-12📊 stat

SSRCA: a novel machine learning pipeline to perform sensitivity analysis for agent-based models

本文提出了一种名为 SSRCA 的新型机器学习流程，通过模拟、汇总、降维、聚类和分析五个步骤，有效解决了代理基模型（ABM）敏感性分析的计算难题，能够识别敏感参数、揭示输出模式并确定生成这些模式的参数区域，且相比传统的 Sobol 法具有更强的鲁棒性。

Edward H. Rohr, John T. Nardini2026-03-11🧬 q-bio

Accounting for shared covariates in semi-parametric Bayesian additive regression trees

本文提出了一种半参数贝叶斯加法回归树（BART）的新方法，通过改进树生成机制来解决线性预测器与 BART 组件共享协变量时的非识别性与偏差问题，从而允许对主要关注的协变量进行复杂的交互建模，并在教育评估等实际应用中展现了优越性能。

Estevão B. Prado, Andrew C. Parnell, Keefe Murphy + 3 more2026-03-10🤖 cs.LG

Convergence and complexity of block majorization-minimization for constrained block-Riemannian optimization

该论文提出并分析了一类用于约束块黎曼优化的块主化最小化（BMM）算法，证明了其在非凸光滑目标函数下渐近收敛至平稳点集且达到 $\epsilon$ -平稳点的迭代复杂度为 $\widetilde{O}(\epsilon^{-2})$ ，并验证了其在多种黎曼几何约束问题中优于标准欧氏算法的性能。

Yuchen Li, Laura Balzano, Deanna Needell + 1 more2026-03-10📊 stat

Zeroth-Order primal-dual Alternating Projection Gradient Algorithms for Nonconvex Minimax Problems with Coupled linear Constraints

本文针对具有耦合线性约束的非凸极小极大问题，提出了两种单循环零阶算法（ZO-PDAPG 和 ZO-RMPDPG），并在确定性和随机设定下分别证明了其达到 $\varepsilon$ -平稳点的迭代复杂度，填补了该领域零阶算法理论分析的空白，其中 ZO-RMPDPG 在无约束随机设定下还刷新了现有零阶算法的最优复杂度记录。

Huiling Zhang, Zi Xu, Yuhong Dai2026-03-06🔢 math

stat.ML

On-Average Stability of Multipass Preconditioned SGD and Effective Dimension

BTZSC: A Benchmark for Zero-Shot Text Classification Across Cross-Encoders, Embedding Models, Rerankers and LLMs

Chemical Reaction Networks Learn Better than Spiking Neural Networks

Wasserstein Gradient Flows for Batch Bayesian Optimal Experimental Design

A Quantitative Characterization of Forgetting in Post-Training

Riemannian Laplace Approximation with the Fisher Metric

Logarithmic Regret for Online KL-Regularized Reinforcement Learning

Geopolitics, Geoeconomics, and Sovereign Risk: Different Shocks, Different Channels

A Bandit-Based Approach to Educational Recommender Systems: Contextual Thompson Sampling for Learner Skill Gain Optimization

SSRCA: a novel machine learning pipeline to perform sensitivity analysis for agent-based models

Accounting for shared covariates in semi-parametric Bayesian additive regression trees

Convergence and complexity of block majorization-minimization for constrained block-Riemannian optimization

Zeroth-Order primal-dual Alternating Projection Gradient Algorithms for Nonconvex Minimax Problems with Coupled linear Constraints

Towards a Fairer Non-negative Matrix Factorization

An Experimental Study on Fairness-aware Machine Learning for Credit Scoring Problems

Curse of Dimensionality in Neural Network Optimization

Generalization Bounds for Markov Algorithms through Entropy Flow Computations

Double Momentum and Error Feedback for Clipping with Fast Rates and Differential Privacy

Variational Formulation of Particle Flow

Ice Cream Doesn't Cause Drowning: Benchmarking LLMs Against Statistical Pitfalls in Causal Inference