stat.AP 篇论文 | Gist.Science

Quantifying Membership Disclosure Risk for Tabular Synthetic Data Using Kernel Density Estimators

该论文提出了一种基于核密度估计（KDE）的实用方法，通过建模合成数据与训练记录间的最近邻距离分布，在不依赖计算密集型影子模型的情况下，有效量化了表格合成数据中的成员披露风险，并实现了比现有基线更优的风险评估效果。

Rajdeep Pathak, Sayantee JanaThu, 12 Ma📊 stat

A Model-Based Restricted Shapley Value to Measure the Players' Contribution to Shot Actions in Football

该论文提出了一种结合合作博弈论与限制联盟结构的“球员受限沙普利值”（PRS）框架，通过引入包含传球网络的“预期射门行动”（xGA）指标，在意大利足球甲级联赛数据中量化了球员在进攻射门动作中的协同贡献。

Mattia Cefis, Rodolfo Metulini, Maurizio CarpitaThu, 12 Ma📊 stat

Don't Disregard the Data for Lack of a Likelihood: Bayesian Synthetic Likelihood for Enhanced Multilevel Network Meta-Regression

本文提出了一种基于贝叶斯合成似然（BSL）的改进型多层次网络 Meta 回归方法，通过利用亚组汇总数据并解决哈密顿蒙特卡洛（HMC）在随机梯度估计和非可微似然函数上的应用挑战，显著提升了在个体患者数据缺失情况下的治疗效果比较精度。

Harlan Campbell, Charles C. Margossian, Jeroen P. Jansen, Paul GustafsonThu, 12 Ma📊 stat

A mixed-frequency approach for exchange rates predictions

本文提出了一种基于混合频率模型的预测方法，旨在克服时间聚合导致的信息缺失问题，并通过 CAD/USD 汇率预测实证表明该方法在解决“梅斯 - 罗戈夫难题”方面优于现有方法。

Raffaele Mattera, Michelangelo Misuraca, Germana Scepi, Maria SpanoMon, 09 Ma🤖 cs.LG

An Integrated Time-Varying Ornstein-Uhlenbeck Process for Jointly Modeling Individual and Population-Level Movement of Golden Eagles

该研究提出了一种整合个体追踪与种群分布数据的全年时变 Ornstein-Uhlenbeck 过程模型，通过联合分析金雕的迁徙轨迹与 eBird 相对丰度数据，实现了对种群时空动态的高效推断、风场风险预测及基于后期观测的早期来源回溯。

Michael L. Shull, Ephraim M. Hanks, James C. Russell, Robert K. Murphy, Frances E. BudermanMon, 09 Ma📊 stat

Omnibus goodness-of-fit tests for univariate continuous distributions based on trigonometric moments

本文提出了一种基于概率积分变换数据三角矩的新颖拟合优度检验，通过充分利用三角统计量的协方差结构，使检验统计量在存在 nuisance 参数时仍收敛于 $\chi_2^2$ 分布，并提供了涵盖 11 种常用连续分布族的统一实现方案，经模拟验证具有准确的显著性水平和强大的检验功效。

Alain Desgagné, Frédéric OuimetMon, 09 Ma🔢 math

Learning Centre Partitions from Summaries

该论文提出了一种基于多中心汇总统计量的序贯聚类算法（CoC），通过多轮 Cochran 型检验与自助法重采样，在检验参数同质性的同时实现中心分组的准确恢复，并证明了其在大样本下以概率趋近于 1 恢复真实分组的理论性质。

Zinsou Max Debaly, Jean-Francois Ethier, Michael H. Neumann, Félix Camirand-LemyreMon, 09 Ma🔢 math

Data-Driven Bed Capacity Planning Using $M_t/G_t/\infty$ Queueing Models with an Application to Neonatal Intensive Care Units

该论文针对重症监护室长期床位规划中需求不确定性的挑战，提出了一种基于非平稳队列模型（ $M_t/G_t/\infty$ ）和数据驱动的方法，通过结合时变到达率与实证拟合的住院时长分布，揭示了传统静态启发式规则在应对波动需求时的不足，并为新生儿重症监护室（NICU）的床位容量规划提供了更精准的决策支持。

Maryam Akbari-Moghaddam, Douglas G. Down, Na Li, Catherine Eastwood, Ayman Abou Mehrem, Alexandra HowlettMon, 09 Ma🔢 math

Admittance Matrix Concentration Inequalities for Understanding Uncertain Power Networks

该论文利用随机矩阵的集中不等式，为不确定参数下的电力网络导纳矩阵谱及经典线性潮流模型建立了保守概率界限，揭示了误差界与节点关键性的关联，并通过 IEEE 测试系统验证了其在捕捉谱扰动缩放行为方面的有效性。

Samuel Talkington, Cameron Khanpour, Rahul K. Gupta, Sergio A. Dorado-Rojas, Daniel Turizo, Hyeongon Park, Dmitrii M. Ostrovskii, Daniel K. MolzahnMon, 09 Ma💻 cs

An intuitive rearranging of the Yates covariance decomposition for probabilistic verification of forecasts with the Brier score

该论文提出了一种对 Brier 分数中 Yates 协方差分解的直观代数重排，将其转化为方差失配、相关度不足和大尺度校准三个非负项，从而清晰揭示了完美概率预报需同时满足方差匹配、完全正相关及均值匹配的最优条件。

Bruno Hebling Vieira (Methods of Plasticity Research, Department of Psychology, University of Zurich, Zurich, Switzerland)Mon, 09 Ma🤖 cs.LG

Two-stage Adaptive Design Cluster Randomised Trials

本文提出了一种适用于整群随机试验的两阶段自适应设计方法，通过结合组合检验、多阶段样本量重估及帕累托最优平衡策略，有效解决了因群内相关性参数不确定导致的试验成本高昂问题，并展示了其在阶梯楔形设计及 E-MOTIVE 试验重分析中的应用。

Samuel I. Watson, James MartinMon, 09 Ma📊 stat

Behavior-dLDS: A decomposed linear dynamical systems model for neural activity partially constrained by behavior

本文提出了行为分解线性动态系统（b-dLDS）模型，旨在从大规模神经活动中解耦与行为直接相关的动态子系统和并行内部计算，并在模拟数据及斑马鱼大规模神经记录中验证了其在识别行为相关动态连接网络方面的优越性。

Eva Yezerets, En Yang, Misha B. Ahrens, Adam S. CharlesMon, 09 Ma🤖 cs.LG

Test-then-Punish: A Statistical Approach to Repeated Games

该论文提出了一种将统计假设检验嵌入博弈策略的“先测试后惩罚”框架，通过允许忽略极小概率历史并采用序贯或分批测试机制，在 imperfect monitoring（不完美监控）条件下成功扩展了重复博弈的民间定理，证明了足够耐心的玩家可维持任意可行且个体理性的收益。

Aymeric Capitaine, Antoine Scheid, Etienne Boursier, Alain Durmus, Michael I. JordanMon, 09 Ma💻 cs

Preoperative Decline and Postoperative Recovery of Wearable-Derived Physical Activity Over a Four-Year Perioperative Period in Total Knee and Hip Arthroplasty: Evidence from the All of Us Research Program

这项基于"All of Us"研究计划的数据分析表明，全膝关节和髋关节置换术患者术前活动量呈渐进性下降，术后呈现“快速改善—增速放缓—稳定”的三阶段恢复模式，且术前功能储备越高越有助于恢复至日常活动水平，凸显了长期可穿戴设备监测在优化围手术期管理中的价值。

Yuezhou Zhang, Amos Folarin, Callum Stewart, Hyunju Kim, Rongrong Zhong, Shaoxiong Sun, Richard JB DobsonMon, 09 Ma📊 stat

Two Localization Strategies for Sequential MCMC Data Assimilation with Applications to Nonlinear Non-Gaussian Geophysical Models

本文提出了一种基于序贯马尔可夫链蒙特卡洛（SMCMC）技术的局部数据同化方案，通过两种利用观测空间稀疏性的新策略，在避免粒子滤波权重退化问题的同时，有效处理了高维非线性非高斯地理物理模型（包括 SWOT 和漂流浮标数据）中的重尾观测噪声，并展示了其优于局部集合变换卡尔曼滤波（LETKF）的性能。

Hamza Ruzayqat, Hristo G. Chipilski, Omar KnioMon, 09 Ma📊 stat

stat.AP

Quantifying Membership Disclosure Risk for Tabular Synthetic Data Using Kernel Density Estimators

A Model-Based Restricted Shapley Value to Measure the Players' Contribution to Shot Actions in Football

Don't Disregard the Data for Lack of a Likelihood: Bayesian Synthetic Likelihood for Enhanced Multilevel Network Meta-Regression

A mixed-frequency approach for exchange rates predictions

An Integrated Time-Varying Ornstein-Uhlenbeck Process for Jointly Modeling Individual and Population-Level Movement of Golden Eagles

Omnibus goodness-of-fit tests for univariate continuous distributions based on trigonometric moments

Learning Centre Partitions from Summaries

Data-Driven Bed Capacity Planning Using $M_t/G_t/\infty$ Queueing Models with an Application to Neonatal Intensive Care Units

Admittance Matrix Concentration Inequalities for Understanding Uncertain Power Networks

An intuitive rearranging of the Yates covariance decomposition for probabilistic verification of forecasts with the Brier score

Two-stage Adaptive Design Cluster Randomised Trials

Behavior-dLDS: A decomposed linear dynamical systems model for neural activity partially constrained by behavior

Test-then-Punish: A Statistical Approach to Repeated Games

Preoperative Decline and Postoperative Recovery of Wearable-Derived Physical Activity Over a Four-Year Perioperative Period in Total Knee and Hip Arthroplasty: Evidence from the All of Us Research Program

Two Localization Strategies for Sequential MCMC Data Assimilation with Applications to Nonlinear Non-Gaussian Geophysical Models

Modeling Animal Communication Using Multivariate Hawkes Processes with Additive Excitation and Multiplicative Inhibition

A Tutorial on Bayesian Analysis of Linear Shock Compression Data

Clustering-Based Outcome Models for Clinical Studies: A Scoping Review

Topological descriptors of foot clearance gait dynamics improve differential diagnosis of Parkinsonism

Large Wave Direction Data Modeling Using Wrapped Spatial Gaussian Markov Random Fields

stat.AP

Quantifying Membership Disclosure Risk for Tabular Synthetic Data Using Kernel Density Estimators

A Model-Based Restricted Shapley Value to Measure the Players' Contribution to Shot Actions in Football

Don't Disregard the Data for Lack of a Likelihood: Bayesian Synthetic Likelihood for Enhanced Multilevel Network Meta-Regression

A mixed-frequency approach for exchange rates predictions

An Integrated Time-Varying Ornstein-Uhlenbeck Process for Jointly Modeling Individual and Population-Level Movement of Golden Eagles

Omnibus goodness-of-fit tests for univariate continuous distributions based on trigonometric moments

Learning Centre Partitions from Summaries

Data-Driven Bed Capacity Planning Using Mt/Gt/∞M_t/G_t/\inftyMt​/Gt​/∞ Queueing Models with an Application to Neonatal Intensive Care Units

Admittance Matrix Concentration Inequalities for Understanding Uncertain Power Networks

An intuitive rearranging of the Yates covariance decomposition for probabilistic verification of forecasts with the Brier score

Two-stage Adaptive Design Cluster Randomised Trials

Behavior-dLDS: A decomposed linear dynamical systems model for neural activity partially constrained by behavior

Test-then-Punish: A Statistical Approach to Repeated Games

Preoperative Decline and Postoperative Recovery of Wearable-Derived Physical Activity Over a Four-Year Perioperative Period in Total Knee and Hip Arthroplasty: Evidence from the All of Us Research Program

Two Localization Strategies for Sequential MCMC Data Assimilation with Applications to Nonlinear Non-Gaussian Geophysical Models

Modeling Animal Communication Using Multivariate Hawkes Processes with Additive Excitation and Multiplicative Inhibition

A Tutorial on Bayesian Analysis of Linear Shock Compression Data

Clustering-Based Outcome Models for Clinical Studies: A Scoping Review

Topological descriptors of foot clearance gait dynamics improve differential diagnosis of Parkinsonism

Large Wave Direction Data Modeling Using Wrapped Spatial Gaussian Markov Random Fields

Data-Driven Bed Capacity Planning Using $M_t/G_t/\infty$ Queueing Models with an Application to Neonatal Intensive Care Units