Synthetic data for ratemaking: imputation-based methods vs adversarial networks and autoencoders

This paper benchmarks Multivariate Imputation by Chained Equations (MICE) against deep generative models like Variational Autoencoders and Conditional Tabular GANs for synthetic ratemaking data, finding that MICE offers a simpler yet high-fidelity alternative that effectively preserves statistical distributions and supports robust Generalized Linear Model training.

Yevhen Havrylenko, Meelis Käärik, Artur TuttarTue, 10 Ma🤖 cs.LG

Exposing the Illusion of Fairness: Auditing Vulnerabilities to Distributional Manipulation Attacks

This paper investigates how malicious auditees can construct fairness-compliant yet representative-looking samples from non-compliant distributions to deceive auditors, formalizes these manipulation strategies using optimal transport and entropic projections, and proposes statistical tests to detect such distributional manipulation attacks.

Valentin Lafargue, Adriana Laurindo Monteiro, Emmanuelle Claeys, Laurent Risser, Jean-Michel LoubesTue, 10 Ma🤖 cs.LG

Optimising antibiotic switching via forecasting of patient physiology

This paper proposes a neural process-based decision support system that forecasts patient vital sign trajectories to probabilistically predict readiness for switching from intravenous to oral antibiotics, thereby outperforming random selection and historical decision-learning approaches in identifying eligible patients across US and UK datasets.

Magnus Ross, Nel Swanepoel, Akish Luintel, Emma McGuire, Ingemar J. Cox, Steve Harris, Vasileios LamposTue, 10 Ma🤖 cs.LG

Constructing Evidence-Based Tailoring Variables for Adaptive Interventions

This paper proposes a systematic framework for empirically developing evidence-based tailoring variables for adaptive interventions, arguing that while secondary data can be used, specifically designed optimization experiments (such as SMARTs or factorial designs) provide the most direct causal evidence for determining optimal measurement times, decision points, and cutoffs.

John J. Dziak, Inbal Nahum-ShaniThu, 12 Ma📊 stat

Offline Dynamic Inventory and Pricing Strategy: Addressing Censored and Dependent Demand

This paper proposes a novel data-driven framework using offline reinforcement learning and survival analysis to estimate optimal pricing and inventory control policies in sequential decision-making environments characterized by censored and dependent demand, overcoming challenges like missing profit information and non-stationarity through high-order Markov decision process approximations and finite-sample regret guarantees.

Korel Gundem, Zhengling QiThu, 12 Ma📊 stat

Controlling the joint local false discovery rate is more powerful than meta-analysis methods in joint analysis of summary statistics from multiple genome-wide association studies

This paper proposes a novel summary-statistics-based joint analysis method that controls the joint local false discovery rate (Jlfdr), demonstrating through simulations and empirical data that it offers superior power over traditional meta-analysis methods, particularly when analyzing heterogeneous genome-wide association study datasets.

Wei Jiang, Weichuan YuThu, 12 Ma📊 stat

Don't Disregard the Data for Lack of a Likelihood: Bayesian Synthetic Likelihood for Enhanced Multilevel Network Meta-Regression

This paper proposes a Bayesian Synthetic Likelihood (BSL) framework integrated with Hamiltonian Monte Carlo to enhance Multilevel Network Meta-Regression by leveraging available subgroup-level summary statistics to impute missing individual-level covariates, thereby improving population-adjusted treatment comparisons in the presence of incomplete data.

Harlan Campbell, Charles C. Margossian, Jeroen P. Jansen, Paul GustafsonThu, 12 Ma📊 stat

A Model-Based Restricted Shapley Value to Measure the Players' Contribution to Shot Actions in Football

This paper introduces a novel framework combining the expected Goal Action (xGA) metric with a Player's Restricted Shapley (PRS) statistic to quantify individual player contributions in football shot actions by modeling team interactions through tactically admissible passing coalitions, as demonstrated in an analysis of the 2022/23 Serie A season.

Mattia Cefis, Rodolfo Metulini, Maurizio CarpitaThu, 12 Ma📊 stat

Estimands and the Choice of Non-Inferiority Margin under ICH E9(R1)

This paper demonstrates that under the ICH E9(R1) framework, the choice of a non-inferiority margin must be explicitly aligned with the specific estimand, as variations in intercurrent event strategies and historical trial designs significantly influence the estimated treatment effect and the validity of the constancy assumption.

Tobias Mütze, Helle Lynggaard, Sunita Rehal, Oliver N. Keene, Marian Mitroiu, David WrightThu, 12 Ma📊 stat

Bayesian Design and Analysis of Precision Trials with Partial Borrowing

Motivated by a gastric cancer trial, this paper proposes a Bayesian framework for precision clinical trials that utilizes individually weighted external data to partially borrow information for estimating subgroup effects, supported by a simulation study comparing its performance to dynamic borrowing and demonstrating its application in determining sample sizes and decision boundaries.

Shirin Golchi, Satoshi MoritaThu, 12 Ma📊 stat