cs.LG papers | Gist.Science

Beyond Word Error Rate: Auditing the Diversity Tax in Speech Recognition through Dataset Cartography

This paper proposes a robust auditing framework for automatic speech recognition systems that moves beyond traditional Word Error Rate by introducing the Sample Difficulty Index and semantic metrics to quantify and mitigate the "diversity tax" disproportionately affecting marginalized speakers.

Ting-Hui Cheng, Line H. Clemmensen, Sneha Das2026-03-06🤖 cs.LG

Whispering to a Blackbox: Bootstrapping Frozen OCR with Visual Prompts

This paper introduces "Whisperer," a sample-efficient visual prompting framework that bootstraps frozen OCR models by using a four-stage behavioral cloning curriculum to learn diffusion-based preprocessors that enhance degraded text inputs, achieving an 8% absolute reduction in Character Error Rate without modifying the downstream model's weights.

Samandar Samandarov, Nazirjon Ismoiljonov, Abdullah Sattorov + 1 more2026-03-06🤖 cs.AI

Layer by layer, module by module: Choose both for optimal OOD probing of ViT

This paper demonstrates that distribution shift is the primary cause of performance degradation in deeper layers of Vision Transformers and reveals that optimal out-of-distribution probing requires selecting between feedforward network activations and normalized self-attention outputs depending on the severity of the shift.

Ambroise Odonnat, Vasilii Feofanov, Laetitia Chapel + 2 more2026-03-06🤖 cs.LG

Bayesian Supervised Causal Clustering

This paper proposes Bayesian Supervised Causal Clustering (BSCC), a novel framework that identifies homogeneous patient subgroups by simultaneously clustering individuals based on their covariate profiles and treatment effects, and validates its practical utility through simulations and real-world data from the International Stroke Trial.

Luwei Wang, Nazir Lone, Sohan Seth2026-03-06🤖 cs.LG

Knowledge Divergence and the Value of Debate for Scalable Oversight

This paper establishes a formal geometric framework linking AI debate and RLAIF by demonstrating that the value of debate scales with knowledge divergence between models, transitioning from negligible benefit to essential oversight as representations diverge, while identifying specific regimes where debate unlocks inaccessible outcomes or risks coordination failure.

Robin Young2026-03-06🤖 cs.LG

Latent Policy Steering through One-Step Flow Policies

The paper proposes Latent Policy Steering (LPS), a robust offline reinforcement learning method that achieves state-of-the-art performance by using a differentiable one-step MeanFlow policy to backpropagate original-action-space Q-gradients directly to a latent actor, thereby eliminating the need for proxy latent critics and sensitive hyperparameter tuning while ensuring policies remain within dataset support.

Hokyun Im, Andrey Kolobov, Jianlong Fu + 1 more2026-03-06🤖 cs.LG

WavSLM: Single-Stream Speech Language Modeling via WavLM Distillation

WavSLM is a single-stream speech language model that achieves competitive speech generation and consistency without text supervision by quantizing and distilling WavLM representations into a single codebook for autoregressive next-chunk prediction.

Luca Della Libera, Cem Subakan, Mirco Ravanelli2026-03-06🤖 cs.AI

How important are the genes to explain the outcome - the asymmetric Shapley value as an honest importance metric for high-dimensional features

This paper proposes using asymmetric Shapley values as a superior metric for quantifying the importance of high-dimensional genomic features in clinical prediction models, addressing limitations of traditional approaches by accounting for collinearity and known causal directions, and provides efficient algorithms validated through a colorectal cancer progression study.

Mark A. van de Wiel, Jeroen Goedhart, Martin Jullum + 1 more2026-03-06🤖 cs.LG

GALACTIC: Global and Local Agnostic Counterfactuals for Time-series Clustering

This paper introduces GALACTIC, a unified framework that bridges local and global counterfactual explainability for unsupervised time-series clustering by generating minimal perturbations to cross cluster boundaries and employing a provably efficient submodular optimization algorithm to derive concise, non-redundant global summaries of these transitions.

Christos Fragkathoulas, Eleni Psaroudaki, Themis Palpanas + 1 more2026-03-06🤖 cs.AI

FairFinGAN: Fairness-aware Synthetic Financial Data Generation

The paper proposes FairFinGAN, a WGAN-based framework that integrates fairness constraints via a classifier to generate synthetic financial data that effectively mitigates bias against protected attributes while maintaining high utility for downstream predictive tasks.

Tai Le Quy, Dung Nguyen Tuan, Trung Nguyen Thanh + 3 more2026-03-06🤖 cs.LG

Bayes with No Shame: Admissibility Geometries of Predictive Inference

This paper demonstrates that predictive inference is governed by four distinct, pairwise non-nested admissibility geometries—Blackwell risk dominance, anytime-valid supermartingales, marginal coverage, and Cesàro approachability—each offering a unique certificate of optimality and proving that admissibility is irreducibly relative to the chosen criterion rather than a universal property.

Nicholas G. Polson, Daniel Zantedeschi2026-03-06🔢 math

On the Statistical Optimality of Optimal Decision Trees

This paper establishes a comprehensive statistical theory for globally optimal empirical risk minimization decision trees by deriving sharp oracle inequalities and minimax optimal rates over a novel piecewise sparse heterogeneous anisotropic Besov space, thereby providing rigorous theoretical guarantees for their performance in high-dimensional regression and classification under both sub-Gaussian and heavy-tailed noise settings.

Zineng Xu, Subhroshekhar Ghosh, Yan Shuo Tan2026-03-06🔢 math

Preserving Continuous Symmetry in Discrete Spaces: Geometric-Aware Quantization for SO(3)-Equivariant GNNs

This paper proposes Geometric-Aware Quantization (GAQ), a framework that enables efficient, low-bit inference for SO(3)-equivariant Graph Neural Networks by decoupling magnitude and direction to rigorously preserve continuous symmetry, thereby achieving significant speedups and memory reductions on molecular simulation benchmarks without compromising physical consistency.

Haoyu Zhou, Ping Xue, Hao Zhang + 1 more2026-03-06🤖 cs.LG

InfoFlow KV: Information-Flow-Aware KV Recomputation for Long Context

This paper proposes InfoFlow KV, an information-flow-aware method that uses attention-norm signals and global positional reordering to selectively recompute key-value caches, thereby improving the efficiency and accuracy of retrieval-augmented generation for long-context tasks.

Xin Teng, Canyu Zhang, Shaoyi Zheng + 3 more2026-03-06🤖 cs.LG

Learning Causal Structure of Time Series using Best Order Score Search

This paper introduces TS-BOSS, a scalable, score-based algorithm for learning causal structures in multivariate time series that extends the Best Order Score Search framework with dynamic Bayesian networks and grow-shrink trees, demonstrating superior performance in high auto-correlation regimes compared to standard constraint-based methods.

Irene Gema Castillo Mansilla, Urmi Ninad2026-03-06🤖 cs.AI

Embedded Inter-Subject Variability in Adversarial Learning for Inertial Sensor-Based Human Activity Recognition

This paper proposes a novel deep adversarial framework that explicitly integrates inter-subject variability to learn subject-invariant feature representations, thereby significantly improving generalization and classification performance in inertial sensor-based Human Activity Recognition across unseen individuals.

Francisco M. Calatrava-Nicolás, Shoko Miyauchi, Vitor Fortes Rey + 3 more2026-03-06🤖 cs.LG

Robust Node Affinities via Jaccard-Biased Random Walks and Rank Aggregation

The paper introduces TopKGraphs, a robust and interpretable method for estimating node similarity that combines Jaccard-biased random walks with rank aggregation to outperform standard similarity measures and embedding-based approaches across diverse network types.

Bastian Pfeifer, Michael G. Schimek2026-03-06🤖 cs.LG

On the Necessity of Learnable Sheaf Laplacians

This paper challenges the necessity of learnable restriction maps in Sheaf Neural Networks by demonstrating that a simpler baseline with fixed identity maps achieves comparable performance on heterophilic graphs and does not suffer from the oversmoothing predicted by theoretical diffusion analysis.

Ferran Hernandez Caralt, Mar GonzÃ lez i CatalÃ, Adrián Bazaga + 1 more2026-03-06🤖 cs.LG

Harnessing Synthetic Data from Generative AI for Statistical Inference

This paper provides a comprehensive statistical review of synthetic data generated by modern AI models, outlining their benefits and limitations while offering principled frameworks and practical recommendations to ensure their valid and reliable use in scientific inference and prediction.

Ahmad Abdel-Azim, Ruoyu Wang, Xihong Lin2026-03-06🤖 cs.LG

MobileFetalCLIP: Selective Repulsive Knowledge Distillation for Mobile Fetal Ultrasound Analysis

The paper introduces MobileFetalCLIP, a framework utilizing Selective Repulsive Knowledge Distillation to train a compact 11.4M parameter student model that outperforms its 304M parameter teacher in fetal ultrasound analysis while enabling real-time deployment on mobile devices.

Numan Saeed, Fadillah Adamsyah Maani, Mohammad Yaqub2026-03-06🤖 cs.AI

← Previous Next →