cs.LG papers | Gist.Science

Interactive Benchmarks

This paper proposes "Interactive Benchmarks," a unified evaluation paradigm that assesses model intelligence through active information acquisition and reasoning under budget constraints in interactive proofs and games, demonstrating that current models still have significant room for improvement in these dynamic scenarios.

Baoqing Yue, Zihan Zhu, Yifan Zhang + 3 more2026-03-06💻 cs

CONE: Embeddings for Complex Numerical Data Preserving Unit and Variable Semantics

This paper introduces CONE, a hybrid transformer encoder that utilizes a novel composite embedding algorithm to preserve the semantics of units and variables for complex numerical data, achieving state-of-the-art performance in numerical reasoning tasks across diverse domains.

Gyanendra Shrestha, Anna Pyayt, Michael Gubanov2026-03-06💻 cs

KindSleep: Knowledge-Informed Diagnosis of Obstructive Sleep Apnea from Oximetry

KindSleep is a deep learning framework that integrates clinical knowledge with single-channel oximetry and patient data to accurately diagnose obstructive sleep apnea, achieving superior performance and enhanced transparency across large, diverse datasets.

Micky C Nnamdi, Wenqi Shi, Cheng Wan + 4 more2026-03-06💻 cs

Evaluating GPT-5 as a Multimodal Clinical Reasoner: A Landscape Commentary

This landscape commentary evaluates the GPT-5 family against GPT-4o, revealing substantial improvements in expert-level textual reasoning and multimodal synthesis that approach state-of-the-art performance in tasks like mammography, while highlighting that generalist models still lag behind specialized systems in perception-critical domains such as neuroradiology.

Alexandru Florea, Shansong Wang, Mingzhe Hu + 5 more2026-03-06💻 cs

ConTSG-Bench: A Unified Benchmark for Conditional Time Series Generation

This paper introduces ConTSG-Bench, a unified benchmark featuring a large-scale, multi-modal dataset and comprehensive metrics to systematically evaluate and analyze the performance, limitations, and future directions of conditional time series generation models.

Shaocheng Lan, Shuqi Gu, Zhangzhi Xiong + 1 more2026-03-06💻 cs

Distributional Reinforcement Learning with Information Bottleneck for Uncertainty-Aware DRAM Equalization

This paper proposes a distributional risk-sensitive reinforcement learning framework that integrates Information Bottleneck representations and Conditional Value-at-Risk optimization to achieve certified worst-case DRAM equalizer performance with significant speedups and uncertainty quantification, outperforming existing methods by up to 89.1% on real-world memory data.

Muhammad Usama, Dong Eui Chang2026-03-06💻 cs

Distributional Equivalence in Linear Non-Gaussian Latent-Variable Cyclic Causal Models: Characterization and Learning

This paper presents the first structural-assumption-free causal discovery method for linear non-Gaussian latent-variable cyclic models by establishing a graphical criterion for distributional equivalence, introducing edge rank constraints, and providing an algorithm to recover models up to this equivalence class.

Haoyue Dai, Immanuel Albrecht, Peter Spirtes + 1 more2026-03-06💻 cs

Diffusion Policy through Conditional Proximal Policy Optimization

This paper introduces a novel and efficient on-policy reinforcement learning method that trains diffusion policies by aligning policy iteration with the diffusion process, thereby overcoming computational bottlenecks in log-likelihood estimation while enabling multimodal behavior generation and entropy regularization across diverse benchmark tasks.

Ben Liu, Shunpeng Yang, Hua Chen2026-03-06💻 cs

Guiding Diffusion-based Reconstruction with Contrastive Signals for Balanced Visual Representation

This paper proposes Diffusion Contrastive Reconstruction (DCR), a method that injects contrastive signals derived from reconstructed images into the diffusion process to resolve gradient conflicts and jointly optimize both discriminative and detail-perceptive abilities, thereby overcoming the limitations of CLIP's visual encoder for balanced visual representation.

Boyu Han, Qianqian Xu, Shilong Bao + 4 more2026-03-06💻 cs

The Inductive Bias of Convolutional Neural Networks: Locality and Weight Sharing Reshape Implicit Regularization

This paper demonstrates that the architectural inductive biases of locality and weight sharing in convolutional neural networks fundamentally alter implicit regularization by coupling learned filters to low-dimensional patch manifolds, thereby enabling generalization on high-dimensional spherical data where fully connected networks provably fail.

Tongtong Liang, Esha Singh, Rahul Parhi + 2 more2026-03-06💻 cs

WhisperAlign: Word-Boundary-Aware ASR and WhisperX-Anchored Pyannote Diarization for Long-Form Bengali Speech

This paper presents WhisperAlign, a solution for the DL Sprint 4.0 that combines word-boundary-aware ASR using whisper-timestamped chunking and domain-fine-tuned Pyannote diarization anchored by WhisperX to achieve high-accuracy transcription and speaker separation for long-form Bengali speech.

Aurchi Chowdhury, Rubaiyat -E-Zaman, Sk. Ashrafuzzaman Nafees2026-03-06💻 cs

Quadratic polarity and polar Fenchel-Young divergences from the canonical Legendre polarity

This paper establishes a unified framework linking quadratic polarities to deformed Legendre transformations via linear algebra on homogeneous coordinates, defines polar divergences that generalize Fenchel-Young and Bregman divergences, and elucidates the reference duality in information geometry through total polar Fenchel-Young divergences.

Frank Nielsen, Basile Plus-Gourdon, Mahito Sugiyama2026-03-06💻 cs

On the Strengths and Weaknesses of Data for Open-set Embodied Assistance

This paper investigates the generalization capabilities of a multimodal foundation model fine-tuned on diverse synthetic interactive data for the novel task of Open-Set Corrective Assistance, demonstrating that effective open-set assistive intelligence requires datasets encompassing multimodal grounding, defect inference, and exposure to varied scenarios.

Pradyumna Tambwekar, Andrew Silva, Deepak Gopinath + 3 more2026-03-06🤖 cs.AI

Mitigating Instance Entanglement in Instance-Dependent Partial Label Learning

This paper proposes the Class-specific Augmentation based Disentanglement (CAD) framework to mitigate instance entanglement in instance-dependent partial label learning by employing intra-class feature alignment and inter-class weighted penalty mechanisms to clarify class boundaries and reduce confusion.

Rui Zhao, Bin Shi, Kai Sun + 1 more2026-03-06🤖 cs.LG

Multilevel Training for Kolmogorov Arnold Networks

This paper introduces a multilevel training framework for Kolmogorov-Arnold Networks (KANs) that leverages their structural equivalence to multichannel MLPs and the properties of spline basis functions to create a properly nested hierarchy of models, resulting in orders-of-magnitude improvements in training accuracy and speed, particularly for physics-informed neural networks.

Ben S. Southworth, Jonas A. Actor, Graham Harper + 1 more2026-03-06🔢 math

Missingness Bias Calibration in Feature Attribution Explanations

This paper introduces MCal, a lightweight post-hoc method that effectively corrects missingness bias in feature attribution explanations by fine-tuning a simple linear head on frozen models, outperforming or matching expensive retraining approaches across diverse medical benchmarks.

Shailesh Sridhar, Anton Xue, Eric Wong2026-03-06🤖 cs.LG

Why Is RLHF Alignment Shallow? A Gradient Analysis

This paper proves that standard RLHF alignment is inherently shallow because gradient signals vanish once a sequence's harmfulness is determined, and it proposes a recovery penalty objective to ensure alignment gradients persist throughout the entire generation process.

Robin Young2026-03-06🤖 cs.LG

Osmosis Distillation: Model Hijacking with the Fewest Samples

This paper introduces Osmosis Distillation, a novel model hijacking attack that exploits synthetic datasets generated by dataset distillation methods to compromise deep learning models in transfer learning with high success rates using only a few poisoned samples while maintaining utility on original tasks.

Yuchen Shi, Huajie Chen, Heng Xu, Zhiquan Liu, Jialiang Shen, Chi Liu, Shuai Zhou, Tianqing Zhu, Wanlei Zhou2026-03-06🔒 cs.CR

Causally Robust Reward Learning from Reason-Augmented Preference Feedback

This paper introduces ReCouPLe, a lightweight framework that leverages natural language rationales as causal guidance to train reward models that are robust to spurious correlations and capable of zero-shot transfer to novel tasks, significantly outperforming baselines in reward accuracy and downstream policy performance under distribution shifts.

Minjune Hwang, Yigit Korkmaz, Daniel Seita + 1 more2026-03-06🤖 cs.AI

Interpretable Pre-Release Baseball Pitch Type Anticipation from Broadcast 3D Kinematics

This paper presents a scalable, interpretable framework that achieves 80.4% accuracy in classifying eight professional baseball pitch types using only monocular 3D body kinematics, revealing that upper-body mechanics—particularly wrist position and trunk tilt—are the primary predictors while establishing an empirical ceiling for grip-based distinctions.

Jerrin Bright, Michelle Lu, John Zelek2026-03-06🤖 cs.AI

← Previous Next →