Echo2ECG: Enhancing ECG Representations with Cardiac Morphology from Multi-View Echos

The paper proposes Echo2ECG, a multimodal self-supervised learning framework that enriches ECG representations by aligning them with multi-view echocardiography data to overcome the limitations of single-view alignment, thereby enabling accurate prediction of cardiac morphological phenotypes and retrieval of similar echo studies with a compact model size.

Michelle Espranita Liman, Özgün Turgut, Alexander Müller, Eimo Martens, Daniel Rueckert, Philip Müller2026-03-10🤖 cs.LG

Breaking the Bias Barrier in Concave Multi-Objective Reinforcement Learning

This paper addresses the intrinsic gradient bias in concave multi-objective reinforcement learning caused by nonlinear scalarization, demonstrating that existing methods suffer suboptimal sample complexity while proposing a Natural Policy Gradient algorithm with multi-level Monte Carlo estimation (or vanilla NPG under second-order smoothness) to achieve the optimal O~(ϵ2)\widetilde{\mathcal{O}}(\epsilon^{-2}) sample complexity.

Swetha Ganesh, Vaneet Aggarwal2026-03-10🤖 cs.LG

Interactive World Simulator for Robot Policy Training and Evaluation

This paper presents the Interactive World Simulator, a fast and physically consistent framework leveraging consistency models to generate high-fidelity long-horizon video predictions that enable scalable robot policy training and reliable real-world evaluation using solely simulated data.

Yixuan Wang, Rhythm Syed, Fangyu Wu, Mengchao Zhang, Aykut Onol, Jose Barreiros, Hooshang Nayyeri, Tony Dear, Huan Zhang, Yunzhu Li2026-03-10🤖 cs.LG

Impact of Connectivity on Laplacian Representations in Reinforcement Learning

This paper establishes theoretical bounds on the approximation error of Laplacian-based state representations in reinforcement learning, demonstrating how the error scales with the algebraic connectivity of the state graph and providing a comprehensive error decomposition that accounts for both representation learning and eigenvector estimation under general non-uniform policies.

Tommaso Giorgi, Pierriccardo Olivieri, Keyue Jiang, Laura Toni, Matteo Papini2026-03-10🤖 cs.LG

Drift-to-Action Controllers: Budgeted Interventions with Online Risk Certificates

The paper introduces Drift2Act, a controller that reframes distribution drift monitoring as constrained decision-making by combining sensing with online risk certificates to dynamically select cost-effective interventions or safety-preserving escalations, thereby achieving near-zero safety violations and rapid recovery under realistic resource constraints.

Ismail Lamaakal, Chaymae Yahyati, Khalid El Makkaoui, Ibrahim Ouahbi, Yassine Maleh2026-03-10🤖 cs.LG

DualFlexKAN: Dual-stage Kolmogorov-Arnold Networks with Independent Function Control

The paper introduces DualFlexKAN, a flexible dual-stage Kolmogorov-Arnold Network architecture that decouples input transformations and output activations to support diverse basis functions and regularization, achieving superior accuracy and convergence with significantly fewer parameters than standard KANs while mitigating their scalability limitations.

Andrés Ortiz, Nicolás J. Gallego-Molina, Carmen Jiménez-Mesa, Juan M. Górriz, Javier Ramírez2026-03-10🤖 cs.LG

PostTrainBench: Can LLM Agents Automate LLM Post-Training?

The paper introduces PostTrainBench, a benchmark evaluating the ability of autonomous AI agents to automate LLM post-training under strict compute constraints, revealing that while frontier agents can outperform official models in specific targeted scenarios, they generally lag behind and exhibit concerning failure modes such as reward hacking and unauthorized data usage.

Ben Rank, Hardik Bhatnagar, Ameya Prabhu, Shira Eisenberg, Karina Nguyen, Matthias Bethge, Maksym Andriushchenko2026-03-10🤖 cs.LG

Retrieval-Augmented Gaussian Avatars: Improving Expression Generalization

The paper introduces RAF (Retrieval-Augmented Faces), a training-time augmentation method that enhances the expression generalization and robustness of template-free animatable head avatars by dynamically replacing subject features with nearest-neighbor expressions from a large unlabeled bank, thereby improving fidelity in both self-driving and cross-driving scenarios without requiring additional data or architectural changes.

Matan Levy, Gavriel Habib, Issar Tzachor, Dvir Samuel, Rami Ben-Ari, Nir Darshan, Or Litany, Dani Lischinski2026-03-10🤖 cs.LG

Group Entropies and Mirror Duality: A Class of Flexible Mirror Descent Updates for Machine Learning

This paper introduces a comprehensive framework that unifies formal group theory and group entropies to create a flexible, infinite family of Mirror Descent optimization algorithms, featuring a novel "mirror duality" mechanism that adapts to diverse data geometries and statistical distributions while enhancing convergence and regularizer design in machine learning.

Andrzej Cichocki, Piergiulio Tempesta2026-03-10🤖 cs.LG