PvP: Data-Efficient Humanoid Robot Learning with Proprioceptive-Privileged Contrastive Representations

This paper introduces PvP, a proprioceptive-privileged contrastive learning framework that enhances data efficiency and robustness in humanoid robot whole-body control by learning compact task-relevant representations without hand-crafted augmentations, supported by the new SRL4Humanoid evaluation framework.

Mingqi Yuan, Tao Yu, Haolin Song, Bo Li, Xin Jin, Hua Chen, Wenjun Zeng2026-03-12🤖 cs.LG

Saddle-to-Saddle Dynamics Explains A Simplicity Bias Across Neural Network Architectures

This paper presents a unifying theoretical framework demonstrating that gradient descent in diverse neural network architectures exhibits a simplicity bias by following saddle-to-saddle dynamics, which iteratively evolve near invariant manifolds to progressively learn solutions of increasing complexity such as higher rank, more kinks, or additional kernels and attention heads.

Yedi Zhang, Andrew Saxe, Peter E. Latham2026-03-12🤖 cs.LG

Gradient Dynamics of Attention: How Cross-Entropy Sculpts Bayesian Manifolds

This paper provides a first-order analysis demonstrating that cross-entropy training in transformers induces a coupled specialization of attention routing and value updates—functioning as a two-timescale EM procedure—that sculpts low-dimensional Bayesian manifolds, thereby explaining how gradient-based optimization enables precise probabilistic reasoning.

Naman Agarwal, Siddhartha R. Dalal, Vishal Misra2026-03-12📊 stat

Inferring Clinically Relevant Molecular Subtypes of Pancreatic Cancer from Routine Histopathology Using Deep Learning

The paper introduces PanSubNet, an interpretable deep learning framework that accurately predicts clinically relevant basal-like and classical molecular subtypes of pancreatic ductal adenocarcinoma directly from routine H&E-stained histology slides, offering a cost-effective and rapid alternative to traditional RNA-seq-based methods for precision oncology.

Abdul Rehman Akbar, Alejandro Levya, Ashwini Esnakula, Elshad Hasanov, Anne Noonan, Lingbin Meng, Susan Tsai, Vaibhav Sahai, Midhun Malla, Sarbajit Mukherjee, Upender Manne, Anil Parwani, Wei Chen, Ashish Manne, Muhammad Khalid Khan Niazi2026-03-12⚡ eess

Sampling via Stochastic Interpolants by Langevin-based Velocity and Initialization Estimation in Flow ODEs

This paper proposes a novel sampling method for unnormalized Boltzmann densities that leverages a sequence of Langevin samplers to efficiently simulate a probability flow ODE derived from linear stochastic interpolants by generating intermediate samples and robustly estimating the velocity field, while providing theoretical convergence guarantees and demonstrating effectiveness on challenging multimodal distributions and Bayesian inference tasks.

Chenguang Duan, Yuling Jiao, Gabriele Steidl, Christian Wald, Jerry Zhijian Yang, Ruizhe Zhang2026-03-12📊 stat