Input-to-State Stable Coupled Oscillator Networks for Closed-form Model-based Control in Latent Space

This paper introduces a novel Coupled Oscillator Network (CON) model that overcomes key limitations in latent-space control by ensuring Lagrangian structure, global input-to-state stability, and an invertible input-force mapping, thereby enabling efficient closed-form control strategies for complex mechanical systems using only raw visual feedback.

Maximilian Stölzle, Cosimo Della Santina2026-03-10🤖 cs.LG

xTED: Cross-Domain Adaptation via Diffusion-Based Trajectory Editing

The paper proposes xTED, a flexible cross-domain adaptation framework that utilizes a diffusion model to edit and transform source domain trajectories into target domain distributions at the data level, thereby bridging domain gaps and enhancing policy learning performance without requiring complex domain-specific modeling.

Haoyi Niu, Qimao Chen, Tenglong Liu, Jianxiong Li, Guyue Zhou, Yi Zhang, Jianming Hu, Xianyuan Zhan2026-03-10🤖 cs.LG

Landscape of Policy Optimization for Finite Horizon MDPs with General State and Action

This paper establishes that policy gradient methods achieve global convergence with non-asymptotic sample complexity guarantees for finite-horizon MDPs with general state and action spaces by proving the Polyak-Łojasiewicz-Kurdyka condition holds, thereby providing the first theoretical foundations for optimizing multi-period inventory and stochastic cash balance systems.

Xin Chen, Yifan Hu, Minda Zhao2026-03-10🤖 cs.LG

Neural delay differential equations: learning non-Markovian closures for partially known dynamical systems

This paper introduces a constant-lag Neural Delay Differential Equations (NDDEs) framework, inspired by the Mori-Zwanzig formalism, to effectively learn non-Markovian dynamics from partially observed data by identifying memory effects through time delays, demonstrating superior performance over existing methods like LSTMs and ANODEs across synthetic, chaotic, and experimental datasets.

Thibault Monsel, Onofrio Semeraro, Lionel Mathelin, Guillaume Charpiat2026-03-10🤖 cs.LG

Open-World Reinforcement Learning over Long Short-Term Imagination

This paper introduces LS-Imagine, a novel approach that enhances open-world reinforcement learning by constructing a long short-term world model with goal-conditioned jumpy transitions and affordance maps, thereby enabling agents to efficiently explore vast state spaces and optimize for long-horizon rewards, as demonstrated by significant improvements in MineDojo.

Jiajian Li, Qi Wang, Yunbo Wang, Xin Jin, Yang Li, Wenjun Zeng, Xiaokang Yang2026-03-10🤖 cs.LG

Transformers as Implicit State Estimators: In-Context Learning in Dynamical Systems

This paper demonstrates that frozen transformers, when used in an in-context learning setting, can implicitly infer hidden states to accurately predict the outputs of both linear and nonlinear dynamical systems from noisy observations, achieving performance comparable to optimal and heuristic Bayesian filters without requiring test-time gradient updates or explicit knowledge of the system model.

Usman Akram, Haris Vikalo2026-03-10🤖 cs.LG

A Learned Proximal Alternating Minimization Algorithm and Its Induced Network for a Class of Two-block Nonconvex and Nonsmooth Optimization

This paper proposes a learned proximal alternating minimization (LPAM) algorithm and its corresponding interpretable network (LPAM-net) for solving two-block nonconvex and nonsmooth optimization problems, proving their convergence to Clarke stationary points and demonstrating superior performance in joint multi-modal MRI reconstruction.

Yunmei Chen, Lezhi Liu, Lei Zhang2026-03-10🤖 cs.LG

Autoassociative Learning of Structural Representations for Modeling and Classification in Medical Imaging

This paper introduces a neurosymbolic system that reconstructs medical images using visual primitives to generate high-level structural explanations, achieving superior classification accuracy and transparency compared to conventional deep learning models in diagnosing histological abnormalities.

Zuzanna Buchnajzer, Kacper Dobek, Stanisław Hapke, Daniel Jankowski, Krzysztof Krawiec2026-03-10🤖 cs.LG

Exploring Embedding Priors in Prompt-Tuning for Improved Interpretability and Control

This paper investigates the impact of embedding collapse in Prompt-Tuning by introducing embedding priors, revealing that models can effectively utilize embeddings from diverse activation regions and that distinct activation clusters exist for different task types, suggesting controllable posteriors could enhance interpretability and serve as a foundation for tasks like chain-of-thought distillation.

Sergey Sedov, Sumanth Bharadwaj Hachalli Karanam, Venu Gopal Kadamba2026-03-10🤖 cs.LG

From Pixels to Predicates: Learning Symbolic World Models via Pretrained Vision-Language Models

This paper proposes a method that leverages pretrained vision-language models to learn compact, abstract symbolic world models from limited visual demonstrations, enabling zero-shot generalization and long-horizon planning for complex robotic tasks across novel objects, environments, and goals.

Ashay Athalye, Nishanth Kumar, Tom Silver, Yichao Liang, Jiuguang Wang, Tomás Lozano-Pérez, Leslie Pack Kaelbling2026-03-10🤖 cs.LG

UFGraphFR: Graph Federation Recommendation System based on User Text description features

UFGraphFR is a novel federated recommendation framework that enhances privacy-preserving personalization by transforming user data into semantic text vectors to reconstruct global user relationship graphs on the server and employing Transformer architectures for behavior sequence modeling, thereby significantly outperforming existing baselines in accuracy and personalization.

Xudong Wang, Qingbo Hao, Yingyuan Xiao2026-03-10🤖 cs.LG