Revealing Behavioral Plasticity in Large Language Models: A Token-Conditional Perspective

This paper introduces Token-Conditioned Reinforcement Learning (ToCoRL), a framework that leverages the intrinsic behavioral plasticity of Large Language Models to internalize and stabilize inference-time adaptations, enabling precise control over behavioral modes like switching from reasoning to direct answering without degrading overall capabilities.

Liyuan Mao, Le Yu, Jing Zhou, Chujie Zheng, Bowen Yu, Chang Gao, Shixuan Liu, An Yang, Weinan Zhang, JunYang Lin2026-03-10🤖 cs.LG

Meta-RL with Shared Representations Enables Fast Adaptation in Energy Systems

This paper introduces a novel Meta-RL framework featuring a hybrid actor-critic architecture with shared state representations and parameter-sharing mechanisms that significantly enhances sample efficiency and fast adaptation in non-stationary environments, as validated by superior performance on a decade-long real-world Building Energy Management Systems dataset.

Théo Zangato, Aomar Osmani, Pegah Alizadeh2026-03-10🤖 cs.LG

SYNAPSE: Framework for Neuron Analysis and Perturbation in Sequence Encoding

The paper introduces SYNAPSE, a systematic, training-free framework that analyzes and stress-tests Transformer models by extracting layer representations and applying forward-hook interventions to reveal domain-independent internal organization, functional stability through redundant neuron subsets, and specific vulnerabilities to small manipulations.

Jesús Sánchez Ochoa, Enrique Tomás Martínez Beltrán, Alberto Huertas Celdrán2026-03-10🤖 cs.LG

A prospective clinical feasibility study of a conversational diagnostic AI in an ambulatory primary care clinic

This prospective feasibility study demonstrates that a conversational AI system (AMIE) can safely and effectively conduct clinical history-taking and generate diagnostic suggestions in a real-world urgent care setting, achieving high patient satisfaction and diagnostic accuracy comparable to primary care providers while requiring no real-time human intervention.

Peter Brodeur, Jacob M. Koshy, Anil Palepu, Khaled Saab, Ava Homiar, Roma Ruparel, Charles Wu, Ryutaro Tanno, Joseph Xu, Amy Wang, David Stutz, Hannah M. Ferrera, David Barrett, Lindsey Crowley, Jihyeon Lee, Spencer E. Rittner, Ellery Wulczyn, Selena K. Zhang, Elahe Vedadi, Christine G. Kohn, Kavita Kulkarni, Vinay Kadiyala, Sara Mahdavi, Wendy Du, Jessica Williams, David Feinbloom, Renee Wong, Tao Tu, Petar Sirkovic, Alessio Orlandi, Christopher Semturs, Yun Liu, Juraj Gottweis, Dale R. Webster, Joëlle Barral, Katherine Chou, Pushmeet Kohli, Avinatan Hassidim, Yossi Matias, James Manyika, Rob Fields, Jonathan X. Li, Marc L. Cohen, Vivek Natarajan, Mike Schaekermann, Alan Karthikesalingam, Adam Rodman2026-03-10🤖 cs.LG

The Boiling Frog Threshold: Criticality and Blindness in World Model-Based Anomaly Detection Under Gradual Drift

This paper investigates world model-based anomaly detection under gradual observation drift, revealing a universal sharp detection threshold that depends on the interaction between detector sensitivity, noise floor, and environment-specific dynamics, while identifying critical failure modes such as the undetectability of sinusoidal drift and agent collapse prior to detection.

Zhe Hong2026-03-10🤖 cs.LG

Data-Driven Priors for Uncertainty-Aware Deterioration Risk Prediction with Multimodal Data

This paper introduces MedCertAIn\texttt{MedCertAIn}, a novel predictive uncertainty framework that leverages data-driven priors derived from cross-modal similarities and modality-specific corruptions to significantly enhance both the performance and reliability of multimodal in-hospital risk prediction using MIMIC-IV and MIMIC-CXR datasets.

L. Julián Lechuga López, Tim G. J. Rudner, Farah E. Shamout2026-03-10🤖 cs.LG

MUSA-PINN: Multi-scale Weak-form Physics-Informed Neural Networks for Fluid Flow in Complex Geometries

The paper introduces MUSA-PINN, a multi-scale weak-form Physics-Informed Neural Network that reformulates PDE constraints as integral conservation laws over hierarchical control volumes to overcome convergence pathologies and significantly improve accuracy and mass conservation in fluid flow simulations within complex geometries like Triply Periodic Minimal Surfaces.

Weizheng Zhang, Xunjie Xie, Hao Pan, Xiaowei Duan, Bingteng Sun, Qiang Du, Lin lu2026-03-10🤖 cs.LG

Integrating Lagrangian Neural Networks into the Dyna Framework for Reinforcement Learning

This paper proposes a model-based reinforcement learning framework that integrates Lagrangian neural networks into the Dyna architecture to enforce physical laws and improve prediction accuracy, demonstrating that state-estimation-based optimization converges faster than stochastic gradient-based methods during training.

Shreya Das, Kundan Kumar, Muhammad Iqbal, Outi Savolainen, Dominik Baumann, Laura Ruotsalainen, Simo Särkkä2026-03-10🤖 cs.LG

STRIDE: Structured Lagrangian and Stochastic Residual Dynamics via Flow Matching

The paper proposes STRIDE, a hybrid dynamics learning framework that combines a Lagrangian Neural Network for energy-consistent rigid-body mechanics with Conditional Flow Matching for stochastic residual interaction forces, achieving significant improvements in long-horizon prediction and contact force accuracy for robotic systems in unstructured environments.

Prakrut Kotecha, Ganga Nair B, Shishir Kolathaya2026-03-10🤖 cs.LG