cs.LG papers | Gist.Science

Regret-Optimal Q-Learning with Low Cost for Single-Agent and Federated Reinforcement Learning

This paper introduces two novel model-free algorithms, Q-EarlySettled-LowCost and FedQ-EarlySettled-LowCost, for single-agent and federated reinforcement learning that simultaneously achieve near-optimal regret, linear burn-in costs in state and action spaces, and logarithmic policy switching or communication costs, while also providing improved gap-dependent theoretical guarantees.

Haochen Zhang, Zhong Zheng, Lingzhou Xue2026-03-11🤖 cs.LG

Towards Robust Real-World Multivariate Time Series Forecasting: A Unified Framework for Dependency, Asynchrony, and Missingness

The paper introduces ChannelTokenFormer, a unified Transformer-based framework that simultaneously addresses the challenges of complex inter-channel dependencies, asynchronous sampling, and missing values to achieve robust real-world multivariate time series forecasting.

Jinkwan Jang, Hyungjin Park, Jinmyeong Choi, Taesup Kim2026-03-11🤖 cs.AI

Wavelet Scattering Transform and Fourier Representation for Offline Detection of Malicious Clients in Federated Learning

This paper introduces WAFFLE, a pre-training detection algorithm that utilizes Wavelet Scattering Transform or Fourier representations to identify and exclude malicious clients in Federated Learning using compressed, task-agnostic embeddings, thereby improving model robustness without accessing raw data.

Alessandro Licciardi, Davide Leo, Davide Carbone2026-03-11🤖 cs.LG

Uncovering Social Network Activity Using Joint User and Topic Interaction

This paper introduces the Mixture of Interacting Cascades (MIC), a joint user-topic interaction model based on marked multidimensional Hawkes processes that outperforms existing methods in modeling information spread and provides insightful visualizations of social network activity.

Gaspard Abel, Argyris Kalogeratos, Jean-Pierre Nadal, Julien Randon-Furling2026-03-11🤖 cs.LG

ConLID: Supervised Contrastive Learning for Low-Resource Language Identification

The paper proposes ConLID, a supervised contrastive learning approach that learns domain-invariant representations to significantly improve language identification performance for low-resource languages on out-of-domain data while maintaining accuracy for high-resource languages.

Negar Foroutan, Jakhongir Saydaliev, Ye Eun Kim, Antoine Bosselut2026-03-11🤖 cs.AI

Global Convergence of Iteratively Reweighted Least Squares for Robust Subspace Recovery

This paper establishes the first global linear convergence guarantees for a dynamic smoothing variant of Iteratively Reweighted Least Squares (IRLS) in robust subspace and affine subspace recovery, extending these theoretical results to nonconvex optimization on Riemannian manifolds and demonstrating their practical utility in low-dimensional neural network training.

Gilad Lerman, Kang Li, Tyler Maunu, Teng Zhang2026-03-11🤖 cs.LG

Service Placement in Small Cell Networks Using Distributed Best Arm Identification in Linear Bandits

This paper proposes a distributed multi-agent best-arm identification algorithm based on linear bandits to optimize service placement in small cell networks, enabling collaborative edge servers to efficiently identify the service that minimizes user latency under unknown demand and dynamic conditions.

Mariam Yahya, Aydin Sezgin, Setareh Maghsudi2026-03-11🤖 cs.LG

Convergence Rate for the Last Iterate of Stochastic Gradient Descent Schemes

This paper establishes new convergence rates for the last iterate of stochastic gradient descent and stochastic heavy ball methods in parametric settings with globally convex or non-convex objectives having $\gamma$ -Hölder gradients, utilizing discrete Gronwall's inequality to derive improved bounds without relying on the Robbins-Siegmund theorem.

Marcel Hudiani2026-03-11🤖 cs.LG

Operator Learning for Consolidation: An Architectural Comparison for DeepONet Variants

This study systematically evaluates and enhances DeepONet architectures for geotechnical consolidation problems, demonstrating that a physics-inspired, Fourier feature-enhanced model (Model 4) significantly outperforms standard configurations and achieves up to 1,000-fold computational speedups in 3D scenarios, thereby enabling efficient uncertainty quantification and advancing the integration of scientific machine learning in geotechnics.

Yongjin Choi, Chenying Liu, Jorge Macedo2026-03-11🤖 cs.LG

Langevin Flows for Modeling Neural Latent Dynamics

The paper introduces LangevinFlow, a physics-inspired sequential Variational Auto-Encoder that models neural latent dynamics using underdamped Langevin equations with a locally coupled oscillator potential, demonstrating superior performance over state-of-the-art baselines in capturing neural population dynamics and decoding behavioral metrics.

Yue Song, T. Anderson Keller, Yisong Yue, Pietro Perona, Max Welling2026-03-11🤖 cs.LG

Latent Policy Steering with Embodiment-Agnostic Pretrained World Models

This paper introduces Latent Policy Steering (LPS), a method that leverages embodiment-agnostic optical flow to pretrain a World Model on diverse datasets, which is then fine-tuned with limited target-embodiment demonstrations to steer and significantly improve visuomotor policies in low-data regimes.

Yiqi Wang, Mrinal Verghese, Jeff Schneider2026-03-11🤖 cs.AI

Multimodal LLM-assisted Evolutionary Search for Programmatic Control Policies

This paper introduces Multimodal Large Language Model-assisted Evolutionary Search (MLES), a novel framework that combines multimodal LLMs with evolutionary search and visual feedback to automatically generate transparent, verifiable, and human-aligned programmatic control policies that match the performance of deep reinforcement learning methods like PPO.

Qinglong Hu, Xialiang Tong, Mingxuan Yuan, Fei Liu, Zhichao Lu, Qingfu Zhang2026-03-11🤖 cs.LG

CTRL Your Shift: Clustered Transfer Residual Learning for Many Small Datasets

This paper introduces Clustered Transfer Residual Learning (CTRL), a meta-learning method that combines cross-domain residual learning with adaptive clustering to improve prediction accuracy and preserve source-level heterogeneity across numerous small datasets with distributional shifts, demonstrating superior performance over state-of-the-art benchmarks on five large-scale datasets including a Swiss asylum resettlement program.

Gauri Jain, Dominik Rothenhäusler, Kirk Bansak, Elisabeth Paulson2026-03-11🤖 cs.LG

Singing Syllabi with Virtual Avatars: Enhancing Student Engagement Through AI-Generated Music and Digital Embodiment

This paper proposes and evaluates a novel educational approach that uses AI-generated singing and virtual avatars to transform traditional text-based syllabi into engaging audiovisual performances, demonstrating that this method significantly improves student awareness and recall of critical course information.

Xinxing Wu2026-03-11🤖 cs.AI

MuFlex: A Scalable, Physics-based Platform for Multi-Building Flexibility Analysis and Coordination

MuFlex is a scalable, open-source, physics-based platform that integrates detailed EnergyPlus and Modelica building models with a standardized Reinforcement Learning interface to enable fair benchmarking and effective multi-building demand flexibility coordination, as demonstrated by a 12% peak demand reduction in a four-building case study.

Ziyan Wu, Ivan Korolija, Rui Tang2026-03-11⚡ eess

RF-Informed Graph Neural Networks for Accurate and Data-Efficient Circuit Performance Prediction

This paper introduces a lightweight, data-efficient Graph Neural Network framework that leverages RFIC domain-informed feature indexing and device-terminal graph abstractions to achieve state-of-the-art accuracy and superior cross-topology generalization in predicting the performance of diverse active radio frequency circuits.

Anahita Asadi, Leonid Popryho, Inna Partin-Vaisband2026-03-11🤖 cs.LG

Iterative In-Context Learning to Enhance LLMs Abstract Reasoning: The Case-Study of Algebraic Tasks

This paper proposes an iterative in-context learning methodology that optimizes few-shot example selection to significantly enhance large language models' systematic generalization and reasoning capabilities on algebraic tasks with non-standard rules, revealing that simpler examples can sometimes outperform complex ones.

Stefano Fioravanti, Matteo Zavatteri, Roberto Confalonieri, Kamyar Zeinalipour, Paolo Frazzetto, Alessandro Sperduti, Nicolò Navarin2026-03-11🤖 cs.LG

A Surrogate model for High Temperature Superconducting Magnets to Predict Current Distribution with Neural Network

This paper presents a fully connected residual neural network (FCRN) surrogate model trained on finite element method data to rapidly and accurately predict current density distributions and optimize the design of large-scale high-temperature superconducting magnets, overcoming the computational limitations of traditional simulations.

Mianjun Xiao, Peng Song, Yulong Liu, Cedric Korte, Ziyang Xu, Jiale Gao, Jiaqi Lu, Haoyang Nie, Qiantong Deng, Timing Qu2026-03-11🤖 cs.LG

Repulsive Monte Carlo on the sphere for the sliced Wasserstein distance

This paper investigates repulsive Monte Carlo methods for computing integrals on the unit sphere, specifically for the sliced Wasserstein distance, by benchmarking determinantal and repelled point processes while analyzing the UnifOrtho estimator to recommend randomized quasi-Monte Carlo for low dimensions and UnifOrtho for high dimensions.

Vladimir Petrovic, Rémi Bardenet, Agnès Desolneux2026-03-11🤖 cs.LG

Robot Control Stack: A Lean Ecosystem for Robot Learning at Scale

This paper introduces the Robot Control Stack (RCS), a lean and modular software ecosystem designed to bridge the gap between large-scale Vision-Language-Action model training and real-world robot deployment by unifying simulation and physical control, while validating its effectiveness through extensive evaluations of policies like Octo, OpenVLA, and Pi Zero.

Tobias Jülg, Pierre Krack, Seongjin Bien, Yannik Blei, Khaled Gamal, Ken Nakahara, Johannes Hechtl, Roberto Calandra, Wolfram Burgard, Florian Walter2026-03-11🤖 cs.LG

← Previous Next →