Some Super-approximation Rates of ReLU Neural Networks for Korobov Functions

This paper establishes nearly optimal super-approximation error bounds of order 2m2m and 2m22m-2 in LpL_p and Wp1W^1_p norms, respectively, for ReLU neural networks approximating Korobov functions by leveraging sparse grid finite elements and bit extraction, thereby demonstrating that neural network expressivity effectively overcomes the curse of dimensionality.

Yuwen Li, Guozhi Zhang2026-03-06💻 cs

Kernel Based Maximum Entropy Inverse Reinforcement Learning for Mean-Field Games

This paper proposes a kernel-based maximum causal entropy inverse reinforcement learning framework for infinite-horizon stationary mean-field games that models unknown rewards in a reproducing kernel Hilbert space to capture nonlinear structures, proves the algorithm's theoretical consistency via Fréchet differentiability, and demonstrates superior policy recovery performance over linear baselines in traffic routing scenarios while extending the approach to finite-horizon non-stationary settings.

Berkay Anahtarci, Can Deha Kariksiz, Naci Saldi2026-03-06🔢 math

Diffusion-Based Impedance Learning for Contact-Rich Manipulation Tasks

This paper introduces Diffusion-Based Impedance Learning, a framework that combines a Transformer-based diffusion model with energy-consistent impedance control to enable robots to learn and adapt contact-rich manipulation behaviors from teleoperated demonstrations, achieving high-precision performance and robust generalization in tasks like peg-in-hole insertion.

Noah Geiger, Tamim Asfour, Neville Hogan + 1 more2026-03-06💻 cs

Complexity-Regularized Proximal Policy Optimization

This paper introduces Complexity-Regularized Proximal Policy Optimization (CR-PPO), a novel algorithm that replaces standard entropy regularization with a self-regulating complexity term—defined as the product of Shannon entropy and disequilibrium—to maintain beneficial stochasticity while reducing sensitivity to hyperparameter tuning and avoiding the overriding of reward signals.

Luca Serfilippi, Giorgio Franceschelli, Antonio Corradi + 1 more2026-03-06💻 cs