cs.LG papers | Gist.Science

MSPT: Efficient Large-Scale Physical Modeling via Parallelized Multi-Scale Attention

The paper introduces MSPT, a novel architecture that leverages ball trees for efficient spatial partitioning and a dual-scale attention mechanism to enable scalable, memory-efficient, and high-accuracy physical simulations of millions of points on a single GPU across diverse industrial applications.

Pedro M. P. Curvo, Jan-Willem van de Meent, Maksim Zhdanov2026-03-10🤖 cs.LG

Dual Randomized Smoothing: Beyond Global Noise Variance

This paper proposes Dual Randomized Smoothing, a novel framework that overcomes the limitations of global noise variance by introducing input-dependent noise variances via a locally constant variance estimator, thereby achieving superior certified robustness across both small and large perturbation radii on CIFAR-10 and ImageNet.

Chenhao Sun, Yuhao Mao, Martin Vechev2026-03-10🤖 cs.LG

Dual-Robust Cross-Domain Offline Reinforcement Learning Against Dynamics Shifts

This paper introduces DROCO, a novel dual-robust cross-domain offline reinforcement learning algorithm that addresses both train-time and test-time dynamics shifts by employing a robust cross-domain Bellman operator alongside dynamic value penalty and Huber loss to enhance policy robustness and prevent value estimation errors.

Zhongjian Qiao, Rui Yang, Jiafei Lyu, Xiu Li, Zhongxiang Dai, Zhuoran Yang, Siyang Gao, Shuang Qiu2026-03-10🤖 cs.LG

Evolving Diffusion and Flow Matching Policies for Online Reinforcement Learning

The paper introduces GoRL, an algorithm-agnostic framework that stabilizes online reinforcement learning with expressive generative policies by decoupling optimization in a tractable latent space from action synthesis via a conditional generative decoder, achieving superior performance on challenging continuous-control tasks.

Chubin Zhang, Zhenglin Wan, Feng Chen, Fuchao Yang, Lang Feng, Yaxin Zhou, Xingrui Yu, Yang You, Ivor Tsang, Bo An2026-03-10🤖 cs.LG

Beyond Additivity: Sparse Isotonic Shapley Regression toward Nonlinear Explainability

This paper introduces Sparse Isotonic Shapley Regression (SISR), a unified framework that simultaneously learns a monotonic transformation to restore additivity and enforces sparsity constraints to provide robust, efficient, and theoretically grounded feature attributions for nonlinear, high-dimensional Explainable AI.

Jialai She2026-03-10🤖 cs.LG

Two-Step Data Augmentation for Masked Face Detection and Recognition: Turning Fake Masks to Real

This paper presents a two-step generative data augmentation framework combining rule-based mask warping and unpaired image-to-image translation to address the scarcity of masked face datasets, achieving performance improvements with minimal training data while explicitly noting its origins as a resource-constrained coursework project that lacked downstream quantitative evaluation.

Yan Yang, George Bebis, Mircea Nicolescu2026-03-10🤖 cs.LG

SALVE: Sparse Autoencoder-Latent Vector Editing for Mechanistic Control of Neural Networks

The paper introduces SALVE, a unified framework that combines sparse autoencoders and feature-level saliency mapping to discover, validate, and precisely edit neural network weights, enabling interpretable and robust control over both convolutional and transformer-based models.

Vegard Flovik2026-03-10🤖 cs.LG

Meta-RL Induces Exploration in Language Agents

The paper introduces LaMer, a Meta-RL framework that enhances language agents' ability to actively explore and adapt to novel environments at test time through cross-episode training and in-context policy reflection, significantly outperforming standard RL baselines across diverse tasks.

Yulun Jiang, Liangze Jiang, Damien Teney, Michael Moor, Maria Brbic2026-03-10🤖 cs.LG

ReDepth Anything: Test-Time Depth Refinement via Self-Supervised Re-lighting

Re-Depth Anything is a test-time self-supervised framework that enhances monocular depth estimation by fusing foundation models with large-scale 2D diffusion priors to perform label-free refinement via generative re-lighting and Score Distillation Sampling, achieving state-of-the-art results without direct depth tensor optimization.

Ananta R. Bhattarai, Helge Rhodin2026-03-10🤖 cs.LG

Concurrent training methods for Kolmogorov-Arnold networks: Disjoint datasets and FPGA implementation

This paper proposes three complementary strategies—pre-training tailored to Newton-Kaczmarz updates, training on disjoint data subsets with model merging, and FPGA-based parallelization—to overcome the sequential bottlenecks in Kolmogorov-Arnold network training and significantly accelerate convergence.

Andrew Polar, Michael Poluektov2026-03-10🤖 cs.LG

Latent Sculpting for Zero-Shot Generalization: A Manifold Learning Approach to Out-of-Distribution Anomaly Detection

The paper proposes "Latent Sculpting," a hierarchical two-stage architecture that combines a Transformer-based encoder with a Binary Latent Sculpting loss and a Masked Autoregressive Flow to enforce explicit geometric boundaries on benign data, achieving robust zero-shot generalization and high detection rates for out-of-distribution cyberattacks on the CIC-IDS-2017 benchmark.

Rajeeb Thapa Chhetri, Saurab Thapa, Avinash Kumar, Zhixiong Chen2026-03-10🤖 cs.LG

Certifying the Right to Be Forgotten: Primal-Dual Optimization for Sample and Label Unlearning in Vertical Federated Learning

This paper proposes FedORA, a primal-dual optimization framework that enables efficient and theoretically certified sample and label unlearning in Vertical Federated Learning by introducing a novel uncertainty-promoting loss function and adaptive strategies to minimize computational overhead while preserving model utility.

Yu Jiang, Xindi Tong, Ziyao Liu, Xiaoxi Zhang, Kwok-Yan Lam, Chee Wei Tan2026-03-10🤖 cs.LG

Network Traffic Analysis with Process Mining: The UPSIDE Case Study

This paper proposes a process mining-based method that analyzes online gaming network traffic to unsupervisedly characterize device states as interpretable Petri nets and accurately classify different video games, demonstrating its effectiveness through the UPSIDE case study involving Clash Royale and Rocket League.

Francesco Vitale, Paolo Palmiero, Massimiliano Rak, Nicola Mazzocca2026-03-10🤖 cs.LG

Topological Spatial Graph Coarsening

This paper proposes a parameter-free, equivariant topological spatial graph coarsening method that reduces graph size by collapsing short edges while preserving key topological features through a novel triangle-aware graph filtration adapted from persistent diagrams.

Anna Calissano, Etienne Lasalle2026-03-10🤖 cs.LG

Sparse Offline Reinforcement Learning with Corruption Robustness

This paper proposes actor-critic methods with sparse robust estimator oracles to achieve the first non-vacuous guarantees for learning near-optimal policies in high-dimensional sparse offline reinforcement learning under strong data corruption and single-policy concentrability, overcoming the limitations of traditional Least Square Value Iteration approaches in such regimes.

Nam Phuong Tran, Andi Nika, Goran Radanovic, Long Tran-Thanh, Debmalya Mandal2026-03-10🤖 cs.LG

Group Cross-Correlations with Faintly Constrained Filters

This paper proposes weaker constraints on filters for group convolutional neural networks that reduce the required number of nodes while resolving incompatibilities with non-compact stabilizers and generalizing results to non-transitive group actions and non-unimodular groups.

Benedikt Fluhr2026-03-10🤖 cs.LG

Reliable Grid Forecasting: State Space Models for Safety-Critical Energy Systems

This paper introduces an operator-legible evaluation framework centered on under-prediction risk to demonstrate that standard accuracy metrics fail to capture safety-critical grid forecasting needs, revealing that while explicit weather integration improves reliability, unconstrained probabilistic models often induce "fake safety" through excessive inflation, a problem solved by new Bias/OPR-constrained objectives.

Sunki Hong, Jisoo Lee2026-03-10⚡ eess

From Mice to Trains: Amortized Bayesian Inference on Graph Data

This paper proposes an amortized Bayesian inference framework for graph-structured data that combines permutation-invariant summary networks with neural posterior estimators to enable fast, likelihood-free inference on node, edge, and graph-level parameters, demonstrating its effectiveness through evaluations on synthetic benchmarks and real-world applications in biology and logistics.

Svenja Jedhoff, Elizaveta Semenova, Aura Raulo, Anne Meyer, Paul-Christian Bürkner2026-03-10🤖 cs.LG

DevBench: A Realistic, Developer-Informed Benchmark for Code Generation Models

DevBench is a realistic, telemetry-driven benchmark comprising 1,800 instances across six languages that evaluates LLMs on code completion tasks with a focus on ecological validity, contamination-free assessment, and detailed diagnostic insights to guide practical model selection and development.

Pareesa Ameneh Golnari, Adarsh Kumarappan, Wen Wen, Xiaoyu Liu, Gabriel Ryan, Yuting Sun, Shengyu Fu, Elsie Nallipogu2026-03-10🤖 cs.LG

A Component-Based Survey of Interactions between Large Language Models and Multi-Armed Bandits

This paper presents the first component-level survey systematically reviewing the bidirectional interactions between Large Language Models and Multi-Armed Bandits, highlighting how MAB algorithms optimize LLM workflows while LLMs redefine core MAB components to enhance adaptive decision-making.

Siguang Chen, Chunli Lv, Miao Xie2026-03-10🤖 cs.LG

← Previous Next →