cs.LG papers | Gist.Science

Boosting deep Reinforcement Learning using pretraining with Logical Options

This paper proposes Hybrid Hierarchical RL (H^2RL), a two-stage framework that leverages logical option-based pretraining to inject symbolic structure into deep reinforcement learning agents, effectively mitigating reward misalignment and improving long-horizon decision-making while outperforming existing neural, symbolic, and neuro-symbolic baselines.

Zihan Ye, Phil Chau, Raban Emunds, Jannis Blüml, Cedric Derstroff, Quentin Delfosse, Oleg Arenz, Kristian Kersting | 2026-03-09 | 🤖 cs.AI

A recipe for scalable attention-based MLIPs: unlocking long-range accuracy with all-to-all node attention

This paper introduces AllScAIP, a scalable, attention-based machine-learning interatomic potential that leverages all-to-all node attention to effectively capture long-range interactions and achieve state-of-the-art accuracy across diverse molecular and material systems without relying on explicit physics-based terms.

Eric Qu, Brandon M. Wood, Aditi S. Krishnapriyan, Zachary W. Ulissi | 2026-03-09 | 🔬 cond-mat.mtrl-sci

SCOPE: Scene-Contextualized Incremental Few-Shot 3D Segmentation

SCOPE introduces a plug-and-play framework for incremental few-shot 3D segmentation that enriches novel class prototypes by retrieving and fusing high-confidence pseudo-instances from unlabelled background regions, thereby achieving state-of-the-art performance on ScanNet and S3DIS while mitigating catastrophic forgetting without retraining the backbone.

Vishal Thengane, Zhaochong An, Tianjin Huang, Son Lam Phung, Abdesselam Bouzerdoum, Lu Yin, Na Zhao, Xiatian Zhu | 2026-03-09 | 🤖 cs.LG

BEVLM: Distilling Semantic Knowledge from LLMs into Bird's-Eye View Representations

The paper proposes BEVLM, a framework that bridges the gap between spatially consistent Bird's-Eye View representations and Large Language Models by distilling semantic knowledge, thereby significantly enhancing both cross-view reasoning accuracy and safety-critical end-to-end driving performance.

Thomas Monninger, Shaoyuan Xie, Qi Alfred Chen, Sihao Ding | 2026-03-09 | 🤖 cs.AI

Linear Multidimensional Regression with Interactive Fixed-Effects

This paper proposes a Neyman-orthogonal estimator for linear multidimensional panel data with interactive fixed-effects that combines factor model methods with a weighted-within transformation to achieve parametric consistency and asymptotic normality, demonstrated through an application to beer demand elasticity.

Hugo Freeman | 2026-03-06 | 💻 cs

Zeroth-Order primal-dual Alternating Projection Gradient Algorithms for Nonconvex Minimax Problems with Coupled linear Constraints

This paper proposes two novel single-loop zeroth-order primal-dual algorithms, ZO-PDAPG and ZO-RMPDPG, that achieve state-of-the-art iteration complexity guarantees for solving nonconvex-(strongly) concave minimax problems with coupled linear constraints under both deterministic and stochastic settings.

Huiling Zhang, Zi Xu, Yuhong Dai | 2026-03-06 | 🔢 math

Data Collaboration Analysis with Orthonormal Basis Selection and Alignment

This paper introduces Orthonormal Data Collaboration (ODC), a method that enforces orthonormal bases to transform the alignment challenge into a closed-form Orthogonal Procrustes problem, thereby achieving orthogonal concordance, significantly reducing computational complexity, and improving accuracy without compromising privacy or communication efficiency.

Keiyu Nosaka, Yamato Suetake, Yuichi Takano + 1 more | 2026-03-06 | 🔢 math
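
For readers unfamiliar with the Orthogonal Procrustes problem mentioned in this summary, the following is a minimal sketch of its standard closed-form solution via the singular value decomposition. It is not the ODC method itself; the function name `orthogonal_procrustes`, the matrix shapes, and the synthetic data are assumptions chosen for illustration, and NumPy is assumed to be installed.

```python
import numpy as np


def orthogonal_procrustes(a, b):
    """Return the orthogonal matrix Q that minimizes ||a @ Q - b||_F.

    Standard closed form: if a.T @ b = U S V^T (SVD), then Q = U V^T.
    """
    u, _, vt = np.linalg.svd(a.T @ b)
    return u @ vt


# Illustrative data (hypothetical): b is an exactly rotated copy of a.
rng = np.random.default_rng(0)
a = rng.standard_normal((50, 4))
q_true, _ = np.linalg.qr(rng.standard_normal((4, 4)))  # random orthogonal matrix
b = a @ q_true

q_hat = orthogonal_procrustes(a, b)
residual = np.linalg.norm(a @ q_hat - b)
```

Because `b` is built as an exact orthogonal transform of a full-rank `a`, the recovered `q_hat` should match `q_true` up to floating-point error, and no iterative optimization is needed.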

Localized Distributional Robustness in Submodular Multi-Task Subset Selection

This paper proposes a novel multi-task subset selection framework that achieves localized distributional robustness by introducing a relative-entropy regularization term, which is proven equivalent to maximizing a monotone composition of submodular functions and can be efficiently solved via greedy algorithms, as validated by experiments on satellite sensor selection and image summarization.

Ege C. Kaya, Abolfazl Hashemi | 2026-03-06 | 🔢 math
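
As background for the greedy approach this summary refers to, here is a minimal sketch of the classic greedy algorithm for maximizing a monotone submodular function under a cardinality constraint. The set-coverage objective, the function names, and the toy data are assumptions for illustration only; the paper's actual objective (a relative-entropy-regularized multi-task composition) is not reproduced here.

```python
def coverage(selected, sets):
    """Number of distinct elements covered by the chosen sets (monotone submodular)."""
    covered = set()
    for i in selected:
        covered |= sets[i]
    return len(covered)


def greedy_select(sets, k):
    """Pick up to k set indices, each time adding the one with the largest marginal gain."""
    selected = []
    for _ in range(min(k, len(sets))):
        base = coverage(selected, sets)
        best, best_gain = None, 0
        for i in range(len(sets)):
            if i in selected:
                continue
            gain = coverage(selected + [i], sets) - base
            if gain > best_gain:
                best, best_gain = i, gain
        if best is None:  # no remaining set adds anything new
            break
        selected.append(best)
    return selected


# Toy instance (hypothetical data).
sets = [{1, 2, 3}, {3, 4}, {4, 5, 6, 7}, {1, 7}]
chosen = greedy_select(sets, 2)
print(chosen, coverage(chosen, sets))  # prints: [2, 0] 7
```

For monotone submodular objectives with a cardinality constraint, this greedy rule is known to achieve at least a (1 - 1/e) fraction of the optimal value, which is one reason greedy solvers are attractive in this setting.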

Distilling Privileged Information for Dubins Traveling Salesman Problems with Neighborhoods

This paper proposes a novel two-phase learning framework that distills privileged information from LKH-generated expert trajectories to enable a non-holonomic vehicle to solve Dubins Traveling Salesman Problems with Neighborhoods approximately 50 times faster than traditional methods while ensuring all task points are visited.

Min Kyu Shin, Su-Jeong Park, Seung-Keol Ryu + 2 more | 2026-03-06 | 💻 cs

HEroBM: a deep equivariant graph neural network for universal backmapping from coarse-grained to all-atom representations

The paper introduces HEroBM, a deep equivariant graph neural network that utilizes a hierarchical, local-principle-based approach to achieve accurate, transferable, and universal backmapping from coarse-grained to all-atom molecular representations across diverse chemical systems.

Daniele Angioletti, Stefano Raniolo, Vittorio Limongelli | 2026-03-06 | 🔬 physics

Learning to Cover: Online Learning and Optimization with Irreversible Decisions

This paper proposes an asymptotically optimal online learning and optimization algorithm for minimizing irreversible facility openings under a coverage target, demonstrating that a policy balancing initial limited exploration with subsequent rapid exploitation achieves sub-linear regret that converges exponentially fast to its infinite-horizon limit.

Alexandre Jacquillat, Michael Lingzhi Li | 2026-03-06 | 🔢 math

Parallel Split Learning with Global Sampling

This paper introduces Parallel Split Learning with Global Sampling (GPSL), a server-driven scheme that fixes the global batch size and uses pooled-level proportions to draw local samples without replacement, thereby eliminating rounding bias, stabilizing optimization under non-IID data, and achieving centralized-like accuracy with negligible overhead.

Mohammad Kohankhaki, Ahmad Ayad, Mahdi Barhoush + 1 more | 2026-03-06 | 💻 cs
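
One way to picture the sampling idea in this summary is the sketch below. It is only a possible reading, not the paper's algorithm: it assumes the server draws a fixed-size global batch without replacement from the pooled index set, so each client's share follows the pooled proportions in expectation and no per-client rounding is needed. The function name `draw_global_batch`, the client names, and the dataset sizes are all hypothetical.

```python
import random
from collections import Counter


def draw_global_batch(client_sizes, batch_size, rng):
    """Draw a fixed-size batch of (client, local_index) pairs without replacement.

    Assumption: sampling uniformly from the pooled data, so per-client counts
    always sum to batch_size exactly.
    """
    pool = [(client, i) for client, n in client_sizes.items() for i in range(n)]
    if batch_size > len(pool):
        raise ValueError("batch_size exceeds the total number of samples")
    return rng.sample(pool, batch_size)


# Hypothetical, deliberately imbalanced client dataset sizes.
client_sizes = {"a": 50, "b": 30, "c": 5}
rng = random.Random(0)
batch = draw_global_batch(client_sizes, 16, rng)
per_client = Counter(client for client, _ in batch)
```

By contrast, rounding `batch_size * n_k / N` independently for each client can make the per-client counts sum to something other than the intended global batch size, which is the kind of rounding bias the summary mentions.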

Towards a Fairer Non-negative Matrix Factorization

This paper proposes a min-max formulation for Non-negative Matrix Factorization (NMF) to mitigate group bias, deriving specific optimization algorithms and demonstrating through experiments that while this approach can improve fairness, it may increase individual error, necessitating application-specific trade-off considerations.

Lara Kassab, Erin George, Deanna Needell + 3 more | 2026-03-06 | 💻 cs

An Experimental Study on Fairness-aware Machine Learning for Credit Scoring Problems

This paper presents a comprehensive experimental study demonstrating that fairness-aware machine learning models achieve a superior balance between predictive accuracy and fairness compared to traditional classification models in the context of credit scoring.

Huyen Giang Thi Thu, Thang Viet Doan, Ha-Bang Ban + 1 more | 2026-03-06 | 💻 cs

Path Planning for Masked Diffusion Model Sampling

This paper introduces Path Planning (P2), a novel inference sampling strategy for Masked Diffusion Models that decomposes generation into planning and denoising stages to enable iterative token refinement, thereby establishing a new expanded evidence lower bound and achieving state-of-the-art performance across diverse domains including protein sequences, RNA, math, storytelling, and code generation.

Fred Zhangzhi Peng, Zachary Bezemek, Sawan Patel + 5 more | 2026-03-06 | 💻 cs

Curse of Dimensionality in Neural Network Optimization

This paper demonstrates that training shallow neural networks with Lipschitz continuous activation functions to approximate smooth target functions suffers from the curse of dimensionality: the population risk decays at a rate bounded by a power of time that depends inversely on the input dimension. The result holds whether the optimization is analyzed via empirical risk, population risk, or 2-Wasserstein gradient flow dynamics.

Sanghoon Na, Haizhao Yang | 2026-03-06 | 🔢 math

Generalization Bounds for Markov Algorithms through Entropy Flow Computations

This paper extends entropy flow-based generalization bounds from specific noisy algorithms to all learning processes governed by time-homogeneous Markov dynamics by introducing a new exact entropy flow formula and linking generalization error to ergodic properties via modified logarithmic Sobolev inequalities.

Benjamin Dupuis, Maxime Haddouche, George Deligiannidis + 1 more | 2026-03-06 | 💻 cs

Sink equilibria and the attractors of learning in games

This paper refutes the conjecture that sink equilibria are in one-to-one correspondence with the attractors of the replicator dynamic by presenting counterexamples based on "local sources," while establishing "pseudoconvexity" as a sufficient condition for this correspondence to hold in two-player games.

Oliver Biggar, Christos Papadimitriou | 2026-03-06 | 💻 cs

FBFL: A Field-Based Coordination Approach for Data Heterogeneity in Federated Learning

This paper proposes Field-Based Federated Learning (FBFL), a novel macroprogramming-driven approach that utilizes distributed spatial leader election and self-organizing hierarchical architectures to effectively address data heterogeneity and centralization bottlenecks, demonstrating superior performance over state-of-the-art methods like FedAvg, FedProx, and Scaffold in non-IID scenarios while maintaining resilience against server failures.

Davide Domini, Gianluca Aguzzi, Lukas Esterle + 1 more | 2026-03-06 | 💻 cs

Double Momentum and Error Feedback for Clipping with Fast Rates and Differential Privacy

This paper introduces Clip21-SGD2M, a novel federated learning algorithm that combines clipping, heavy-ball momentum, and error feedback to achieve both optimal convergence rates for non-convex problems with heterogeneous data and near-optimal differential privacy guarantees without restrictive assumptions.

Rustem Islamov, Samuel Horvath, Aurelien Lucchi + 2 more | 2026-03-06 | 🔢 math
