Boosting deep Reinforcement Learning using pretraining with Logical Options

This paper proposes Hybrid Hierarchical RL (H^2RL), a two-stage framework that leverages logical option-based pretraining to inject symbolic structure into deep reinforcement learning agents, effectively mitigating reward misalignment and improving long-horizon decision-making while outperforming existing neural, symbolic, and neuro-symbolic baselines.

Zihan Ye, Phil Chau, Raban Emunds, Jannis Blüml, Cedric Derstroff, Quentin Delfosse, Oleg Arenz, Kristian Kersting2026-03-09🤖 cs.AI

A recipe for scalable attention-based MLIPs: unlocking long-range accuracy with all-to-all node attention

This paper introduces AllScAIP, a scalable, attention-based machine-learning interatomic potential that leverages all-to-all node attention to effectively capture long-range interactions and achieve state-of-the-art accuracy across diverse molecular and material systems without relying on explicit physics-based terms.

Eric Qu, Brandon M. Wood, Aditi S. Krishnapriyan, Zachary W. Ulissi2026-03-09🔬 cond-mat.mtrl-sci

SCOPE: Scene-Contextualized Incremental Few-Shot 3D Segmentation

SCOPE introduces a plug-and-play framework for incremental few-shot 3D segmentation that enriches novel class prototypes by retrieving and fusing high-confidence pseudo-instances from unlabelled background regions, thereby achieving state-of-the-art performance on ScanNet and S3DIS while mitigating catastrophic forgetting without retraining the backbone.

Vishal Thengane, Zhaochong An, Tianjin Huang, Son Lam Phung, Abdesselam Bouzerdoum, Lu Yin, Na Zhao, Xiatian Zhu2026-03-09🤖 cs.LG

Data Collaboration Analysis with Orthonormal Basis Selection and Alignment

This paper introduces Orthonormal Data Collaboration (ODC), a method that enforces orthonormal bases to transform the alignment challenge into a closed-form Orthogonal Procrustes problem, thereby achieving orthogonal concordance, significantly reducing computational complexity, and improving accuracy without compromising privacy or communication efficiency.

Keiyu Nosaka, Yamato Suetake, Yuichi Takano + 1 more2026-03-06🔢 math

Localized Distributional Robustness in Submodular Multi-Task Subset Selection

This paper proposes a novel multi-task subset selection framework that achieves localized distributional robustness by introducing a relative-entropy regularization term, which is proven equivalent to maximizing a monotone composition of submodular functions and can be efficiently solved via greedy algorithms, as validated by experiments on satellite sensor selection and image summarization.

Ege C. Kaya, Abolfazl Hashemi2026-03-06🔢 math

Path Planning for Masked Diffusion Model Sampling

This paper introduces Path Planning (P2), a novel inference sampling strategy for Masked Diffusion Models that decomposes generation into planning and denoising stages to enable iterative token refinement, thereby establishing a new expanded evidence lower bound and achieving state-of-the-art performance across diverse domains including protein sequences, RNA, math, storytelling, and code generation.

Fred Zhangzhi Peng, Zachary Bezemek, Sawan Patel + 5 more2026-03-06💻 cs

Curse of Dimensionality in Neural Network Optimization

This paper demonstrates that training shallow neural networks with Lipschitz continuous activation functions to approximate smooth target functions suffers from the curse of dimensionality, as the population risk decays at a rate bounded by a power of time that depends inversely on the input dimension, regardless of whether the optimization is analyzed via empirical or population risk or through 2-Wasserstein gradient flow dynamics.

Sanghoon Na, Haizhao Yang2026-03-06🔢 math

FBFL: A Field-Based Coordination Approach for Data Heterogeneity in Federated Learning

This paper proposes Field-Based Federated Learning (FBFL), a novel macroprogramming-driven approach that utilizes distributed spatial leader election and self-organizing hierarchical architectures to effectively address data heterogeneity and centralization bottlenecks, demonstrating superior performance over state-of-the-art methods like FedAvg, FedProx, and Scaffold in non-IID scenarios while maintaining resilience against server failures.

Davide Domini, Gianluca Aguzzi, Lukas Esterle + 1 more2026-03-06💻 cs