Pretraining in Actor-Critic Reinforcement Learning for Robot Locomotion

This paper proposes a pretraining-finetuning paradigm for robot locomotion that leverages a task-agnostic exploration strategy to train a Proprioceptive Inverse Dynamics Model (PIDM), which is then used to warm-start actor-critic algorithms like PPO, resulting in significant improvements in sample efficiency and task performance across diverse robot embodiments.

Jiale Fan, Andrei Cramariuc, Tifanny Portela, Marco HutterTue, 10 Ma🤖 cs.LG

Reinforcing Numerical Reasoning in LLMs for Tabular Prediction via Structural Priors

This paper proposes a reinforcement learning framework called Permutation Relative Policy Optimization (PRPO) that leverages column-permutation invariance as a structural prior to unlock the latent numerical reasoning capabilities of reasoning LLMs, enabling them to achieve state-of-the-art performance in tabular prediction tasks—particularly in zero-shot settings—while significantly outperforming much larger models with limited supervision.

Pengxiang Cai, Zihao Gao, Wanchen Lian, Jintai ChenTue, 10 Ma🤖 cs.LG

SwiftTS: A Swift Selection Framework for Time Series Pre-trained Models via Multi-task Meta-Learning

SwiftTS is a swift selection framework for time series pre-trained models that leverages multi-task meta-learning and a lightweight dual-encoder architecture to efficiently predict the best model for unseen datasets without expensive fine-tuning, achieving state-of-the-art performance across diverse horizons and datasets.

Tengxue Zhang, Biao Ouyang, Yang Shu, Xinyang Chen, Chenjuan Guo, Bin YangTue, 10 Ma🤖 cs.LG

Balancing Interpretability and Performance in Motor Imagery EEG Classification: A Comparative Study of ANFIS-FBCSP-PSO and EEGNet

This study compares a transparent ANFIS-FBCSP-PSO model with the deep-learning benchmark EEGNet on motor imagery EEG data, revealing that the fuzzy-neural approach offers superior within-subject performance and interpretability while EEGNet demonstrates stronger cross-subject generalization, thereby providing practical guidance for selecting BCI systems based on specific design priorities.

Farjana Aktar, Mohd Ruhul Ameen, Akif Islam, Md Ekramul HamidTue, 10 Ma🤖 cs.LG

Towards Efficient Federated Learning of Networked Mixture-of-Experts for Mobile Edge Computing

This paper proposes a Networked Mixture-of-Experts (NMoE) system and a hybrid federated learning framework that enable collaborative inference and efficient, privacy-preserving training of large AI models on resource-constrained mobile edge devices by leveraging neighbor expertise and balancing personalization with generalization.

Song Gao, Songyang Zhang, Shusen Jing, Shuai Zhang, Xiangwei Zhou, Yue Wang, Zhipeng CaiTue, 10 Ma🤖 cs.LG

FATE: A Formal Benchmark Series for Frontier Algebra of Multiple Difficulty Levels

The paper introduces FATE, a new formal algebra benchmark series spanning from undergraduate exercises to PhD-level research problems, which reveals that current state-of-the-art LLMs struggle significantly with formalizing advanced mathematical reasoning, achieving near-zero accuracy on the most difficult tasks despite stronger natural-language performance.

Jiedong Jiang, Wanyi He, Yuefeng Wang, Guoxiong Gao, Yongle Hu, Jingting Wang, Nailin Guan, Peihao Wu, Chunbo Dai, Liang Xiao, Bin DongTue, 10 Ma🤖 cs.LG

Jr. AI Scientist and Its Risk Report: Autonomous Scientific Exploration from a Baseline Paper

This paper introduces "Jr. AI Scientist," an autonomous system that mimics a novice researcher's workflow to generate novel, scientifically valuable papers building on real academic works, while simultaneously evaluating its performance through rigorous automated and human assessments to identify both its capabilities and the significant risks and limitations of current AI-driven scientific exploration.

Atsuyuki Miyai, Mashiro Toyooka, Takashi Otonari, Zaiying Zhao, Kiyoharu AizawaTue, 10 Ma🤖 cs.LG

Distributionally Robust Self Paced Curriculum Reinforcement Learning

The paper proposes Distributionally Robust Self-Paced Curriculum Reinforcement Learning (DR-SPCRL), a method that adaptively schedules the robustness budget as a continuous curriculum to overcome the performance-robustness trade-off inherent in fixed-budget approaches, thereby achieving superior stability and an 11.8% improvement in episodic return under perturbations compared to existing strategies.

Anirudh Satheesh, Keenan Powell, Vaneet AggarwalTue, 10 Ma🤖 cs.LG

Adaptive Multi-view Graph Contrastive Learning via Fractional-order Neural Diffusion Networks

This paper introduces an augmentation-free multi-view graph contrastive learning framework that leverages learnable fractional-order neural diffusion networks to automatically generate a continuous spectrum of complementary views by adapting the diffusion scale to the data, thereby outperforming state-of-the-art methods in producing robust and expressive embeddings.

Yanan Zhao, Feng Ji, Jingyang Dai, Jiaze Ma, Keyue Jiang, Kai Zhao, Wee Peng TayTue, 10 Ma🤖 cs.LG

Angular Gradient Sign Method: Uncovering Vulnerabilities in Hyperbolic Networks

This paper introduces the Angular Gradient Sign Method, a novel adversarial attack for hyperbolic networks that leverages the geometric decomposition of gradients to apply perturbations solely along angular (semantic) directions, thereby achieving higher fooling rates and revealing unique vulnerabilities in hierarchical embeddings compared to conventional Euclidean-based methods.

Minsoo Jo, Dongyoon Yang, Taesup KimTue, 10 Ma🤖 cs.LG