cs.LG papers | Gist.Science

RECODE: Reasoning Through Code Generation for Visual Question Answering

The paper introduces RECODE, an agentic framework that enhances visual question answering by reverse-engineering structured visuals into executable code through iterative generation and selection, thereby transforming ambiguous perceptual tasks into verifiable symbolic reasoning problems that significantly outperform existing methods.

Junhong Shen, Mu Cai, Bo Hu, Ameet Talwalkar, David A Ross, Cordelia Schmid, Alireza Fathi2026-03-11🤖 cs.AI

REAP the Experts: Why Pruning Prevails for One-Shot MoE compression

This paper introduces REAP, a novel expert pruning method that outperforms existing merging techniques for compressing Mixture-of-Experts models in generative tasks by leveraging router gate-values and activation norms to minimize reconstruction error, achieving near-lossless compression even at 50% expert reduction.

Mike Lasby, Ivan Lazarevich, Nish Sinnadurai, Sean Lie, Yani Ioannou, Vithursan Thangarasa2026-03-11🤖 cs.AI

RL-100: Performant Robotic Manipulation with Real-World Reinforcement Learning

RL-100 is a unified real-world reinforcement learning framework that combines diffusion visuomotor policies with a clipped PPO objective and consistency distillation to achieve 100% success across 1,000 diverse robotic manipulation trials, matching or surpassing human experts while demonstrating robust zero-shot generalization and continuous deployment in dynamic environments.

Kun Lei, Huanyu Li, Dongjie Yu, Zhenyu Wei, Lingxiao Guo, Zhennan Jiang, Ziyu Wang, Shiyu Liang, Huazhe Xu2026-03-11🤖 cs.AI

Bradley-Terry Policy Optimization for Generative Preference Modeling

This paper introduces Bradley-Terry Policy Optimization (BTPO), a novel framework that derives a consistent Monte Carlo gradient estimator to effectively train large language models with chain-of-thought reasoning on non-verifiable pairwise preference tasks, overcoming the limitations of existing heuristic RL approaches.

Shengyu Feng, Yun He, Shuang Ma, Beibin Li, Yuanhao Xiong, Songlin Li, Karishma Mandyam, Julian Katz-Samuels, Shengjie Bi, Licheng Yu, Hejia Zhang, Karthik Abinav Sankararaman, Han Fang, Yiming Yang, Manaal Faruqui2026-03-11🤖 cs.LG

Personalized Collaborative Learning with Affinity-Based Variance Reduction

This paper introduces AffPCL, a personalized collaborative learning framework that utilizes affinity-based variance reduction to automatically adapt to unknown heterogeneity among agents, achieving sample complexity improvements that seamlessly interpolate between independent learning and linear speedup without requiring prior knowledge of system similarity.

Chenyu Zhang, Navid Azizan2026-03-11🤖 cs.LG

From Spatial to Actions: Grounding Vision-Language-Action Model in Spatial Foundation Priors

FALCON addresses the spatial reasoning limitations of existing 2D-based vision-language-action models by leveraging spatial foundation models to inject rich 3D geometric priors directly into the action head, achieving state-of-the-art performance across diverse simulation and real-world tasks without requiring architectural changes or specialized sensors.

Zhengshen Zhang, Hao Li, Yalun Dai, Zhengbang Zhu, Lei Zhou, Chenchen Liu, Dong Wang, Francis E. H. Tay, Sijin Chen, Ziwei Liu, Yuxiao Liu, Xinghang Li, Pan Zhou2026-03-11🤖 cs.AI

GraphKeeper: Graph Domain-Incremental Learning via Knowledge Disentanglement and Preservation

The paper proposes GraphKeeper, a novel framework for Graph Domain-Incremental Learning that addresses catastrophic forgetting through knowledge disentanglement and deviation-free preservation, achieving state-of-the-art performance across multiple graph domains while remaining compatible with various graph foundation models.

Zihao Guo, Qingyun Sun, Ziwei Zhang, Haonan Yuan, Huiping Zhuang, Xingcheng Fu, Jianxin Li2026-03-11🤖 cs.AI

Structured Matrix Scaling for Multi-Class Calibration

This paper proposes a structured matrix scaling approach for multi-class calibration that leverages theoretical insights from logistic regression, combined with structured regularization and robust optimization, to effectively manage the bias-variance tradeoff and achieve substantial performance gains over existing methods while providing an open-source implementation.

Eugène Berta, David Holzmüller, Michael I. Jordan, Francis Bach2026-03-11🤖 cs.AI

Lightweight Time Series Data Valuation on Time Series Foundation Models via In-Context Finetuning

This paper proposes LTSV, a lightweight and efficient method for valuing time series data in foundation models by leveraging in-context finetuning and temporal block aggregation to overcome the computational bottlenecks and temporal dependency limitations of traditional approaches.

Shunyu Wu, Tianyue Li, Yixuan Leng, Jingyi Suo, Jian Lou, Dan Li, See-Kiong Ng2026-03-11🤖 cs.AI

TSFM in-context learning for time-series classification of bearing-health status

This paper introduces a novel in-context learning approach using Time-Series Foundation Models (TSFMs) to classify bearing health status from vibration data without fine-tuning, enabling scalable, zero-shot maintenance solutions across varying operational conditions.

Michel Tokic, Slobodan Djukanovic, Anja von Beuningen, Cheng Feng2026-03-11🤖 cs.AI

Periodic Asynchrony: An On-Policy Approach for Accelerating LLM Reinforcement Learning

This paper proposes "Periodic Asynchrony," a framework that accelerates LLM reinforcement learning by decoupling inference and training into a provably on-policy asynchronous pipeline, achieving a 3- to 5-fold throughput improvement on NPU platforms without sacrificing accuracy.

Jian Lu2026-03-11🤖 cs.AI

An Interpretable Operator-Learning Model for Electric Field Profile Reconstruction in Discharges Based on the EFISH Method

This paper introduces Decoder-DeepONet (DDON), a novel interpretable operator-learning model that significantly outperforms previous neural network and classical methods in reconstructing electric field profiles from EFISH signals by offering superior accuracy, generalizability, and robustness to incomplete data while identifying optimal sampling windows.

Zhijian Yang, Edwin Setiadi Sugeng, Mhedine Alicherif, Tat Loon Chng2026-03-11🤖 cs.LG

Enhancing Retrieval-Augmented Generation with Entity Linking for Educational Platforms

This paper introduces ELERAG, an enhanced Retrieval-Augmented Generation system that integrates Wikidata-based Entity Linking and a hybrid re-ranking strategy to significantly improve factual accuracy in Italian educational question-answering, particularly outperforming standard methods in domain-specific contexts while demonstrating the importance of domain-adapted strategies.

Francesco Granata, Francesco Poggi, Misael Mongiovì2026-03-11🤖 cs.AI

SA $^{2}$ GFM: Enhancing Robust Graph Foundation Models with Structure-Aware Semantic Augmentation

This paper introduces SA $^{2}$ GFM, a robust Graph Foundation Model framework that enhances domain-adaptive representations and generalization by integrating structure-aware semantic augmentation, an information bottleneck mechanism, and expert adaptive routing to effectively mitigate domain noise, structural perturbations, and adversarial attacks.

Junhua Shi, Qingyun Sun, Haonan Yuan, Xingcheng Fu2026-03-11🤖 cs.LG

ADHint: Adaptive Hints with Difficulty Priors for Reinforcement Learning

ADHint is a novel reinforcement learning framework that enhances reasoning capabilities and generalization by integrating sample difficulty priors to adaptively schedule hint ratios and employing consistency-based gradient modulation with rollout difficulty posteriors to stabilize learning and prevent destructive imitation.

Feng Zhang, Zezhong Tan, Xinhong Ma, Ziqiang Dong, Xi Leng, Jianfei Zhao, Xin Sun, Yang Yang2026-03-11🤖 cs.LG

Directional Textual Inversion for Personalized Text-to-Image Generation

This paper introduces Directional Textual Inversion (DTI), a method that improves personalized text-to-image generation by constraining learned token embeddings to a fixed magnitude and optimizing only their direction on a hypersphere, thereby preventing norm inflation, enhancing text fidelity, and enabling smooth semantic interpolation between concepts.

Kunhee Kim, NaHyeon Park, Kibeom Hong, Hyunjung Shim2026-03-11🤖 cs.LG

Do Spatial Descriptors Improve Multi-DoF Finger Movement Decoding from HD sEMG?

This study demonstrates that while the multichannel linear descriptors-based block field method (MLD-BFM) achieves the highest accuracy in decoding five finger-joint degrees of freedom from HD sEMG, its performance is not statistically superior to conventional time-domain features, though it significantly outperforms dimensionality reduction methods, highlighting the critical importance of preserving spatial resolution in high-density recordings.

Ricardo Gonçalves Molinari, Leonardo Abdala Elias2026-03-11🤖 cs.LG

EMFusion: Conditional Diffusion Framework for Trustworthy Frequency Selective EMF Forecasting in Wireless Networks

This paper introduces EMFusion, a conditional multivariate diffusion-based framework that leverages a residual U-Net with cross-attention and imputation-based sampling to provide accurate, uncertainty-quantified, frequency-selective electromagnetic field forecasts for wireless network planning, significantly outperforming existing baseline models.

Zijiang Yan, Yixiang Huang, Jianhua Pei, Hina Tabassum, Luca Chiaraviglio2026-03-11🤖 cs.AI

Enhancing Reconstruction Capability of Wavelet Transform Amorphous Radial Distribution Function via Machine Learning Assisted Parameter Tuning

This study introduces the enhanced WT-RDF+ framework, which leverages machine learning-assisted parameter tuning to overcome amplitude accuracy limitations in reconstructing Radial Distribution Functions for amorphous Ge-Se and Ag-Ge-Se systems, thereby outperforming standard ML benchmarks even with limited training data.

Deriyan Senjaya, Stephen Ekaputra Limantoro2026-03-11🔬 cond-mat.mtrl-sci

The Affine Divergence: Aligning Activation Updates Beyond Normalisation

This paper identifies a systematic mismatch in activation updates during gradient descent, proposing that correcting this issue through first-principles derivation not only reinterprets the role of normalization but also yields a new, empirically superior alternative called "PatchNorm" that challenges conventional affine-based approaches.

George Bird2026-03-11🤖 cs.LG

← Previous Next →

cs.LG