cs.LG papers | Gist.Science

Estimating condition number with Graph Neural Networks

This paper proposes a fast graph neural network-based method for estimating the condition numbers of sparse matrices with linear complexity relative to the number of non-zero elements, demonstrating significant speedups over traditional Hager-Higham and Lanczos methods.

Erin Carson, Xinye Chen | 2026-03-12 | 🤖 cs.LG

Robust Post-Training for Generative Recommenders: Why Exponential Reward-Weighted SFT Outperforms RLHF

This paper proposes and validates exponential reward-weighted SFT as a robust, fully offline post-training method for generative recommenders that eliminates reward hacking and propensity score requirements while offering theoretical guarantees and a controllable tradeoff between robustness and performance.

Keertana Chidambaram, Sanath Kumar Krishnamurthy, Qiuling Xu, Ko-Jen Hsiao, Moumita Bhattacharya | 2026-03-12 | 🤖 cs.LG

Taming Score-Based Denoisers in ADMM: A Convergent Plug-and-Play Framework

This paper introduces ADMM-PnP with a novel AC-DC denoiser that resolves manifold mismatch and establishes convergence guarantees for score-based generative models within the ADMM framework, thereby improving solution quality across various inverse problems.

Rajesh Shrestha, Xiao Fu | 2026-03-12 | 🤖 cs.LG

GSVD for Geometry-Grounded Dataset Comparison: An Alignment Angle Is All You Need

This paper proposes a geometry-grounded framework for comparing datasets using the Generalized Singular Value Decomposition (GSVD) to derive an interpretable "angle score" that quantifies whether individual samples are better explained by one dataset, the other, or both.

Eduarda de Souza Marques, Arthur Sobrinho Ferreira da Rocha, Joao Paixao, Heudson Mirandola, Daniel Sadoc Menasche | 2026-03-12 | 🤖 cs.LG

Copula-ResLogit: A Deep-Copula Framework for Unobserved Confounding Effects

The paper introduces Copula-ResLogit, a novel deep learning framework that combines ResNet architectures with copula models to detect and mitigate unobserved confounding effects in travel demand analysis, thereby revealing true causal relationships in case studies involving pedestrian stress and travel mode choices.

Kimia Kamal, Bilal Farooq | 2026-03-12 | 🤖 cs.LG

MultiwayPAM: Multiway Partitioning Around Medoids for LLM-as-a-Judge Score Analysis

The paper introduces MultiwayPAM, a novel tensor clustering method designed to simultaneously estimate cluster memberships and medoids for questions, answerers, and evaluators in LLM-as-a-Judge score tensors, thereby addressing computational costs and revealing inherent evaluator biases.

Chihiro Watanabe, Jingyu Sun | 2026-03-12 | 📊 stat

Quantum entanglement provides a competitive advantage in adversarial games

This study demonstrates that quantum entanglement serves as a functional resource in competitive reinforcement learning, enabling hybrid quantum-classical agents trained on the game Pong to consistently outperform separable quantum circuits and match or exceed classical baselines by learning structurally distinct features that better model dynamic agent interactions.

Peiyong Wang, Kieran Hymas, James Quach | 2026-03-12 | ⚛️ quant-ph

Hybrid Self-evolving Structured Memory for GUI Agents

This paper introduces HyMEM, a hybrid self-evolving structured memory system that combines discrete symbolic nodes with continuous embeddings in a graph format to significantly enhance the performance of open-source GUI agents, enabling them to match or surpass strong closed-source models on complex, long-horizon tasks.

Sibo Zhu, Wenyi Wu, Kun Zhou, Stephen Wang, Biwei Huang | 2026-03-12 | 🤖 cs.AI

GaLoRA: Parameter-Efficient Graph-Aware LLMs for Node Classification

GaLoRA is a parameter-efficient framework that integrates structural information into large language models for text-attributed graph node classification, achieving state-of-the-art performance with only 0.24% of the parameters required for full fine-tuning.

Mayur Choudhary, Saptarshi Sengupta, Katerina Potika | 2026-03-12 | 🤖 cs.LG

Regime-aware financial volatility forecasting via in-context learning

This paper introduces a regime-aware in-context learning framework that leverages pretrained large language models to forecast financial volatility by dynamically adapting to nonstationary market conditions through oracle-guided, regime-specific demonstrations without requiring parameter fine-tuning.

Saba Asaad, Shayan Mohajer Hamidi, Ali Bereyhi | 2026-03-12 | 🤖 cs.LG

What do near-optimal learning rate schedules look like?

This paper introduces a search procedure to identify near-optimal learning rate schedule shapes across various workloads, revealing that while warmup and decay are robust features, commonly used schedules are suboptimal and the ideal shape is significantly influenced by hyperparameters like weight decay.

Hiroki Naganuma, Atish Agarwala, Priya Kasimbeg, George E. Dahl | 2026-03-12 | 🤖 cs.LG

How to make the most of your masked language model for protein engineering

This paper introduces a flexible stochastic beam search sampling method for masked language models that optimizes protein properties by evaluating entire-sequence neighborhoods, demonstrating through extensive in silico and in vitro antibody engineering experiments that the choice of sampling strategy is at least as critical as the model itself.

Calvin McCarter, Nick Bhattacharya, Sebastian W. Ober, Hunter Elliott | 2026-03-12 | 🧬 q-bio

Data-Driven Integration Kernels for Interpretable Nonlocal Operator Learning

This paper introduces a data-driven integration kernel framework that enhances the interpretability and efficiency of nonlocal operator learning in climate modeling by separating nonlocal information aggregation via learnable weighting functions from local nonlinear prediction, thereby achieving competitive performance with fewer parameters and clearer physical insights.

Savannah L. Ferretti, Jerry Lin, Sara Shamekh, Jane W. Baldwin, Michael S. Pritchard, Tom Beucler | 2026-03-12 | 🤖 cs.LG

NasoVoce: A Nose-Mounted Low-Audibility Speech Interface for Always-Available Speech Interaction

NasoVoce is a nose-mounted speech interface that fuses acoustic and vibration signals to enable robust, discreet, and always-available voice interaction for AI, effectively overcoming the limitations of existing silent and whispered speech recognition methods.

Jun Rekimoto, Yu Nishimura, Bojian Yang | 2026-03-12 | 🤖 cs.AI

Federated Active Learning Under Extreme Non-IID and Global Class Imbalance

This paper introduces FairFAL, an adaptive federated active learning framework that leverages lightweight prediction discrepancy and prototype-guided pseudo-labeling to dynamically select between global and local query models, effectively addressing the challenges of extreme non-IID data and global class imbalance to achieve superior performance over state-of-the-art methods.

Chen-Chen Zong, Sheng-Jun Huang | 2026-03-12 | 🤖 cs.LG

On The Complexity of Best-Arm Identification in Non-Stationary Linear Bandits

This paper addresses the fixed-budget best-arm identification problem in non-stationary linear bandits by establishing a tighter, arm-set-dependent lower bound on error probability and proposing the $\textsf{Adjacent-BAI}$ algorithm, which utilizes an Adjacent-optimal design to achieve minimax-optimal performance that fully leverages the geometric structure of the arm set.

Leo Maynard-Zhang, Zhihan Xiong, Kevin Jamieson, Maryam Fazel | 2026-03-12 | 📊 stat

HEAL: Hindsight Entropy-Assisted Learning for Reasoning Distillation

HEAL is an RL-free distillation framework that overcomes the "Teacher Ceiling" of standard rejection sampling by synergizing entropy-guided repair, uncertainty filtering, and progressive curriculum learning to transfer reasoning capabilities from Large Reasoning Models to smaller students.

Wenjing Zhang, Jiangze Yan, Jieyun Huang, Yi Shen, Shuming Shi, Ping Chen, Ning Wang, Zhaoxiang Liu, Kai Wang, Shiguo Lian | 2026-03-12 | 🤖 cs.AI

Causal Concept Graphs in LLM Latent Space for Stepwise Reasoning

This paper introduces Causal Concept Graphs (CCG), a framework that combines task-conditioned sparse autoencoders with differentiable structure learning to map causal dependencies between interpretable latent features in LLMs, demonstrating through the Causal Fidelity Score that graph-guided interventions significantly enhance stepwise reasoning performance compared to existing tracing and random baselines.

Md Muntaqim Meherab, Noor Islam S. Mohammad, Faiza Feroz | 2026-03-12 | 🤖 cs.LG

Optimal Expert-Attention Allocation in Mixture-of-Experts: A Scalable Law for Dynamic Model Design

This paper establishes a new scaling law for Mixture-of-Experts models by deriving an explicit power-law formula for the optimal compute allocation ratio between expert and attention layers, enabling more efficient model design under fixed computational budgets.

Junzhuo Li, Peijie Jiang, Changxin Tian, Jia Liu, Zhiqiang Zhang, Xuming Hu | 2026-03-12 | 🤖 cs.LG

Variance-Aware Adaptive Weighting for Diffusion Model Training

This paper proposes a variance-aware adaptive weighting strategy that dynamically adjusts training weights based on loss variance across noise levels to address imbalanced training dynamics in diffusion models, resulting in improved generative performance and training stability on CIFAR datasets.

Nanlong Sun, Lei Shi | 2026-03-12 | 🤖 cs.LG
