cs.LG papers | Gist.Science

$P^2$ GNN: Two Prototype Sets to boost GNN Performance

The paper introduces $P^2$ GNN, a plug-and-play technique that leverages two sets of prototypes to enrich global context and denoise local neighborhoods, thereby significantly boosting the performance of Message Passing Graph Neural Networks across diverse node recommendation and classification tasks.

Arihant Jain, Gundeep Arora, Anoop Saladi, Chaosheng Dong2026-03-11🤖 cs.LG

The Reasoning Trap -- Logical Reasoning as a Mechanistic Pathway to Situational Awareness

This paper argues that advancements in logical reasoning for large language models inadvertently create a mechanistic pathway to dangerous situational awareness and strategic deception, necessitating new safety frameworks like the RAISE model to mitigate these emergent risks.

Subramanyam Sahoo, Aman Chadha, Vinija Jain, Divya Chaudhary2026-03-11🤖 cs.AI

The Radio-Frequency Transformer for Signal Separation

This paper presents a fully data-driven signal separator using a modified SoundStream tokenizer and a transformer trained with cross-entropy loss, which achieves significant improvements in separating radio-frequency signals from non-Gaussian interference compared to conventional methods.

Egor Lifar, Semyon Savkin, Rachana Madhukara, Tejas Jayashankar, Yury Polyanskiy, Gregory W. Wornell2026-03-11🤖 cs.LG

Emotion is Not Just a Label: Latent Emotional Factors in LLM Processing

This paper investigates emotion as a latent factor influencing LLM attention and reasoning, introducing the AURA-QA dataset and an emotional regularization framework that demonstrably improves reading comprehension performance across both emotionally varying and standard benchmarks.

Benjamin Reichman, Adar Avasian, Samuel Webster, Larry Heck2026-03-11🤖 cs.AI

MM-Zero: Self-Evolving Multi-Model Vision Language Models From Zero Data

MM-Zero is the first RL-based framework to enable Vision Language Models to self-evolve from zero data by employing a multi-role system (Proposer, Coder, and Solver) trained with Group Relative Policy Optimization to generate visual concepts, render them via code, and solve multimodal reasoning tasks without any seed images.

Zongxia Li, Hongyang Du, Chengsong Huang, Xiyang Wu, Lantao Yu, Yicheng He, Jing Xie, Xiaomin Wu, Zhichao Liu, Jiarui Zhang, Fuxiao Liu2026-03-11🤖 cs.LG

Strategically Robust Multi-Agent Reinforcement Learning with Linear Function Approximation

This paper proposes \texttt{RQRE-OVI}, an optimistic value iteration algorithm that computes the unique and smooth Risk-Sensitive Quantal Response Equilibrium (RQRE) in general-sum Markov games with linear function approximation, offering a principled trade-off between performance and robustness that outperforms traditional Nash equilibrium approaches in both theoretical guarantees and empirical stability.

Jake Gonzales, Max Horwitz, Eric Mazumdar, Lillian J. Ratliff2026-03-11🤖 cs.LG

Beyond Test-Time Training: Learning to Reason via Hardware-Efficient Optimal Control

This paper introduces Test-Time Control (TTC), a hardware-efficient neural layer that embeds finite-horizon optimal control planning directly into pretrained LLMs via a symplectic LQR solver, significantly boosting mathematical reasoning performance without requiring test-time training.

Peihao Wang, Shan Yang, Xijun Wang, Tesi Xiao, Xin Liu, Changlong Yu, Yu Lou, Pan Li, Zhangyang Wang, Ming Lin, René Vidal2026-03-11🤖 cs.LG

A Generative Sampler for distributions with possible discrete parameter based on Reversibility

This paper proposes a unified, target-gradient-free generative sampling framework that enforces time-reversibility constraints via Maximum Mean Discrepancy minimization between forward and backward Markov trajectories, enabling efficient sampling from complex continuous, discrete, and hybrid distributions using only energy evaluations.

Lei Li, Zhen Wang, Lishuo Zhang2026-03-11🤖 cs.LG

Efficient Reasoning at Fixed Test-Time Cost via Length-Aware Attention Priors and Gain-Aware Training

This paper proposes a training-only framework combining a length-aware attention prior (RPA) and a gain-aware controller (Guardian) to enhance reasoning efficiency and reduce validation loss in Transformers without increasing test-time computational costs or latency.

Rian Atri2026-03-11🤖 cs.LG

Transductive Generalization via Optimal Transport and Its Application to Graph Node Classification

This paper introduces efficient, representation-based transductive generalization bounds for graph node classification using optimal transport and Wasserstein distances, which not only correlate strongly with empirical performance but also explain the non-monotonic relationship between GNN depth and generalization error through the analysis of distributional transformations.

MoonJeong Park, Seungbeom Lee, Kyungmin Kim, Jaeseung Heo, Seunghyuk Cho, Shouheng Li, Sangdon Park, Dongwoo Kim2026-03-11🤖 cs.LG

DendroNN: Dendrocentric Neural Networks for Energy-Efficient Classification of Event-Based Data

This paper introduces DendroNN, a novel dendrocentric neural network that leverages non-differentiable sequence detection and a rewiring phase to efficiently classify event-based spatiotemporal data, achieving competitive accuracy with up to 4x higher energy efficiency than state-of-the-art neuromorphic hardware through a dedicated asynchronous digital architecture.

Jann Krausse, Zhe Su, Kyrus Mama, Maryada, Klaus Knobloch, Giacomo Indiveri, Jürgen Becker2026-03-11🤖 cs.AI

On Regret Bounds of Thompson Sampling for Bayesian Optimization

This paper advances the theoretical understanding of Gaussian process Thompson sampling (GP-TS) in Bayesian optimization by establishing a regret lower bound, deriving improved upper bounds for cumulative regret and the second moment of regret, and providing expected lenient regret bounds that address gaps in existing analyses compared to GP-UCB.

Shion Takeno, Shogo Iwazaki2026-03-11🤖 cs.LG

Proxy-Guided Measurement Calibration

This paper proposes a proxy-guided framework using variational autoencoders and causal modeling to identify and correct systematic measurement errors in aggregate outcome variables by leveraging proxy variables that depend on true outcomes but are independent of bias mechanisms.

Saketh Vishnubhatla, Shu Wan, Andre Harrison, Adrienne Raglin, Huan Liu2026-03-11🤖 cs.LG

A Gaussian Comparison Theorem for Training Dynamics in Machine Learning

This paper establishes a non-asymptotic Gaussian comparison theorem based on Gordon's theorem to rigorously validate dynamic mean-field expressions and derive refined iterative approximations for the training dynamics of machine learning models, such as perceptrons, under Gaussian mixture data.

Ashkan Panahi2026-03-11🤖 cs.LG

CLoE: Expert Consistency Learning for Missing Modality Segmentation

The paper proposes CLoE, a consistency-driven framework that enhances missing-modality medical image segmentation by enforcing decision-level agreement among modality experts on both global and clinically critical foreground regions, thereby improving robustness and generalization compared to state-of-the-art methods.

Xinyu Tong, Meihua Zhou, Bowu Fan, Haitao Li2026-03-11🤖 cs.AI

Reward-Zero: Language Embedding Driven Implicit Reward Mechanisms for Reinforcement Learning

The paper introduces Reward-Zero, a general-purpose implicit reward mechanism that leverages language embeddings to transform natural-language task descriptions into dense, semantically grounded progress signals, thereby accelerating training, stabilizing learning, and improving generalization for reinforcement learning agents without requiring task-specific reward engineering.

Heng Zhang, Haddy Alchaer, Arash Ajoudani, Yu She2026-03-11🤖 cs.LG

TA-GGAD: Testing-time Adaptive Graph Model for Generalist Graph Anomaly Detection

This paper introduces TA-GGAD, a testing-time adaptive graph foundation model that addresses the cross-domain generalization challenge in anomaly detection by identifying and modeling the "Anomaly Disassortativity" issue, thereby achieving state-of-the-art performance across diverse real-world graphs with a single training phase.

Xiong Zhang, Hong Peng, Changlong Fu, Xin Jin, Yun Yang, Cheng Xie2026-03-11🤖 cs.AI

Interactive 3D visualization of surface roughness predictions in additive manufacturing: A data-driven framework

This paper presents a data-driven framework that combines a multilayer perceptron trained on experimental data augmented by a conditional generative adversarial network with an interactive 3D web interface to predict and visualize surface roughness in material extrusion additive manufacturing, enabling optimized process planning and part orientation.

Engin Deniz Erkan, Elif Surer, Ulas Yaman2026-03-11🤖 cs.LG

Democratising Clinical AI through Dataset Condensation for Classical Clinical Models

This paper introduces a differentially private, zero-order optimization framework that extends dataset condensation to non-differentiable clinical models, enabling the creation of compact, privacy-preserving synthetic datasets that facilitate the democratization of clinical data sharing without compromising model utility.

Anshul Thakur, Soheila Molaei, Pafue Christy Nganjimi, Joshua Fieggen, Andrew A. S. Soltan, Danielle Belgrave, Lei Clifton, David A. Clifton2026-03-11🤖 cs.AI

From Representation to Clusters: A Contrastive Learning Approach for Attributed Hypergraph Clustering

The paper proposes CAHC, an end-to-end contrastive learning framework for attributed hypergraph clustering that integrates node-level and hyperedge-level objectives with joint embedding and cluster assignment optimization to outperform existing two-stage methods.

Li Ni, Shuaikang Zeng, Lin Mu, Longlong Lin2026-03-11🤖 cs.LG

← Previous Next →

cs.LG

P2P^2P2GNN: Two Prototype Sets to boost GNN Performance