cs.LG papers | Gist.Science

Censored LLMs as a Natural Testbed for Secret Knowledge Elicitation

This paper leverages Chinese open-weight LLMs that censor politically sensitive topics as a natural testbed to evaluate honesty elicitation and lie detection techniques, finding that methods like few-shot prompting and self-classification effectively increase truthful responses and detect falsehoods, though no approach completely eliminates deception.

Helena Casademunt, Bartosz Cywiński, Khoi Tran + 3 more2026-03-06🤖 cs.AI

Cheap Thrills: Effective Amortized Optimization Using Inexpensive Labels

This paper proposes a novel three-stage framework that combines inexpensive imperfect labels, supervised pretraining, and self-supervised refinement to achieve effective amortized optimization with significantly reduced costs and improved performance across challenging domains.

Khai Nguyen, Petros Ellinas, Anvita Bhagavathula + 1 more2026-03-06🔢 math

POET-X: Memory-efficient LLM Training by Scaling Orthogonal Transformation

The paper introduces POET-X, a memory-efficient and scalable variant of the POET framework that utilizes optimized orthogonal equivalence transformations to enable the stable pretraining of billion-parameter large language models on a single GPU, overcoming the high memory and computational costs of the original implementation.

Zeju Qiu, Lixin Liu, Adrian Weller + 2 more2026-03-06🤖 cs.AI

RoboPocket: Improve Robot Policies Instantly with Your Phone

RoboPocket is a smartphone-based system that enhances robot imitation learning by using AR visual foresight to guide targeted data collection and asynchronous online finetuning, thereby doubling data efficiency and enabling instant policy iteration without requiring physical robot execution.

Junjie Fang, Wendi Chen, Han Xue + 7 more2026-03-06🤖 cs.AI

Recurrent Action Transformer with Memory

The paper proposes the Recurrent Action Transformer with Memory (RATE), a novel architecture that integrates a recurrent memory mechanism into transformers to overcome context length limitations in partially observable environments, demonstrating superior performance in memory-intensive offline reinforcement learning tasks while remaining competitive on standard benchmarks.

Egor Cherepanov, Alexey Staroverov, Alexey K. Kovalev + 1 more2026-03-05🤖 cs.AI

Crystal-GFN: sampling crystals with desirable properties and constraints

This paper introduces Crystal-GFN, a multi-environment, continuous-discrete GFlowNet that sequentially samples crystal structural attributes to efficiently generate diverse, valid materials with specific desirable properties and hard constraints, thereby accelerating the discovery of novel solid-state materials.

Mila AI4Science, :, Alex Hernandez-Garcia + 11 more2026-03-05🤖 cs.LG

GeoTop: Advancing Image Classification with Geometric-Topological Analysis

GeoTop is a mathematically principled framework that unifies Topological Data Analysis and Lipschitz-Killing Curvatures to resolve the diagnostic ambiguity of topologically equivalent structures by integrating robust topological signatures with precise geometric features, thereby achieving superior accuracy and interpretability in image classification tasks such as skin lesion diagnosis.

Mariem Abaach, Ian Morilla2026-03-05🤖 cs.LG

Sample-Optimal Locally Private Hypothesis Selection and the Provable Benefits of Interactivity

This paper presents a sample-optimal, locally differentially private algorithm for hypothesis selection that achieves the information-theoretic lower bound of $\Theta(k/(\alpha^2 \min\{\varepsilon^2, 1\}))$ using only $O(\log \log k)$ rounds of interaction, thereby demonstrating the provable power of interactivity to overcome the $\Omega(k \log k)$ sample complexity barrier inherent in non-interactive approaches.

Alireza F. Pour, Hassan Ashtiani, Shahab Asoodeh2026-03-05🤖 cs.LG

Graph Neural Networks in EEG-based Emotion Recognition: A Survey

This survey provides a comprehensive review and unified framework for Graph Neural Networks in EEG-based emotion recognition, categorizing existing methods by graph construction stages to offer guidance on their unique physiological foundations while outlining open challenges and future directions.

Chenyu Liu, Yuqiu Deng, Yihao Wu + 10 more2026-03-05🤖 cs.LG

List Sample Compression and Uniform Convergence

This paper investigates the applicability of classical generalization principles to list PAC learning, demonstrating that while uniform convergence remains equivalent to learnability, the sample compression conjecture fails as there exist list-learnable classes that cannot be compressed, even with arbitrarily large output lists.

Steve Hanneke, Shay Moran, Tom Waknine2026-03-05🤖 cs.LG

Agnostic Tomography of Stabilizer Product States

This paper introduces the concept of agnostic tomography and presents an efficient algorithm that learns a stabilizer product state approximating an arbitrary quantum state as well as the best possible match within that class, running in polynomial time for constant fidelity.

Sabee Grewal, Vishnu Iyer, William Kretschmer + 1 more2026-03-05⚛️ quant-ph

A Review of Reward Functions for Reinforcement Learning in the context of Autonomous Driving

This paper reviews and categorizes existing reward functions for reinforcement learning in autonomous driving into safety, comfort, progress, and traffic rule compliance, while highlighting their current limitations in standardization and context-awareness to propose future research directions for more robust and conflict-resolving reward designs.

Ahmed Abouelazm, Jonas Michel, J. Marius Zoellner2026-03-05🤖 cs.AI

Tracking solutions of time-varying variational inequalities

This paper extends tracking guarantees for time-varying variational inequalities to non-monotone functions and periodic problems without sublinear solution paths, while also demonstrating that the associated discrete dynamical systems can exhibit either convergence or provably chaotic behavior.

Hédi Hadiji, Sarah Sachs, Cristóbal Guzmán2026-03-05🤖 cs.LG

Black Box Meta-Learning Intrinsic Rewards

This paper introduces a black-box meta-learning approach that optimizes intrinsic rewards to enhance data efficiency and generalization in sparse-reward continuous control environments, demonstrating its effectiveness compared to extrinsic rewards and meta-learned advantage functions.

Octavio Pappalardo, Rodrigo Ramele, Juan Miguel Santos2026-03-05🤖 cs.LG

AuToMATo: An Out-Of-The-Box Persistence-Based Clustering Algorithm

The paper introduces AuToMATo, a novel persistence-based clustering algorithm that combines ToMATo with bootstrapping to provide robust, out-of-the-box performance without parameter tuning, making it particularly effective for topological data analysis applications like Mapper.

Marius Huber, Sara Kalisnik, Patrick Schnider2026-03-05🤖 cs.LG

A computational transition for detecting correlated stochastic block models by low-degree polynomials

This paper establishes that low-degree polynomial tests can distinguish between correlated sparse stochastic block models and independent Erdős-Rényi graphs if and only if the subsampling probability exceeds the minimum of Otter's constant and the Kesten-Stigum threshold, thereby identifying a sharp computational transition for detection and partial recovery.

Guanyi Chen, Jian Ding, Shuyang Gong + 1 more2026-03-05🤖 cs.LG

Diffusion & Adversarial Schrödinger Bridges via Iterative Proportional Markovian Fitting

This paper introduces the Iterative Proportional Markovian Fitting (IPMF) procedure, a unified framework that combines Iterative Markovian Fitting and Iterative Proportional Fitting to solve the Schrödinger Bridge problem with proven convergence and improved stability for applications like unpaired domain translation.

Sergei Kholkin, Grigoriy Ksenofontov, David Li + 6 more2026-03-05🤖 cs.LG

Toward Reasoning on the Boundary: A Mixup-based Approach for Graph Anomaly Detection

The paper proposes ANOMIX, a graph anomaly detection framework that enhances reasoning capabilities for identifying subtle boundary anomalies by synthesizing informative hard negatives through a mixup strategy that interpolates normal and abnormal subgraph representations.

Hwan Kim, Junghoon Kim, Sungsu Lim2026-03-05🤖 cs.AI

Curriculum-enhanced GroupDRO: Challenging the Norm of Avoiding Curriculum Learning in Subpopulation Shift Setups

This paper proposes Curriculum-enhanced Group Distributionally Robust Optimization (CeGDRO), a novel approach that strategically prioritizes hard bias-confirming and easy bias-conflicting samples to initialize model weights in an unbiased vantage point, thereby overcoming the limitations of traditional curriculum learning in subpopulation shift scenarios and achieving state-of-the-art performance across benchmark datasets.

Antonio Barbalau2026-03-05🤖 cs.AI

FlowCLAS: Enhancing Normalizing Flow Via Contrastive Learning For Anomaly Segmentation

The paper introduces FlowCLAS, a hybrid framework that enhances normalizing flows for anomaly segmentation by integrating a contrastive loss with outlier exposure to bridge the performance gap between generative and discriminative methods, achieving state-of-the-art results on multiple robotics benchmarks.

Chang Won Lee, Selina Leveugle, Svetlana Stolpner + 4 more2026-03-05🤖 cs.LG

← Previous Next →