cs.LG papers | Gist.Science

Hallucination is a Consequence of Space-Optimality: A Rate-Distortion Theorem for Membership Testing

This paper establishes a rate-distortion theorem demonstrating that hallucinations in large language models are an inevitable consequence of information-theoretically optimal memory compression when storing sparse facts, forcing the model to confidently assign high scores to non-facts rather than abstain.

Anxin Guo, Jingwei Li | 2026-03-12 | 💬 cs.CL

Grounding Generated Videos in Feasible Plans via World Models

The paper proposes GVP-WM, a planning method that leverages learned action-conditioned world models to ground zero-shot video-generated plans into dynamically feasible action sequences by optimizing latent trajectories that satisfy physical constraints while preserving semantic alignment with the original video.

Christos Ziakas, Amir Bar, Alessandra Russo | 2026-03-12 | 🤖 cs.LG

Expert-Data Alignment Governs Generation Quality in Decentralized Diffusion Models

This paper challenges the assumption that numerical stability governs generation quality in Decentralized Diffusion Models, demonstrating instead that aligning routing decisions with the experts whose training data best matches the current denoising state is the critical factor for achieving high-quality outputs.

Marcos Villagra, Bidhan Roy, Raihan Seraj, Zhiying Jiang | 2026-03-12 | 🤖 cs.LG

A Bandit-Based Approach to Educational Recommender Systems: Contextual Thompson Sampling for Learner Skill Gain Optimization

This paper proposes a Contextual Thompson Sampling approach for educational recommender systems that leverages learner data to generate personalized exercise sequences, effectively optimizing skill gain and enabling scalable, adaptive instruction in digital learning environments.

Lukas De Kerpel, Arthur Thuy, Dries F. Benoit | 2026-03-12 | 📊 stat

Universality of General Spiked Tensor Models

This paper establishes the universality of high-dimensional spectral behavior and statistical limits for asymmetric rank-one spiked tensor models with non-Gaussian noise, demonstrating that the maximum-likelihood estimator's performance matches the Gaussian case under finite fourth-moment assumptions.

Yanjin Xiang, Zhihua Zhang | 2026-03-12 | 📊 stat

BLITZRANK: Principled Zero-shot Ranking Agents with Tournament Graphs

The paper introduces BLITZRANK, a principled zero-shot ranking framework that leverages tournament graphs to extract maximal information from $k$-wise comparisons, achieving superior accuracy with significantly reduced token costs compared to existing methods.

Sheshansh Agrawal, Thien Hang Nguyen, Douwe Kiela | 2026-03-12 | 🤖 cs.LG

Long Chain-of-Thought Compression via Fine-Grained Group Policy Optimization

This paper introduces Fine-grained Group Policy Optimization (FGO), a reinforcement learning algorithm that effectively compresses verbose Chain-of-Thought reasoning in Large Language Models while simultaneously addressing the data inefficiency and entropy collapse limitations of Group Relative Policy Optimization (GRPO).

Xinchen Han, Hossam Afifi, Michel Marot, Xilu Wang, Lu Yin | 2026-03-12 | 🤖 cs.LG

GOT-JEPA: Generic Object Tracking with Model Adaptation and Occlusion Handling using Joint-Embedding Predictive Architecture

The paper proposes GOT-JEPA, a model-predictive pretraining framework that learns to predict robust tracking models from corrupted observations to improve generalization, and introduces OccuSolver to enhance occlusion handling through iterative, object-aware visibility estimation.

Shih-Fang Chen, Jun-Cheng Chen, I-Hong Jhuo, Yen-Yu Lin | 2026-03-12 | 🤖 cs.AI

LexiSafe: Offline Safe Reinforcement Learning with Lexicographic Safety-Reward Hierarchy

The paper proposes LexiSafe, a theoretically grounded offline safe reinforcement learning framework that employs lexicographic prioritization to strictly enforce safety constraints while optimizing task performance, offering improved guarantees and empirical results over existing methods for safety-critical cyber-physical systems.

Hsin-Jung Yang, Zhanhong Jiang, Prajwal Koirala, Qisai Liu, Cody Fleming, Soumik Sarkar | 2026-03-12 | ⚡ eess

ZACH-ViT: Regime-Dependent Inductive Bias in Compact Vision Transformers for Medical Imaging

The paper introduces ZACH-ViT, a compact Vision Transformer that eliminates positional embeddings and the [CLS] token to achieve permutation-invariant processing, demonstrating that this architecture is particularly effective for few-shot medical imaging tasks with weak spatial priors while remaining competitive on datasets with stronger anatomical structures.

Athanasios Angelakis | 2026-03-12 | ⚡ eess

Benchmarking Graph Neural Networks in Solving Hard Constraint Satisfaction Problems

This paper introduces new hard benchmarks for Constraint Satisfaction Problems derived from statistical physics to demonstrate that, contrary to some claims, classical heuristics currently outperform Graph Neural Networks on truly difficult instances.

Geri Skenderi, Lorenzo Buffoni, Francesco D'Amico, David Machado, Raffaele Marino, Matteo Negri, Federico Ricci-Tersenghi, Carlo Lucibello, Maria Chiara Angelini | 2026-03-12 | 🔬 cond-mat

Many AI Analysts, One Dataset: Navigating the Agentic Data Science Multiverse

This paper demonstrates that fully autonomous AI analysts can cheaply replicate the analytic diversity and conflicting conclusions observed in human many-analyst studies, revealing that empirical results are highly sensitive to analytic choices and prompting a new transparency norm requiring multiverse-style reporting and full prompt disclosure for AI-generated science.

Martin Bertran, Riccardo Fogliato, Zhiwei Steven Wu | 2026-03-12 | 🤖 cs.AI

Active Value Querying to Minimize Additive Error in Subadditive Set Function Learning

This paper addresses the challenge of approximating unknown subadditive set functions by developing active querying methods to minimize additive error through the strategic disclosure of subset values, thereby reducing the distance between minimal and maximal completions in both offline and online settings.

Martin Černý, David Sychrovský, Filip Úradník, Jakub Černý | 2026-03-12 | 🤖 cs.LG

How Large Language Models Get Stuck: Early structure with persistent errors

This paper investigates how Large Language Models trained on the BabyLM dataset often fail to learn specific grammatical rules because early erroneous biases, driven by misleading bigram statistics, become entrenched and persist throughout training, hindering efficient learning.

Alokesh Manna, William Snyder, Whitney Tabor | 2026-03-12 | 💬 cs.CL

CARE: Towards Clinical Accountability in Multi-Modal Medical Reasoning with an Evidence-Grounded Agentic Framework

The paper introduces CARE, an evidence-grounded agentic framework that enhances clinical accountability and reasoning accuracy in multi-modal medical AI by decomposing tasks into specialized modules for entity proposal, pixel-level localization, and evidence-based reasoning, thereby outperforming state-of-the-art models on medical VQA benchmarks.

Yuexi Du, Jinglu Wang, Shujie Liu, Nicha C. Dvornek, Yan Lu | 2026-03-12 | 🤖 cs.AI

CFG-Ctrl: Control-Based Classifier-Free Diffusion Guidance

This paper introduces CFG-Ctrl, a unified framework that reinterprets Classifier-Free Guidance as a control problem and proposes Sliding Mode Control CFG (SMC-CFG) to overcome the instability and overshooting of linear methods by enforcing nonlinear feedback for improved semantic alignment and robustness across various guidance scales.

Hanyang Wang, Yiyang Liu, Jiawei Chi, Fangfu Liu, Ran Xue, Yueqi Duan | 2026-03-12 | 🤖 cs.LG

One Model, Many Skills: Parameter-Efficient Fine-Tuning for Multitask Code Analysis

This paper presents the first comprehensive evaluation of parameter-efficient fine-tuning (PEFT) for multitask code analysis, demonstrating that a single shared PEFT module can match or surpass full fine-tuning performance while significantly reducing computational and storage costs, provided that tasks are strategically grouped based on factors like complementarity and stability.

Amal Akli, Maxime Cordy, Mike Papadakis, Yves Le Traon | 2026-03-12 | 💻 cs

Explainable LLM Unlearning Through Reasoning

This paper proposes Targeted Reasoning Unlearning (TRU), a novel framework that utilizes a reasoning-based unlearning target to guide models in precisely removing specific undesirable knowledge while preserving general capabilities and enhancing robustness against attacks.

Junfeng Liao, Qizhou Wang, Shanshan Ye, Xin Yu, Ling Chen, Zhen Fang | 2026-03-12 | 🤖 cs.LG

MoE-SpAc: Efficient MoE Inference Based on Speculative Activation Utility in Heterogeneous Edge Scenarios

MoE-SpAc is an efficient inference framework for Mixture-of-Experts models on heterogeneous edge devices that repurposes speculative decoding as a predictive sensor for memory management, achieving significant throughput improvements through dynamic workload balancing and asynchronous execution.

Shuhuai Li, Jianghao Lin, Dongdong Ge, Yinyu Ye | 2026-03-12 | 🤖 cs.LG

Evolving Demonstration Optimization for Chain-of-Thought Feature Transformation

This paper proposes a closed-loop framework that optimizes Large Language Model-driven Feature Transformation by evolving and selecting diverse, task-verified transformation trajectories via chain-of-thought reasoning, thereby outperforming existing methods in generating effective feature operators for downstream predictive tasks.

Xinyuan Wang, Kunpeng Liu, Arun Vignesh Malarkkan, Yanjie Fu | 2026-03-12 | 💬 cs.CL
