cs.AI papers | Gist.Science

CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR

The paper introduces CLIPO, a method that integrates contrastive learning into policy optimization to generalize Reinforcement Learning with Verifiable Rewards (RLVR) by capturing invariant structures across correct reasoning paths, thereby mitigating hallucinations and improving the generalization and robustness of Large Language Models.

Sijia Cui, Pengyu Cheng, Jiajun Song, Yongbo Gai, Guojun Zhang, Zhechao Yu, Jianhe Lin, Xiaoxi Jiang, Guanjun Jiang | 2026-03-12 | 🤖 cs.LG

Lost in the Middle at Birth: An Exact Theory of Transformer Position Bias

This paper argues that the "Lost in the Middle" phenomenon in large language models is an inherent geometric property of causal decoder architectures present at initialization, caused by the interplay of causal masking and residual connections that creates a structurally hostile "dead zone" in the middle of the context, a bias that persists even after standard pretraining.

Borun D Chowdhury | 2026-03-12 | 🤖 cs.LG

AR-VLA: True Autoregressive Action Expert for Vision-Language-Action Models

The paper proposes AR-VLA, a standalone autoregressive Action Expert that maintains long-lived memory to generate continuous, context-aware action sequences, effectively addressing the frequency mismatch between fast control and slow reasoning while outperforming traditional reactive Vision-Language-Action models in trajectory smoothness and task success.

Yutong Hu, Jan-Nico Zaech, Nikolay Nikolov, Yuanqi Yao, Sombit Dey, Giuliano Albanese, Renaud Detry, Luc Van Gool, Danda Paudel | 2026-03-12 | 🤖 cs.AI

Agentic Control Center for Data Product Optimization

This paper proposes an agentic control center that automates the continuous optimization of data products by employing specialized AI agents to generate supporting assets, monitor quality metrics, and integrate human-in-the-loop oversight to balance automation with trust.

Priyadarshini Tamilselvan, Gregory Bramble, Sola Shirai, Ken C. L. Wong, Faisal Chowdhury, Horst Samulowitz | 2026-03-12 | 🤖 cs.AI

The Generation-Recognition Asymmetry: Six Dimensions of a Fundamental Divide in Formal Language Theory

This paper proposes a unified framework for the generation-recognition asymmetry in formal language theory by identifying six distinct dimensions of divergence, challenging the oversimplified view that generation is inherently easy while parsing is hard, and exploring the implications of these operational differences for fields ranging from compiler design to large language models.

Romain Peyrichou | 2026-03-12 | 💬 cs.CL

Social Knowledge for Cross-Domain User Preference Modeling

This paper demonstrates that projecting users and entities into a joint social embedding space derived from large-scale Twitter data enables effective zero-shot cross-domain preference modeling and personalization, revealing that socio-demographic factors encoded in these embeddings correlate with user interests across diverse topics.

Nir Lotan, Adir Solomon, Ido Guy, Einat Minkov | 2026-03-12 | 🤖 cs.AI

Mashup Learning: Faster Finetuning by Remixing Past Checkpoints

The paper proposes Mashup Learning, a method that accelerates LLM finetuning and improves downstream accuracy by identifying and merging relevant historical checkpoints to serve as an optimized initialization for new tasks, thereby reducing training time by up to 37% compared to training from scratch.

Sofia Maria Lo Cicero Vaina, Artem Chumachenko, Max Ryabinin | 2026-03-12 | 🤖 cs.LG

Compatibility at a Cost: Systematic Discovery and Exploitation of MCP Clause-Compliance Vulnerabilities

This paper introduces the first systematic framework for identifying and exploiting "compatibility-abusing attacks" in the Model Context Protocol (MCP) by utilizing a language-agnostic intermediate representation and LLM-guided static analysis to uncover security vulnerabilities stemming from optional clause implementations across diverse SDKs.

Nanzi Yang, Weiheng Bai, Kangjie Lu | 2026-03-12 | 🤖 cs.AI

MCP-in-SoS: Risk assessment framework for open-source MCP servers

This paper addresses the lack of systematic security evaluation for open-source Model Context Protocol (MCP) servers by applying static code analysis to identify Common Weakness Enumeration (CWE) vulnerabilities, mapping them to MITRE CAPEC attack patterns, and introducing a multi-metric risk-assessment framework to guide secure-by-design development.

Pratyay Kumar, Miguel Antonio Guirao Aguilera, Srikathyayani Srikanteswara, Satyajayant Misra, Abu Saleh Md Tayeen | 2026-03-12 | 🤖 cs.AI

Adaptive Activation Cancellation for Hallucination Mitigation in Large Language Models

This paper introduces Adaptive Activation Cancellation (AAC), a real-time, training-free inference framework that mitigates hallucinations in large language models by identifying and suppressing hallucination-associated neural activations as structured interference, thereby improving factual accuracy across multiple model scales without degrading general capabilities or fluency.

Eric Yocam, Varghese Vaidyan, Gurcan Comert, Paris Kalathas, Yong Wang, Judith L. Mwakalonge | 2026-03-12 | 💬 cs.CL

Delta-K: Boosting Multi-Instance Generation via Cross-Attention Augmentation

Delta-K is a training-free, plug-and-play inference framework that resolves concept omission in multi-instance image generation by extracting and injecting differential keys into the shared cross-attention Key space to establish coherent semantic representations without requiring architectural modifications or additional training.

Zitong Wang, Zijun Shen, Haohao Xu, Zhengjie Luo, Weibin Wu | 2026-03-12 | 🤖 cs.AI

Multilingual AI-Driven Password Strength Estimation with Similarity-Based Detection

This research proposes a novel multilingual password strength meter that leverages AI-generated datasets (specifically ChatGPT) and Jaro similarity-based matching to outperform traditional models like PassGAN, demonstrating that incorporating non-English training data significantly enhances detection accuracy for language-specific vulnerabilities, particularly in the Indian context.

Nikitha M. Palaniappan, Ying He | 2026-03-12 | 🤖 cs.AI

A Diffusion Analysis of Policy Gradient for Stochastic Bandits

This paper establishes that a continuous-time diffusion approximation of policy gradient for stochastic bandits achieves logarithmic regret with a learning rate of $O(\Delta^2/\log(n))$, while demonstrating that a learning rate of at most $O(\Delta^2)$ is necessary to avoid linear regret in specific instances.

Tor Lattimore | 2026-03-12 | 📊 stat

Robotic Ultrasound Makes CBCT Alive

This paper proposes a real-time, deformation-aware framework that integrates robotic ultrasound with a lightweight USCORUNet network to dynamically update static intraoperative CBCT scans, thereby enabling continuous soft-tissue monitoring and navigation refinement without repeated radiation exposure.

Feng Li, Ziyuan Li, Zhongliang Jiang, Nassir Navab, Yuan Bi | 2026-03-12 | 🤖 cs.AI

Rethinking the Harmonic Loss via Non-Euclidean Distance Layers

This paper extends the harmonic loss framework by systematically evaluating various non-Euclidean distance metrics across vision and language models, demonstrating that cosine-based variants offer superior trade-offs in accuracy, interpretability, and sustainability compared to traditional cross-entropy and Euclidean approaches.

Maxwell Miller-Golub, Kamil Faber, Marcin Pietron, Panpan Zheng, Pasquale Minervini, Roberto Corizzo | 2026-03-12 | 🤖 cs.LG

Learning from Radio using Variational Quantum RF Sensing

This paper proposes a variational quantum sensing framework that utilizes a quantum circuit-optimized probe to learn environmental features from radio-frequency signals, demonstrating through ray-tracing simulations that this approach enables robust localization with no deployment channel measurements and superior sensitivity to weak or obstructed signals compared to classical baselines.

Ivana Nikoloska | 2026-03-12 | ⚛️ quant-ph

Intrinsic Numerical Robustness and Fault Tolerance in a Neuromorphic Algorithm for Scientific Computing

This paper demonstrates that a natively spiking neuromorphic algorithm for solving partial differential equations possesses intrinsic fault tolerance, maintaining accuracy even when up to 32% of neurons and 90% of spikes are dropped, with this robustness being tunable via structural hyperparameters.

Bradley H. Theilman, James B. Aimone | 2026-03-12 | 🤖 cs.AI

DUCTILE: Agentic LLM Orchestration of Engineering Analysis in Product Development Practice

This paper introduces DUCTILE, an agentic LLM orchestration framework that separates adaptive decision-making from deterministic tool execution to automate engineering analysis in product development, successfully handling input deviations in an aerospace case study while highlighting the emerging tension between task automation and the creation of exhausting supervisory roles.

Alejandro Pradas-Gomez, Arindam Brahma, Ola Isaksson | 2026-03-12 | 🤖 cs.AI

Joint Imaging-ROI Representation Learning via Cross-View Contrastive Alignment for Brain Disorder Classification

This paper proposes a unified cross-view contrastive learning framework that aligns global imaging and local ROI-graph representations to enhance brain disorder classification performance and reveal their complementary discriminative patterns, as validated on the ADHD-200 and ABIDE datasets.

Wei Liang, Lifang He | 2026-03-12 | 🤖 cs.AI

Taming Score-Based Denoisers in ADMM: A Convergent Plug-and-Play Framework

This paper introduces ADMM-PnP with a novel AC-DC denoiser that resolves manifold mismatch and establishes convergence guarantees for score-based generative models within the ADMM framework, thereby improving solution quality across various inverse problems.

Rajesh Shrestha, Xiao Fu | 2026-03-12 | 🤖 cs.LG
