CLIPO: Contrastive Learning in Policy Optimization Generalizes RLVR

The paper introduces CLIPO, a method that integrates contrastive learning into policy optimization to generalize Reinforcement Learning with Verifiable Rewards (RLVR) by capturing invariant structures across correct reasoning paths, thereby mitigating hallucinations and improving the generalization and robustness of Large Language Models.

Sijia Cui, Pengyu Cheng, Jiajun Song, Yongbo Gai, Guojun Zhang, Zhechao Yu, Jianhe Lin, Xiaoxi Jiang, Guanjun Jiang2026-03-12🤖 cs.LG

AR-VLA: True Autoregressive Action Expert for Vision-Language-Action Models

The paper proposes AR-VLA, a standalone autoregressive Action Expert that maintains long-lived memory to generate continuous, context-aware action sequences, effectively addressing the frequency mismatch between fast control and slow reasoning while outperforming traditional reactive Vision-Language-Action models in trajectory smoothness and task success.

Yutong Hu, Jan-Nico Zaech, Nikolay Nikolov, Yuanqi Yao, Sombit Dey, Giuliano Albanese, Renaud Detry, Luc Van Gool, Danda Paudel2026-03-12🤖 cs.AI

The Generation-Recognition Asymmetry: Six Dimensions of a Fundamental Divide in Formal Language Theory

This paper proposes a unified framework for the generation-recognition asymmetry in formal language theory by identifying six distinct dimensions of divergence, challenging the oversimplified view that generation is inherently easy while parsing is hard, and exploring the implications of these operational differences for fields ranging from compiler design to large language models.

Romain Peyrichou2026-03-12💬 cs.CL

Compatibility at a Cost: Systematic Discovery and Exploitation of MCP Clause-Compliance Vulnerabilities

This paper introduces the first systematic framework for identifying and exploiting "compatibility-abusing attacks" in the Model Context Protocol (MCP) by utilizing a language-agnostic intermediate representation and LLM-guided static analysis to uncover security vulnerabilities stemming from optional clause implementations across diverse SDKs.

Nanzi Yang, Weiheng Bai, Kangjie Lu2026-03-12🤖 cs.AI

MCP-in-SoS: Risk assessment framework for open-source MCP servers

This paper addresses the lack of systematic security evaluation for open-source Model Context Protocol (MCP) servers by applying static code analysis to identify Common Weakness Enumeration (CWE) vulnerabilities, mapping them to MITRE CAPEC attack patterns, and introducing a multi-metric risk-assessment framework to guide secure-by-design development.

Pratyay Kumar, Miguel Antonio Guirao Aguilera, Srikathyayani Srikanteswara, Satyajayant Misra, Abu Saleh Md Tayeen2026-03-12🤖 cs.AI

Adaptive Activation Cancellation for Hallucination Mitigation in Large Language Models

This paper introduces Adaptive Activation Cancellation (AAC), a real-time, training-free inference framework that mitigates hallucinations in large language models by identifying and suppressing hallucination-associated neural activations as structured interference, thereby improving factual accuracy across multiple model scales without degrading general capabilities or fluency.

Eric Yocam, Varghese Vaidyan, Gurcan Comert, Paris Kalathas, Yong Wang, Judith L. Mwakalonge2026-03-12💬 cs.CL

Multilingual AI-Driven Password Strength Estimation with Similarity-Based Detection

This research proposes a novel multilingual password strength meter that leverages AI-generated datasets (specifically ChatGPT) and Jaro similarity-based matching to outperform traditional models like PassGAN, demonstrating that incorporating non-English training data significantly enhances detection accuracy for language-specific vulnerabilities, particularly in the Indian context.

Nikitha M. Palaniappan, Ying He2026-03-12🤖 cs.AI

Rethinking the Harmonic Loss via Non-Euclidean Distance Layers

This paper extends the harmonic loss framework by systematically evaluating various non-Euclidean distance metrics across vision and language models, demonstrating that cosine-based variants offer superior trade-offs in accuracy, interpretability, and sustainability compared to traditional cross-entropy and Euclidean approaches.

Maxwell Miller-Golub, Kamil Faber, Marcin Pietron, Panpan Zheng, Pasquale Minervini, Roberto Corizzo2026-03-12🤖 cs.LG

DUCTILE: Agentic LLM Orchestration of Engineering Analysis in Product Development Practice

This paper introduces DUCTILE, an agentic LLM orchestration framework that separates adaptive decision-making from deterministic tool execution to automate engineering analysis in product development, successfully handling input deviations in an aerospace case study while highlighting the emerging tension between task automation and the creation of exhausting supervisory roles.

Alejandro Pradas-Gomez, Arindam Brahma, Ola Isaksson2026-03-12🤖 cs.AI