Compatibility at a Cost: Systematic Discovery and Exploitation of MCP Clause-Compliance Vulnerabilities

This paper introduces the first systematic framework for identifying and exploiting "compatibility-abusing attacks" in the Model Context Protocol (MCP) by utilizing a language-agnostic intermediate representation and LLM-guided static analysis to uncover security vulnerabilities stemming from optional clause implementations across diverse SDKs.

Nanzi Yang, Weiheng Bai, Kangjie Lu2026-03-12🤖 cs.AI

MCP-in-SoS: Risk assessment framework for open-source MCP servers

This paper addresses the lack of systematic security evaluation for open-source Model Context Protocol (MCP) servers by applying static code analysis to identify Common Weakness Enumeration (CWE) vulnerabilities, mapping them to MITRE CAPEC attack patterns, and introducing a multi-metric risk-assessment framework to guide secure-by-design development.

Pratyay Kumar, Miguel Antonio Guirao Aguilera, Srikathyayani Srikanteswara, Satyajayant Misra, Abu Saleh Md Tayeen2026-03-12🤖 cs.AI

Adaptive Activation Cancellation for Hallucination Mitigation in Large Language Models

This paper introduces Adaptive Activation Cancellation (AAC), a real-time, training-free inference framework that mitigates hallucinations in large language models by identifying and suppressing hallucination-associated neural activations as structured interference, thereby improving factual accuracy across multiple model scales without degrading general capabilities or fluency.

Eric Yocam, Varghese Vaidyan, Gurcan Comert, Paris Kalathas, Yong Wang, Judith L. Mwakalonge2026-03-12💬 cs.CL

Multilingual AI-Driven Password Strength Estimation with Similarity-Based Detection

This research proposes a novel multilingual password strength meter that leverages AI-generated datasets (specifically ChatGPT) and Jaro similarity-based matching to outperform traditional models like PassGAN, demonstrating that incorporating non-English training data significantly enhances detection accuracy for language-specific vulnerabilities, particularly in the Indian context.

Nikitha M. Palaniappan, Ying He2026-03-12🤖 cs.AI

Rethinking the Harmonic Loss via Non-Euclidean Distance Layers

This paper extends the harmonic loss framework by systematically evaluating various non-Euclidean distance metrics across vision and language models, demonstrating that cosine-based variants offer superior trade-offs in accuracy, interpretability, and sustainability compared to traditional cross-entropy and Euclidean approaches.

Maxwell Miller-Golub, Kamil Faber, Marcin Pietron, Panpan Zheng, Pasquale Minervini, Roberto Corizzo2026-03-12🤖 cs.LG

DUCTILE: Agentic LLM Orchestration of Engineering Analysis in Product Development Practice

This paper introduces DUCTILE, an agentic LLM orchestration framework that separates adaptive decision-making from deterministic tool execution to automate engineering analysis in product development, successfully handling input deviations in an aerospace case study while highlighting the emerging tension between task automation and the creation of exhausting supervisory roles.

Alejandro Pradas-Gomez, Arindam Brahma, Ola Isaksson2026-03-12🤖 cs.AI

Conversational AI-Enhanced Exploration System to Query Large-Scale Digitised Collections of Natural History Museums

This paper presents a human-centred system design that leverages conversational AI and function-calling capabilities to enable natural language querying and visual-spatial exploration of nearly 1.7 million digitised natural history specimen records at the Australian Museum, overcoming the limitations of traditional keyword-based search tools.

Yiyuan Wang, Andrew Johnston, Zoë Sadokierski, Rhiannon Stephens, Shane T. Ahyong2026-03-12🤖 cs.AI

Quantum entanglement provides a competitive advantage in adversarial games

This study demonstrates that quantum entanglement serves as a functional resource in competitive reinforcement learning, enabling hybrid quantum-classical agents trained on the game Pong to consistently outperform separable quantum circuits and match or exceed classical baselines by learning structurally distinct features that better model dynamic agent interactions.

Peiyong Wang, Kieran Hymas, James Quach2026-03-12⚛️ quant-ph