cs.AI papers | Gist.Science

Compatibility at a Cost: Systematic Discovery and Exploitation of MCP Clause-Compliance Vulnerabilities

This paper introduces the first systematic framework for identifying and exploiting "compatibility-abusing attacks" in the Model Context Protocol (MCP) by utilizing a language-agnostic intermediate representation and LLM-guided static analysis to uncover security vulnerabilities stemming from optional clause implementations across diverse SDKs.

Nanzi Yang, Weiheng Bai, Kangjie Lu | 2026-03-12 | cs.AI

MCP-in-SoS: Risk assessment framework for open-source MCP servers

This paper addresses the lack of systematic security evaluation for open-source Model Context Protocol (MCP) servers by applying static code analysis to identify Common Weakness Enumeration (CWE) vulnerabilities, mapping them to MITRE CAPEC attack patterns, and introducing a multi-metric risk-assessment framework to guide secure-by-design development.

Pratyay Kumar, Miguel Antonio Guirao Aguilera, Srikathyayani Srikanteswara, Satyajayant Misra, Abu Saleh Md Tayeen | 2026-03-12 | cs.AI

Adaptive Activation Cancellation for Hallucination Mitigation in Large Language Models

This paper introduces Adaptive Activation Cancellation (AAC), a real-time, training-free inference framework that mitigates hallucinations in large language models by identifying and suppressing hallucination-associated neural activations as structured interference, thereby improving factual accuracy across multiple model scales without degrading general capabilities or fluency.

Eric Yocam, Varghese Vaidyan, Gurcan Comert, Paris Kalathas, Yong Wang, Judith L. Mwakalonge | 2026-03-12 | cs.CL

Delta-K: Boosting Multi-Instance Generation via Cross-Attention Augmentation

Delta-K is a training-free, plug-and-play inference framework that resolves concept omission in multi-instance image generation by extracting and injecting differential keys into the shared cross-attention Key space to establish coherent semantic representations without requiring architectural modifications or additional training.

Zitong Wang, Zijun Shen, Haohao Xu, Zhengjie Luo, Weibin Wu | 2026-03-12 | cs.AI

Multilingual AI-Driven Password Strength Estimation with Similarity-Based Detection

This research proposes a multilingual password strength meter that combines ChatGPT-generated datasets with Jaro similarity-based matching to outperform traditional models like PassGAN, demonstrating that incorporating non-English training data significantly improves detection of language-specific vulnerabilities, particularly in the Indian context.

Nikitha M. Palaniappan, Ying He | 2026-03-12 | cs.AI

A Diffusion Analysis of Policy Gradient for Stochastic Bandits

This paper establishes that a continuous-time diffusion approximation of policy gradient for stochastic bandits achieves logarithmic regret with a learning rate of $O(\Delta^2/\log(n))$, while demonstrating that a learning rate of $O(\Delta^2)$ is necessary to avoid linear regret in specific instances.

Tor Lattimore | 2026-03-12 | stat

Robotic Ultrasound Makes CBCT Alive

This paper proposes a real-time, deformation-aware framework that integrates robotic ultrasound with a lightweight USCORUNet network to dynamically update static intraoperative CBCT scans, thereby enabling continuous soft-tissue monitoring and navigation refinement without repeated radiation exposure.

Feng Li, Ziyuan Li, Zhongliang Jiang, Nassir Navab, Yuan Bi | 2026-03-12 | cs.AI

Rethinking the Harmonic Loss via Non-Euclidean Distance Layers

This paper extends the harmonic loss framework by systematically evaluating various non-Euclidean distance metrics across vision and language models, demonstrating that cosine-based variants offer superior trade-offs in accuracy, interpretability, and sustainability compared to traditional cross-entropy and Euclidean approaches.

Maxwell Miller-Golub, Kamil Faber, Marcin Pietron, Panpan Zheng, Pasquale Minervini, Roberto Corizzo | 2026-03-12 | cs.LG

Learning from Radio using Variational Quantum RF Sensing

This paper proposes a variational quantum sensing framework that utilizes a quantum circuit-optimized probe to learn environmental features from radio-frequency signals, demonstrating through ray-tracing simulations that this approach enables robust localization with no deployment channel measurements and superior sensitivity to weak or obstructed signals compared to classical baselines.

Ivana Nikoloska | 2026-03-12 | quant-ph

Intrinsic Numerical Robustness and Fault Tolerance in a Neuromorphic Algorithm for Scientific Computing

This paper demonstrates that a natively spiking neuromorphic algorithm for solving partial differential equations possesses intrinsic fault tolerance, maintaining accuracy even when up to 32% of neurons and 90% of spikes are dropped, with this robustness being tunable via structural hyperparameters.

Bradley H. Theilman, James B. Aimone | 2026-03-12 | cs.AI

DUCTILE: Agentic LLM Orchestration of Engineering Analysis in Product Development Practice

This paper introduces DUCTILE, an agentic LLM orchestration framework that separates adaptive decision-making from deterministic tool execution to automate engineering analysis in product development, successfully handling input deviations in an aerospace case study while highlighting the emerging tension between task automation and the creation of exhausting supervisory roles.

Alejandro Pradas-Gomez, Arindam Brahma, Ola Isaksson | 2026-03-12 | cs.AI

Joint Imaging-ROI Representation Learning via Cross-View Contrastive Alignment for Brain Disorder Classification

This paper proposes a unified cross-view contrastive learning framework that aligns global imaging and local ROI-graph representations to enhance brain disorder classification performance and reveal their complementary discriminative patterns, as validated on the ADHD-200 and ABIDE datasets.

Wei Liang, Lifang He | 2026-03-12 | cs.AI

Taming Score-Based Denoisers in ADMM: A Convergent Plug-and-Play Framework

This paper introduces ADMM-PnP with a novel AC-DC denoiser that resolves manifold mismatch and establishes convergence guarantees for score-based generative models within the ADMM framework, thereby improving solution quality across various inverse problems.

Rajesh Shrestha, Xiao Fu | 2026-03-12 | cs.LG

Conversational AI-Enhanced Exploration System to Query Large-Scale Digitised Collections of Natural History Museums

This paper presents a human-centred system design that leverages conversational AI and function-calling capabilities to enable natural language querying and visual-spatial exploration of nearly 1.7 million digitised natural history specimen records at the Australian Museum, overcoming the limitations of traditional keyword-based search tools.

Yiyuan Wang, Andrew Johnston, Zoë Sadokierski, Rhiannon Stephens, Shane T. Ahyong | 2026-03-12 | cs.AI

Quantum entanglement provides a competitive advantage in adversarial games

This study demonstrates that quantum entanglement serves as a functional resource in competitive reinforcement learning, enabling hybrid quantum-classical agents trained on the game Pong to consistently outperform separable quantum circuits and match or exceed classical baselines by learning structurally distinct features that better model dynamic agent interactions.

Peiyong Wang, Kieran Hymas, James Quach | 2026-03-12 | quant-ph

Hybrid Self-evolving Structured Memory for GUI Agents

This paper introduces HyMEM, a hybrid self-evolving structured memory system that combines discrete symbolic nodes with continuous embeddings in a graph format to significantly enhance the performance of open-source GUI agents, enabling them to match or surpass strong closed-source models on complex, long-horizon tasks.

Sibo Zhu, Wenyi Wu, Kun Zhou, Stephen Wang, Biwei Huang | 2026-03-12 | cs.AI

Simulation-in-the-Reasoning (SiR): A Conceptual Framework for Empirically Grounded AI in Autonomous Transportation

This paper introduces Simulation-in-the-Reasoning (SiR), a conceptual framework that enhances Large Language Models for autonomous transportation by embedding domain-specific simulators directly into the reasoning loop to transform hypothetical narratives into falsifiable, empirically grounded workflows.

Wuping Xin | 2026-03-12 | eess

Is this Idea Novel? An Automated Benchmark for Judgment of Research Ideas

This paper introduces RINoBench, the first comprehensive benchmark for evaluating automated research idea novelty judgments, which reveals that while large language models can generate reasoning similar to human experts, they still struggle to produce accurate novelty assessments that align with human gold standards.

Tim Schopf, Michael Färber | 2026-03-12 | cs.CL

NasoVoce: A Nose-Mounted Low-Audibility Speech Interface for Always-Available Speech Interaction

NasoVoce is a nose-mounted speech interface that fuses acoustic and vibration signals to enable robust, discreet, and always-available voice interaction for AI, effectively overcoming the limitations of existing silent and whispered speech recognition methods.

Jun Rekimoto, Yu Nishimura, Bojian Yang | 2026-03-12 | cs.AI

PC-Diffuser: Path-Consistent Capsule CBF Safety Filtering for Diffusion-Based Trajectory Planner

PC-Diffuser is a safety augmentation framework that integrates a certifiable, path-consistent capsule barrier function directly into the denoising loop of diffusion-based trajectory planners to ensure forward invariance and dynamically feasible motion while preserving the learned path geometry.

Eugene Ku, Yiwei Lyu | 2026-03-12 | cs.AI
