Multi-Stream Perturbation Attack: Breaking Safety Alignment of Thinking LLMs Through Concurrent Task Interference
This paper proposes a "multi-stream perturbation attack" that exploits vulnerabilities in the step-by-step reasoning of thinking-mode LLMs. By interweaving multiple concurrent task streams into a single prompt, the attack disrupts safety alignment, achieving high attack success rates and inducing reasoning collapse or repetitive outputs across a range of models.