cs.CR papers | Gist.Science

CyberSleuth: Autonomous Blue-Team LLM Agent for Web Attack Forensics

This paper introduces CyberSleuth, an autonomous multi-agent LLM system that automates web attack forensics by analyzing network traces to identify compromised services and map exploits to specific CVEs, achieving 80% accuracy and demonstrating that simple orchestration with specialized agents outperforms complex hierarchical designs in generating expert-validated forensic reports.

Stefano Fumero, Kai Huang, Matteo Boffa, Danilo Giordano, Marco Mellia, Dario Rossi2026-03-06🔒 cs.CR

Secure human oversight of AI: Threat modeling in a socio-technical context

This paper introduces a security perspective on human oversight of AI by modeling it as an IT application to systematically identify new attack surfaces and propose mitigation strategies, thereby addressing a critical gap in current regulatory and academic discussions.

Jonas C. Ditz, Veronika Lazar, Elmar Lichtmeß, Carola Plesch, Matthias Heck, Kevin Baum, Markus Langer2026-03-06🔒 cs.CR

No exponential quantum speedup for $\mathrm{SIS}^\infty$ anymore

This paper presents efficient classical algorithms for the $\mathrm{SIS}^\infty$ and Constrained Integer Solution problems previously solved by an efficient quantum algorithm, thereby demonstrating that no exponential quantum speedup exists for these tasks.

Robin Kothari, Ryan O'Donnell, Kewen Wu2026-03-06🔒 cs.CR

Breaking and Fixing Defenses Against Control-Flow Hijacking in Multi-Agent Systems

This paper demonstrates that existing alignment-based defenses against control-flow hijacking in multi-agent systems are vulnerable to evasion due to inherent safety-functionality conflicts and limited context visibility, and proposes ControlValve, a new defense mechanism that enforces control-flow integrity and least privilege through permitted control-flow graphs and contextual rules.

Rishi Jha, Harold Triedman, Justin Wagle, Vitaly Shmatikov2026-03-06🔒 cs.CR

GhostEI-Bench: Do Mobile Agents Resilience to Environmental Injection in Dynamic On-Device Environments?

This paper introduces GhostEI-Bench, the first benchmark for evaluating the resilience of mobile Vision-Language Model agents against environmental injection attacks in dynamic on-device environments, revealing their critical vulnerability to adversarial UI elements that bypass textual safeguards and compromise device security.

Chiyu Chen, Xinhao Song, Yunkai Chai, Yang Yao, Haodong Zhao, Lijun Li, Jie Li, Yan Teng, Gongshen Liu, Yingchun Wang2026-03-06🔒 cs.CR

DeiTFake: Deepfake Detection Model using DeiT Multi-Stage Training

The paper introduces DeiTFake, a DeiT-based deepfake detection model that utilizes a novel two-stage progressive training strategy with increasing augmentation complexity to achieve state-of-the-art accuracy and robustness on the OpenForensics dataset.

Saksham Kumar, Ashish Singh, Srinivasarao Thota + 2 more2026-03-06💻 cs

BRIDG-ICS: AI-Grounded Knowledge Graphs for Intelligent Threat Analytics in Industry~5.0 Cyber-Physical Systems

The paper presents BRIDG-ICS, an AI-driven Knowledge Graph framework that integrates heterogeneous industrial and cybersecurity data using Large Language Models to enable context-aware threat analysis, multi-stage attack path simulation, and quantitative resilience assessment for Industry 5.0 cyber-physical systems.

Padmeswari Nandiya, Ahmad Mohsin, Ahmed Ibrahim, Iqbal H. Sarker, Helge Janicke2026-03-06🔒 cs.CR

Zombie Agents: Persistent Control of Self-Evolving LLM Agents via Self-Reinforcing Injections

This paper introduces "Zombie Agents," a persistent black-box attack on self-evolving LLM agents that covertly implants payloads into long-term memory during benign sessions to survive across interactions and trigger unauthorized actions in future sessions, demonstrating that current per-session defenses are insufficient against such memory-based compromises.

Xianglin Yang, Yufei He, Shuo Ji, Bryan Hooi, Jin Song Dong2026-03-06🔒 cs.CR

UC-Secure Star DKG for Non-Exportable Key Shares with VSS-Free Enforcement

This paper presents Star DKG (SDKG), a UC-secure distributed key generation protocol for non-exportable key shares in hardware-enforced environments that achieves transcript-driven affine consistency and 1+1-out-of- $n$ threshold access without relying on Verifiable Secret Sharing or share exportation.

Vipin Singh Sehrawat2026-03-06🔒 cs.CR

Lap2: Revisiting Laplace DP-SGD for High Dimensions via Majorization Theory

This paper introduces Lap2, a novel framework that enables L2-norm clipping for Laplace DP-SGD in high-dimensional models by leveraging majorization theory and Schur-convexity to overcome dimensionality barriers, thereby achieving privacy-utility performance comparable to or exceeding Gaussian DP-SGD.

Meisam Mohammady, Qin Yang, Nicholas Stout, Ayesha Samreen, Han Wang, Christopher J Quinn, Yuan Hong2026-03-06🔒 cs.CR

Jailbreak Foundry: From Papers to Runnable Attacks for Reproducible Benchmarking

The paper introduces Jailbreak Foundry (JBF), a multi-agent system that automatically translates jailbreak research papers into executable modules within a unified harness, enabling rapid, reproducible, and standardized benchmarking of large language model security against rapidly evolving attack techniques.

Zhicheng Fang, Jingjie Zheng, Chenxu Fu, Wei Xu2026-03-06🔒 cs.CR

Real Money, Fake Models: Deceptive Model Claims in Shadow APIs

This paper presents the first systematic audit revealing that widely used "shadow APIs," which claim to provide access to restricted frontier LLMs, frequently employ deceptive practices such as model substitution and safety manipulation, thereby compromising the reliability, reproducibility, and validity of downstream applications and academic research.

Yage Zhang, Yukun Jiang, Zeyuan Chen, Michael Backes, Xinyue Shen, Yang Zhang2026-03-06🔒 cs.CR

IoUCert: Robustness Verification for Anchor-based Object Detectors

The paper introduces IoUCert, a novel formal verification framework that overcomes the challenges of non-linear coordinate transformations and IoU metrics to enable the first robustness verification of realistic, anchor-based object detection models like SSD and YOLO.

Benedikt Brückner, Alejandro J. Mercado, Yanghao Zhang, Panagiotis Kouvaros, Alessio Lomuscio2026-03-06🔒 cs.CR

Reckless Designs and Broken Promises: Privacy Implications of Targeted Interactive Advertisements on Social Media Platforms

This paper reveals that the default interactive design of targeted advertisements on social media platforms like TikTok, Facebook, and Instagram creates a privacy loophole allowing advertisers to identify and view the profiles of users who engage with sensitive ads, thereby contradicting platform promises of data protection and highlighting the need for design modifications to ensure user transparency.

Julia B. Kieserman, Athanasios Andreou, Laura Edelson, Sandra Siby, Damon McCoy2026-03-06🔒 cs.CR

Zero-Knowledge Proof (ZKP) Authentication for Offline CBDC Payment System Using IoT Devices

This paper proposes a privacy-preserving, offline Central Bank Digital Currency (CBDC) payment model for resource-constrained IoT devices that integrates Secure Elements, lightweight Zero-Knowledge Proofs, and intermittent synchronization to enable secure, cash-like transactions while preventing double-spending and ensuring AML/CFT compliance without continuous internet connectivity.

Santanu Mondal, T. Chithralekha2026-03-06🔒 cs.CR

Measuring Privacy vs. Fidelity in Synthetic Social Media Datasets

This paper evaluates the privacy risks and fidelity of synthetic Instagram posts generated by large language models, demonstrating that while synthetic data significantly reduces authorship re-identification risks compared to real data, a trade-off exists where higher fidelity correlates with greater privacy leakage.

Henry Tari, Adriana Iamnitchi2026-03-06🔒 cs.CR

How Effective Are Publicly Accessible Deepfake Detection Tools? A Comparative Evaluation of Open-Source and Free-to-Use Platforms

This study evaluates six publicly accessible deepfake detection tools and finds that while forensic and AI-based classifiers exhibit complementary strengths and weaknesses, human evaluators with law enforcement experience significantly outperform all automated systems, particularly in resolving cases of disagreement.

Michael Rettinger, Ben Beaumont, Nhien-An Le-Khac, Hong-Hanh Nguyen-Le2026-03-06🔒 cs.CR

Benchmark of Benchmarks: Unpacking Influence and Code Repository Quality in LLM Safety Benchmarks

This paper presents the first multi-dimensional evaluation of 31 LLM safety benchmarks, revealing that while they do not outperform non-benchmark papers in academic influence, there is a critical misalignment where neither author prominence nor paper impact correlates with code quality, highlighting a significant need for improved repository readiness and ethical standards.

Junjie Chu, Xinyue Shen, Ye Leng, Michael Backes, Yun Shen, Yang Zhang2026-03-06🔒 cs.CR

Beyond Input Guardrails: Reconstructing Cross-Agent Semantic Flows for Execution-Aware Attack Detection

This paper introduces \SysName, a novel framework that enhances Multi-Agent System security by shifting from static input filtering to execution-aware analysis through the reconstruction of Cross-Agent Semantic Flows, effectively detecting complex attack vectors that bypass conventional guardrails.

Yangyang Wei, Yijie Xu, Zhenyuan Li, Xiangmin Shen, Shouling Ji2026-03-06🔒 cs.CR

Impact of 5G SA Logical Vulnerabilities on UAV Communications: Threat Models and Testbed Evaluation

This paper evaluates the impact of logical vulnerabilities in 5G Standalone networks on UAV communications by utilizing a Kubernetes-based testbed to demonstrate how attacks from malicious UEs, compromised gNodeBs, or the 5G core can disrupt operations, thereby highlighting the critical need for user plane isolation and protocol integrity.

Wagner Comin Sonaglio, Ágney Lopes Roth Ferraz, Lourenço Alves Pereira Júnior2026-03-06🔒 cs.CR

← Previous Next →

cs.CR