Multimodal Adversarial Quality Policy for Safe Grasping

This paper proposes the Multimodal Adversarial Quality Policy (MAQP), a framework that enhances safe robot grasping in human-robot interaction by introducing a Heterogeneous Dual-Patch Optimization Scheme and a Gradient-Level Modality Balancing Strategy to effectively generate multimodal adversarial patches that address distribution discrepancies and optimization imbalances between RGB and depth modalities.

Kunlin Xie, Chenghao Li, Haolan Zhang, Nak Young ChongWed, 11 Ma💻 cs

DOCFORGE-BENCH: A Comprehensive 0-shot Benchmark for Document Forgery Detection and Analysis

DOCFORGE-BENCH introduces the first unified zero-shot benchmark for document forgery detection, revealing that current methods suffer from severe calibration failures due to the extreme rarity of tampered pixels in documents, which renders standard fixed thresholds ineffective and highlights threshold adaptation as the critical missing step for practical deployment.

Zengqi Zhao, Weidi Xia, En Wei, Yan Zhang, Jane Mo, Tiannan Zhang, Yuanqin Dai, Zexi Chen, Yiran Tao, Simiao RenWed, 11 Ma💻 cs

Queer NLP: A Critical Survey on Literature Gaps, Biases and Trends

This survey critically examines the growing body of LGBTQIA+ NLP research within the ACL Anthology, revealing a reactive focus on identifying bias rather than proactive mitigation, and calls for future work to prioritize stakeholder involvement, intersectionality, interdisciplinarity, and non-English languages to build more just and inclusive technologies.

Sabine Weber, Angelina Wang, Ankush Gupta, Arjun Subramonian, Dennis Ulmer, Eshaan Tanwar, Geetanjali Aich, Hannah Devinney, Jacob Hobbs, Jennifer Mickel, Joshua Tint, Mae Sosto, Ray Groshan, Simone Astarita, Vagrant Gautam, Verena Blaschke, William Agnew, Wilson Y Lee, Yanan LongWed, 11 Ma💻 cs

A 26-Gram Butterfly-Inspired Robot Achieving Autonomous Tailless Flight

This paper introduces \textit{AirPulse}, a 26-gram butterfly-inspired robot that achieves the first autonomous, closed-loop tailless flight at this scale by replicating low-frequency, high-amplitude biomechanical traits through a hierarchical control architecture featuring Stroke Timing Asymmetry Rhythm (STAR).

Weibin Gu, Chenrui Feng, Lian Liu, Chen Yang, Xingchi Jiao, Yuhe Ding, Xiaofei Shi, Chao Gao, Alessandro Rizzo, Guyue ZhouWed, 11 Ma💻 cs

Pathwise Test-Time Correction for Autoregressive Long Video Generation

This paper introduces Test-Time Correction (TTC), a training-free method that stabilizes long-sequence video generation in distilled autoregressive models by using the initial frame as a reference anchor to calibrate intermediate states, thereby overcoming error accumulation and extending generation lengths without compromising quality.

Xunzhi Xiang, Zixuan Duan, Guiyu Zhang, Haiyu Zhang, Zhe Gao, Junta Wu, Shaofeng Zhang, Tengfei Wang, Qi Fan, Chunchao GuoWed, 11 Ma💻 cs

Optimal conversion from Rényi Differential Privacy to ff-Differential Privacy

This paper proves that the conjectured conversion rule, which maps a Rényi Differential Privacy profile to an ff-Differential Privacy trade-off function via the pointwise maximum of single-order bounds (equivalent to the intersection of RDP privacy regions), is optimal and cannot be uniformly improved upon for any valid RDP profile or Type I error level.

Anneliese Riess, Juan Felipe Gomez, Flavio du Pin Calmon, Julia Anne Schnabel, Georgios KaissisWed, 11 Ma💻 cs

CovertComBench: A First Domain-Specific Testbed for LLMs in Wireless Covert Communication

This paper introduces CovertComBench, a specialized benchmark for evaluating Large Language Models in wireless covert communication, revealing that while current models excel at conceptual understanding and code generation, they significantly struggle with the rigorous mathematical derivations required for security-constrained optimization.

Zhaozhi Liu, Jiaxin Chen, Yuanai Xie, Yuna Jiang, Minrui Xu, Xiao Zhang, Pan Lai, Zan ZhouWed, 11 Ma💻 cs

Low-rank Orthogonal Subspace Intervention for Generalizable Face Forgery Detection

To overcome the generalization failure of vanilla CLIP in face forgery detection caused by "low-rank spurious bias," this paper proposes SeLop, a causal representation learning method that identifies and removes spurious correlations via orthogonal low-rank subspace intervention, thereby achieving state-of-the-art performance with high robustness using only 0.39M trainable parameters.

Chi Wang, Xinjue Hu, Boyu Wang, Ziwen He, Zhangjie FuWed, 11 Ma💻 cs

A Tale of 1001 LoC: Potential Runtime Error-Guided Specification Synthesis for Verifying Large-Scale Programs

This paper introduces Preguss, a modular framework that combines static analysis with LLM-aided synthesis to automatically generate and refine interprocedural specifications, enabling highly automated verification of large-scale programs (over 1,000 lines of code) while significantly reducing human effort.

Zhongyi Wang, Tengjie Lin, Mingshuai Chen, Haokun Li, Mingqi Yang, Xiao Yi, Shengchao Qin, Yixing Luo, Xiaofeng Li, Bin Gu, Liqiang Lu, Jianwei YinWed, 11 Ma💻 cs

Taming Preference Mode Collapse via Directional Decoupling Alignment in Diffusion Reinforcement Learning

This paper introduces Directional Decoupling Alignment (D2^2-Align), a novel framework that mitigates Preference Mode Collapse in diffusion reinforcement learning by applying directional corrections to reward signals, thereby preserving generative diversity while achieving superior human preference alignment.

Chubin Chen, Sujie Hu, Jiashu Zhu, Meiqi Wu, Jintao Chen, Yanxun Li, Nisha Huang, Chengyu Fang, Jiahong Wu, Xiangxiang Chu, Xiu LiWed, 11 Ma💻 cs

UniBYD: A Unified Framework for Learning Robotic Manipulation Across Embodiments Beyond Imitation of Human Demonstrations

UniBYD is a unified framework that leverages a unified morphological representation and a dynamic reinforcement learning algorithm with a hybrid shadow engine to bridge the embodiment gap, enabling robotic hands to transcend human imitation and discover manipulation policies optimally adapted to their own physical morphologies.

Tingyu Yuan, Biaoliang Guan, Wen Ye, Ziyan Tian, Yi Yang, Weijie Zhou, Zhaowen Li, Yan Huang, Peng Wang, Chaoyang Zhao, Jinqiao WangWed, 11 Ma💻 cs

AVGGT: Rethinking Global Attention for Accelerating VGGT

This paper introduces AVGGT, a training-free acceleration framework that leverages an analysis of global attention's distinct roles in VGGT and π3\pi^3 to implement a two-step optimization strategy, achieving up to 10×\times inference speedup on long sequences while maintaining or improving accuracy in dense multi-view 3D reconstruction tasks.

Xianbing Sun, Zhikai Zhu, Zhengyu Lou, Bo Yang, Jinyang Tang, Liqing Zhang, He Wang, Jianfu ZhangWed, 11 Ma💻 cs