Taming Preference Mode Collapse via Directional Decoupling Alignment in Diffusion Reinforcement Learning
This paper introduces Directional Decoupling Alignment (D-Align), a novel framework that mitigates Preference Mode Collapse in diffusion reinforcement learning. By applying directional corrections to reward signals, D-Align preserves generative diversity while achieving stronger alignment with human preferences.
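As a rough intuition for what a "directional correction" to a reward signal could look like, here is a minimal sketch under an assumed reading: the reward gradient is split into a component along an estimated mode-collapse direction and an orthogonal remainder, and the collapse-aligned component is down-weighted. All names here (`decouple_reward_grad`, `collapse_dir`, `alpha`) are illustrative assumptions, not the paper's actual algorithm or API.

```python
import numpy as np

def decouple_reward_grad(grad, collapse_dir, alpha=0.3):
    """Illustrative directional decoupling of a reward gradient.

    Splits `grad` into the component parallel to `collapse_dir`
    (assumed here to drive preference mode collapse) and the
    orthogonal remainder, then down-weights the parallel part
    by `alpha` so the orthogonal, diversity-preserving part of
    the signal is left intact.
    """
    d = collapse_dir / (np.linalg.norm(collapse_dir) + 1e-8)  # unit direction
    parallel = np.dot(grad, d) * d        # component along the collapse direction
    orthogonal = grad - parallel          # diversity-preserving remainder
    return alpha * parallel + orthogonal  # corrected reward signal

# Toy usage: a raw reward gradient and a hypothetical collapse direction.
g = np.array([1.0, 2.0, 0.5])
d = np.array([1.0, 0.0, 0.0])
print(decouple_reward_grad(g, d))  # collapse-aligned component scaled by alpha
```

With `alpha < 1`, the update still moves toward higher reward but is damped along the single direction that would otherwise dominate, which is one plausible way a directional correction could trade off preference alignment against diversity.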