Enhancing Network Intrusion Detection Systems: A Multi-Layer Ensemble Approach to Mitigate Adversarial Attacks

This paper proposes a novel multi-layer ensemble defense mechanism combining stacking classifiers, autoencoders, and adversarial training to enhance the robustness of machine learning-based Network Intrusion Detection Systems against adversarial attacks generated by GANs and FGSM, demonstrating improved resilience on the UNSW-NB15 and NSL-KDD datasets.

Nasim Soltani, Shayan Nejadshamsi, Zakaria Abou El Houda, Raphael Khoury, Kelton A. P. Costa, Tiago H. Falk, Anderson R. Avila2026-03-12🤖 cs.AI

Domain-Adaptive Health Indicator Learning with Degradation-Stage Synchronized Sampling and Cross-Domain Autoencoder

This paper proposes a domain-adaptive framework featuring degradation-stage synchronized batch sampling and a cross-domain aligned fusion large autoencoder to overcome distribution mismatches and temporal dependency limitations in health indicator learning, achieving significant performance improvements on industrial datasets.

Jungho Choo, Hanbyeol Park, Gawon Lee, Yunkyung Park, Hyerim Bae2026-03-12🤖 cs.LG

The Curse and Blessing of Mean Bias in FP4-Quantized LLM Training

This paper identifies a coherent rank-one mean bias as the primary cause of numerical instability in low-bit LLM training and demonstrates that simply subtracting this mean restores stability and performance in FP4 quantization, offering a hardware-efficient alternative to complex spectral methods.

Hengjie Cao, Zhendong Huang, Mengyi Chen, Yifeng Yang, Fanqi Yu, Ruijun Huang, Fang Dong, Xin Zhang, Jixian Zhou, Anrui Chen, Mingzhi Dong, Yujiang Wang, Jinlong Hou, Qin Lv, Yuan Cheng, Tun Lu, Fan Yang, Li Shang2026-03-12🤖 cs.LG

FAR-Dex: Few-shot Data Augmentation and Adaptive Residual Policy Refinement for Dexterous Manipulation

FAR-Dex is a hierarchical framework that combines few-shot data augmentation via the IsaacLab simulator with an adaptive residual policy refinement module to overcome data scarcity and high-dimensional action space challenges, achieving robust and precise dexterous arm-hand coordination with over 80% success in real-world tasks.

Yushan Bai, Fulin Chen, Hongzheng Sun, Yuchuang Tong, En Li, Zhengtao Zhang2026-03-12🤖 cs.AI

Modeling Stage-wise Evolution of User Interests for News Recommendation

This paper proposes a unified framework for news recommendation that addresses the time-sensitive nature of user interests by combining global collaborative signals for long-term preferences with a stage-wise temporal subgraph approach, enhanced by LSTM and self-attention mechanisms, to effectively model both stable habits and rapidly evolving short-term dynamics.

Zhiyong Cheng, Yike Jin, Zhijie Zhang, Huilin Chen, Zhangling Duan, Meng Wang2026-03-12🤖 cs.AI

Aligning Large Language Models with Searcher Preferences

This paper introduces SearchLLM, the first large language model designed for open-ended generative search on platforms like RedNote, which utilizes a hierarchical multi-dimensional reward system and Gated Aggregation Strategy with GRPO to balance safety, factual grounding, and user alignment, resulting in measurable improvements in generation quality and user engagement.

Wei Wu, Peilun Zhou, Liyi Chen, Qimeng Wang, Chengqiang Lu, Yan Gao, Yi Wu, Yao Hu, Hui Xiong2026-03-12💬 cs.CL

Naïve Exposure of Generative AI Capabilities Undermines Deepfake Detection

This paper demonstrates that the naive exposure of powerful reasoning and image refinement capabilities in commercial generative AI chatbots fundamentally undermines state-of-the-art deepfake detectors by allowing adversaries to use benign, policy-compliant prompts to generate high-quality, identity-preserving images that evade detection, revealing a critical structural mismatch between current threat models and real-world AI capabilities.

Sunpill Kim, Chanwoo Hwang, Minsu Kim, Jae Hong Seo2026-03-12🤖 cs.AI

Resource-constrained Amazons chess decision framework integrating large language models and graph attention

This paper proposes a lightweight hybrid framework for the Game of the Amazons that integrates Graph Attention Autoencoders, Stochastic Graph Genetic Algorithms, and GPT-4o-mini to overcome resource constraints, achieving decision accuracy improvements of 15%–56% over baselines and outperforming its teacher model by effectively denoising LLM outputs through structural graph reasoning.

Tianhao Qian, Zhuoxuan Li, Jinde Cao, Xinli Shi, Hanjie Liu, Leszek Rutkowski2026-03-12🤖 cs.AI

IH-Challenge: A Training Dataset to Improve Instruction Hierarchy on Frontier LLMs

The paper introduces IH-Challenge, a reinforcement learning dataset designed to enhance instruction hierarchy robustness in frontier LLMs, which significantly improves their ability to prioritize instructions against conflicts and adversarial attacks while maintaining helpfulness and minimizing capability regression.

Chuan Guo (Michael Pokorny), Juan Felipe Ceron Uribe (Michael Pokorny), Sicheng Zhu (Michael Pokorny), Christopher A. Choquette-Choo (Michael Pokorny), Steph Lin (Michael Pokorny), Nikhil Kandpal (Michael Pokorny), Milad Nasr (Michael Pokorny), Rai (Michael Pokorny), Sam Toyer, Miles Wang, Yaodong Yu, Alex Beutel, Kai Xiao2026-03-12🤖 cs.AI

UAV-MARL: Multi-Agent Reinforcement Learning for Time-Critical and Dynamic Medical Supply Delivery

This paper presents a Multi-Agent Reinforcement Learning framework using Proximal Policy Optimization to coordinate UAV fleets for time-critical medical supply delivery, demonstrating that classical PPO outperforms asynchronous and sequential strategies in dynamically prioritizing tasks and reallocating resources under uncertain conditions using real-world geographic data.

Islam Guven, Mehmet Parlak2026-03-12🤖 cs.LG

Prompting with the human-touch: evaluating model-sensitivity of foundation models for musculoskeletal CT segmentation

This study evaluates 11 promptable foundation models for musculoskeletal CT segmentation across four anatomical regions, revealing that while specific models like SAM and nnInteractive perform best under ideal conditions, all models exhibit significant sensitivity to human prompting variations, leading to performance drops and highlighting the challenge of selecting robust models for real-world clinical applications.

Caroline Magg, Maaike A. ter Wee, Johannes G. G. Dobbe, Geert J. Streekstra, Leendert Blankevoort, Clara I. Sánchez, Hoel Kervadec2026-03-12🤖 cs.AI

Towards Cognitive Defect Analysis in Active Infrared Thermography with Vision-Text Cues

This paper introduces a novel language-guided framework that leverages pretrained vision-language models and a specialized adapter to achieve zero-shot, generative detection and localization of subsurface defects in carbon fiber-reinforced polymers using active infrared thermography, thereby eliminating the need for costly, task-specific training datasets while significantly improving signal-to-noise ratios and detection accuracy.

Mohammed Salah, Eman Ouda, Giuseppe Dell'Avvocato, Fabrizio Sarasini, Ester D'Accardi, Jorge Dias, Davor Svetinovic, Stefano Sfarra, Yusra Abdulrahman2026-03-12⚡ eess

Adaptive RAN Slicing Control via Reward-Free Self-Finetuning Agents

This paper proposes a novel self-finetuning framework that enables Generative AI agents to autonomously learn continuous control for dynamic Radio Access Network slicing by distilling long-horizon experiences into model parameters via a bi-perspective reflection mechanism, thereby outperforming traditional Reinforcement Learning and standard LLM-based agents in sample efficiency and multi-objective optimization without relying on handcrafted reward signals.

Yuanhao Li, Haozhe Wang, Geyong Min, Nektarios Georgalas, Wang Miao2026-03-12🤖 cs.AI

CUAAudit: Meta-Evaluation of Vision-Language Models as Auditors of Autonomous Computer-Use Agents

This paper presents CUAAudit, a large-scale meta-evaluation demonstrating that while Vision-Language Models can serve as autonomous auditors for Computer-Use Agents with strong accuracy and calibration, their significant performance degradation in complex environments and notable inter-model disagreement reveal fundamental limitations that necessitate explicit accounting for evaluator reliability and uncertainty in real-world deployments.

Marta Sumyk, Oleksandr Kosovan2026-03-12🤖 cs.AI

Does LLM Alignment Really Need Diversity? An Empirical Study of Adapting RLVR Methods for Moral Reasoning

This paper empirically demonstrates that contrary to the hypothesis that moral reasoning alignment requires diversity-seeking algorithms, standard reward-maximizing RLVR methods are equally or more effective because high-reward moral responses exhibit a concentrated distribution in semantic space similar to logical reasoning tasks.

Zhaowei Zhang, Xiaohan Liu, Xuekai Zhu, Junchao Huang, Ceyao Zhang, Zhiyuan Feng, Yaodong Yang, Xiaoyuan Yi, Xing Xie2026-03-12🤖 cs.AI