Progressive Split Mamba: Effective State Space Modelling for Image Restoration

The paper proposes Progressive Split-Mamba (PS-Mamba), a topology-aware hierarchical state-space framework that addresses the spatial distortion and long-range decay limitations of standard Mamba models in image restoration by employing geometry-consistent partitioning and symmetric cross-scale shortcuts to effectively balance local structural preservation with global coherence.

Mohammed Hassanin, Nour Moustafa, Weijian Deng, Ibrahim RadwanWed, 11 Ma💻 cs

Agentic AI as a Network Control-Plane Intelligence Layer for Federated Learning over 6G

This paper proposes an Agentic AI framework that serves as a control-plane intelligence layer for 6G networks, utilizing specialized agents to dynamically manage federated learning tasks by integrating network conditions with learning objectives to optimize client selection, resource allocation, and scheduling.

Loc X. Nguyen, Ji Su Yoon, Huy Q. Le, Yu Qiao, Avi Deb Raha, Eui-Nam Huh, Nguyen H. Tran, Choong Seon HongWed, 11 Ma💻 cs

Transformer-Based Multi-Region Segmentation and Radiomic Analysis of HR-pQCT Imaging

This paper introduces a novel, fully automated framework that utilizes a SegFormer transformer to segment multiple anatomical regions in HR-pQCT images and extract radiomic features, demonstrating that soft tissue analysis outperforms traditional bone-based metrics in detecting osteoporosis.

Mohseu Rashid Subah, Mohammed Abdul Gani Zilani, Thomas L. Nickolas, Matthew R. Allen, Stuart J. Warden, Rachel K. SurowiecWed, 11 Ma💻 cs

MedKCO: Medical Vision-Language Pretraining via Knowledge-Driven Cognitive Orchestration

MedKCO is a medical vision-language pretraining framework that overcomes the limitations of simultaneous concept learning by employing a two-level curriculum for data ordering and a self-paced asymmetric contrastive loss to dynamically adjust the learning objective, thereby significantly improving feature representations and downstream task performance.

Chenran Zhang, Ruiqi Wu, Tao Zhou, Yi ZhouWed, 11 Ma💻 cs

Diffusion-Based Authentication of Copy Detection Patterns: A Multimodal Framework with Printer Signature Conditioning

This paper proposes a novel diffusion-based framework that enhances Copy Detection Pattern authentication by integrating printer signatures and ControlNet to effectively distinguish genuine prints from high-quality counterfeits, outperforming traditional methods in generalization and accuracy.

Bolutife Atoki, Iuliia Tkachenko, Bertrand Kerautret, Carlos Crispim-JuniorWed, 11 Ma💻 cs

SurgCalib: Gaussian Splatting-Based Hand-Eye Calibration for Robot-Assisted Minimally Invasive Surgery

This paper presents SurgCalib, a markerless, Gaussian Splatting-based framework that achieves accurate hand-eye calibration for the da Vinci surgical robot by refining kinematic estimates through a differentiable rendering pipeline, thereby overcoming cable-driven inaccuracies and avoiding the sterility issues associated with traditional fiducial markers.

Zijian Wu, Shuojue Yang, Yu Chung Lee, Eitan Prisman, Yueming Jin, Septimiu E. SalcudeanWed, 11 Ma💻 cs

SVG-EAR: Parameter-Free Linear Compensation for Sparse Video Generation via Error-aware Routing

The paper introduces SVG-EAR, a parameter-free method that enhances sparse video generation in Diffusion Transformers by using semantic clustering for linear compensation and error-aware routing to selectively compute high-error blocks, thereby achieving significant speedups while maintaining generation fidelity.

Xuanyi Zhou, Qiuyang Mang, Shuo Yang, Haocheng Xi, Jintao Zhang, Huanzhi Mao, Joseph E. Gonzalez, Kurt Keutzer, Ion Stoica, Alvin CheungWed, 11 Ma💻 cs

LiM-YOLO: Less is More with Pyramid Level Shift and Normalized Auxiliary Branch for Ship Detection in Optical Remote Sensing Imagery

LiM-YOLO is a streamlined ship detection model for optical remote sensing imagery that achieves state-of-the-art accuracy with fewer parameters by shifting the detection pyramid from P3-P5 to P2-P4 to better resolve small vessels and employing Group Normalization to stabilize training on high-resolution inputs.

Seon-Hoon Kim, Hyeji Sim, Youeyun Jung, Ok-Chul Jung, Yerin KimWed, 11 Ma⚡ eess