LiM-YOLO: Less is More with Pyramid Level Shift and Normalized Auxiliary Branch for Ship Detection in Optical Remote Sensing Imagery

LiM-YOLO is a streamlined ship detection model for optical remote sensing imagery that achieves state-of-the-art accuracy with fewer parameters by shifting the detection pyramid from P3-P5 to P2-P4 to better resolve small vessels and employing Group Normalization to stabilize training on high-resolution inputs.

Seon-Hoon Kim, Hyeji Sim, Youeyun Jung, Ok-Chul Jung, Yerin KimWed, 11 Ma⚡ eess

Entropy-and-Channel-Aware Adaptive-Rate Semantic Communication with MLLM-Aided Feature Compensation

This paper proposes an entropy-and-channel-aware adaptive-rate semantic communication framework for MIMO Rayleigh fading channels that dynamically selects feature maps and symbols based on channel conditions and content complexity, while leveraging a fine-tuned multimodal large language model (MLLM) at the receiver to compensate for discarded information and optimize task performance across varying signal-to-noise ratios.

Weixuan Chen, Qianqian Yang, Yuhao Chen, Chongwen Huang, Qian Wang, Zehui Xiong, Zhaoyang ZhangWed, 11 Ma⚡ eess

PanoAffordanceNet: Towards Holistic Affordance Grounding in 360{\deg} Indoor Environments

This paper introduces PanoAffordanceNet, a novel framework and the first high-quality dataset (360-AGD) designed to enable holistic affordance grounding in 360-degree indoor environments by addressing challenges like geometric distortion and semantic dispersion through distortion-aware calibration and multi-level constraints.

Guoliang Zhu, Wanjun Jia, Caoyang Shao, Yuheng Zhang, Zhiyong Li, Kailun YangWed, 11 Ma⚡ eess

M2M^2-Occ: Resilient 3D Semantic Occupancy Prediction for Autonomous Driving with Incomplete Camera Inputs

The paper introduces M2M^2-Occ, a robust 3D semantic occupancy prediction framework that leverages a Multi-view Masked Reconstruction module and a Feature Memory Module to maintain geometric and semantic coherence under incomplete multi-camera inputs, significantly outperforming existing methods in scenarios with missing views.

Kaixin Lin, Kunyu Peng, Di Wen, Yufan Chen, Ruiping Liu, Kailun YangWed, 11 Ma⚡ eess

CycleULM: A unified label-free deep learning framework for ultrasound localisation microscopy

CycleULM is a novel, label-free deep learning framework that leverages CycleGAN to bridge the simulation-to-reality gap in ultrasound localisation microscopy, significantly enhancing microbubble localisation accuracy, image resolution, and processing speed for real-time clinical application without requiring paired ground truth data.

Su Yan, Clara Rodrigo Gonzalez, Vincent C. H. Leung, Herman Verinaz-Jadan, Jiakang Chen, Matthieu Toulemonde, Kai Riemer, Jipeng Yan, Clotilde Vié, Qingyuan Tan, Peter D. Weinberg, Pier Luigi Dragotti, Kevin G. Murphy, Meng-Xing TangWed, 11 Ma⚡ eess

Association of Radiologic PPFE Change with Mortality in Lung Cancer Screening Cohorts

This study demonstrates that the longitudinal progression of radiologic pleuroparenchymal fibroelastosis (PPFE), quantified via automated analysis of low-dose CT scans, independently predicts increased mortality and adverse respiratory outcomes in large lung cancer screening cohorts.

Shahab Aslani, Mehran Azimbagirad, Daryl Cheng, Daisuke Yamada, Ryoko Egashira, Adam Szmul, Justine Chan-Fook, Robert Chapman, Alfred Chung Pui So, Shanshan Wang, John McCabe, Tianqi Yang, Jose M Brenes, Eyjolfur Gudmundsson, The SUMMIT Consortium, Susan M. Astley, Daniel C. Alexander, Sam M. Janes, Joseph JacobWed, 11 Ma🧬 q-bio

M2Diff: Multi-Modality Multi-Task Enhanced Diffusion Model for MRI-Guided Low-Dose PET Enhancement

The paper introduces M2Diff, a multi-modality multi-task diffusion model that separately processes MRI and low-dose PET scans to extract and hierarchically fuse modality-specific features, thereby significantly improving the fidelity of standard-dose PET reconstruction for both healthy and Alzheimer's disease populations.

Ghulam Nabi Ahmad Hassan Yar, Himashi Peiris, Victoria Mar, Cameron Dennis Pain, Zhaolin ChenWed, 11 Ma⚡ eess

When to Lock Attention: Training-Free KV Control in Video Diffusion

KV-Lock is a training-free framework for DiT-based video diffusion models that dynamically adjusts background key-value locking and classifier-free guidance scales based on hallucination detection to simultaneously enhance foreground quality and maintain background consistency.

Tianyi Zeng, Jincheng Gao, Tianyi Wang, Zijie Meng, Miao Zhang, Jun Yin, Haoyuan Sun, Junfeng Jiao, Christian Claudel, Junbo Tan, Xueqian WangWed, 11 Ma🤖 cs.AI

POLISH'ing the Sky: Wide-Field and High-Dynamic Range Interferometric Image Reconstruction with Application to Strong Lens Discovery

This paper presents an enhanced deep learning framework, POLISH, which utilizes patch-wise training and nonlinear intensity transformations to achieve robust, high-dynamic-range, wide-field radio interferometric imaging, demonstrating its ability to significantly increase the discovery rate of strong gravitational lenses compared to traditional methods.

Zihui Wu, Liam Connor, Samuel McCarty, Katherine L. BoumanWed, 11 Ma🔭 astro-ph

TransUNet-GradCAM: A Hybrid Transformer-U-Net with Self-Attention and Explainable Visualizations for Foot Ulcer Segmentation

This paper presents TransUNet-GradCAM, a hybrid Vision Transformer-U-Net model that effectively segments diabetic foot ulcers by combining global attention with local feature extraction, achieving high accuracy on internal and external datasets while providing explainable visualizations for clinical utility.

Akwasi Asare, Mary Sagoe, Justice Williams Asare, Stephen Edward MooreTue, 10 Ma💻 cs

Transforming H&E images into IHC: A Variance-Penalized GAN for Precision Oncology

This study introduces a variance-penalized GAN based on pyramid pix2pix that generates high-fidelity HER2-specific immunohistochemistry (IHC) images from routine hematoxylin and eosin (H&E) slides, effectively mitigating mode collapse and outperforming baseline models to enable cost-effective, scalable precision oncology diagnostics.

Sara Rehmat, Hafeez Ur Rehman, Byeong-Gwon Kang, Sarra Ayouni, Yunyoung NamTue, 10 Ma💻 cs