Evaluating quality metrics through the lenses of psychophysical measurements of low-level vision

This paper introduces a new framework of psychophysical tests based on low-level vision principles—specifically contrast sensitivity, masking, and matching—to evaluate and reveal the perceptual strengths and weaknesses of 34 existing image and video quality metrics, demonstrating that standard evaluation protocols often fail to capture these fundamental human visual properties.

Dounia Hammou, Yancheng Cai, Pavan Madhusudanarao, Christos G. Bampis, Rafał K. MantiukMon, 09 Ma💻 cs

ECLARE: Efficient cross-planar learning for anisotropic resolution enhancement

ECLARE is an open-source, self-supervised super-resolution method that enhances anisotropic 2D MR volumes by estimating slice profiles and learning in-plane mappings without external data, thereby overcoming domain shift and outperforming existing techniques in both signal recovery and downstream tasks.

Samuel W. Remedios, Shuwen Wei, Shuo Han, Jinwei Zhang, Aaron Carass, Kurt G. Schilling, Dzung L. Pham, Jerry L. Prince, Blake E. DeweyMon, 09 Ma💻 cs

Architectural Unification for Polarimetric Imaging Across Multiple Degradations

This paper proposes a unified, single-stage architectural framework that jointly processes image and Stokes domains to achieve state-of-the-art performance in recovering polarimetric parameters from various degraded observations, including low-light noise, motion blur, and mosaicing artifacts, while ensuring physical consistency and avoiding error accumulation.

Chu Zhou, Yufei Han, Junda Liao, Linrui Dai, Wangze Xu, Art Subpa-Asa, Heng Guo, Boxin Shi, Imari SatoMon, 09 Ma💻 cs

Uni-LVC: A Unified Method for Intra- and Inter-Mode Learned Video Compression

Uni-LVC is a unified learned video compression framework that integrates intra and inter coding into a single model by conditioning inter-coding on temporal cues via a cross-attention module and a reliability-aware classifier, thereby achieving superior rate-distortion performance across low-delay and random-access scenarios while maintaining computational efficiency.

Yichi Zhang, Ruoyu Yang, Fengqing ZhuMon, 09 Ma💻 cs

Gabor Primitives for Accelerated Cardiac Cine MRI Reconstruction

This paper proposes a cardiac cine MRI reconstruction method using Gabor primitives, which combine Gaussian envelopes with complex exponentials to enable flexible k-space coverage and a low-rank spatiotemporal decomposition, achieving superior performance over compressed sensing, Gaussian primitives, and implicit neural representations while offering physically interpretable parameters.

Wenqi Huang, Veronika Spieker, Nil Stolt-Ansó, Natascha Niessen, Maik Dannecker, Sevgi Gokce Kafali, Sila Kurugol, Julia A. Schnabel, Daniel RueckertMon, 09 Ma💻 cs

Privacy-Preserving Collaborative Medical Image Segmentation Using Latent Transform Networks

This paper introduces PPCMI-SF, a privacy-preserving collaborative framework that utilizes client-specific latent transforms and server-side mapping to achieve high-accuracy, real-time medical image segmentation across heterogeneous institutions while effectively resisting inversion and membership inference attacks without sharing raw data.

Saheed Ademola Bello, Muhammad Shahid Jabbar, Muhammad Sohail Ibrahim, Shujaat KhanMon, 09 Ma💻 cs

Can we Trust Unreliable Voxels? Exploring 3D Semantic Occupancy Prediction under Label Noise

This paper introduces OccNL, the first benchmark for 3D semantic occupancy prediction under label noise, and proposes DPR-Occ, a novel framework that leverages dual-source partial label reasoning to achieve robust performance and prevent catastrophic collapse in noisy 3D voxel spaces where existing 2D noise-robust strategies fail.

Wenxin Li, Kunyu Peng, Di Wen, Junwei Zheng, Jiale Wei, Mengfei Duan, Yuheng Zhang, Rui Fan, Kailun YangMon, 09 Ma💻 cs

Clinical-Injection Transformer with Domain-Adapted MAE for Lupus Nephritis Prognosis Prediction

This paper proposes a novel multimodal framework, the Clinical-Injection Transformer with a domain-adapted MAE, which integrates routine PAS-stained histopathology images and clinical data to achieve high-accuracy three-class prognosis prediction for pediatric lupus nephritis, addressing previous limitations in data availability and modality integration.

Yuewen Huang, Zhitao Ye, Guangnan Feng, Fudan Zheng, Xia Gao, Yutong LuMon, 09 Ma🤖 cs.LG

AI End-to-End Radiation Treatment Planning Under One Second

The paper introduces AIRT, an end-to-end deep-learning framework that generates high-quality, deliverable single-arc VMAT prostate treatment plans in under one second directly from CT images and contours, demonstrating non-inferiority to standard clinical planning systems while significantly accelerating workflow efficiency.

Simon Arberet, Riqiang Gao, Martin Kraus, Florin C. Ghesu, Wilko Verbakel, Mamadou Diallo, Anthony Magliari, Venkatesan Karuppusamy, Sushil Beriwal, REQUITE Consortium, Ali Kamen, Dorin ComaniciuMon, 09 Ma🤖 cs.AI

Technical Report: Automated Optical Inspection of Surgical Instruments

This technical report details a collaboration with industry leaders in Pakistan's Sialkot surgical cluster to develop an Automated Optical Inspection system using deep learning models (YOLOv8, ResNet-152, and EfficientNet-b4) on a new dataset of 4,414 images to detect manufacturing defects in surgical instruments, thereby enhancing patient safety and manufacturing quality.

Zunaira Shafqat, Atif Aftab Ahmed Jilani, Qurrat Ul AinMon, 09 Ma🤖 cs.AI

Longitudinal Lesion Inpainting in Brain MRI via 3D Region Aware Diffusion

This paper introduces a novel pseudo-3D longitudinal inpainting framework based on Denoising Diffusion Probabilistic Models and Region-Aware Diffusion that significantly outperforms state-of-the-art baselines in perceptual fidelity, temporal stability, and processing speed for removing evolving lesions from brain MRI scans.

Zahra Karimaghaloo, Dumitru Fetco, Haz-Edine Assemlal, Hassan Rivaz, Douglas L. ArnoldMon, 09 Ma🤖 cs.AI

Learning Latent Transmission and Glare Maps for Lens Veiling Glare Removal

This paper proposes VeilGen, an unsupervised generative model that learns latent transmission and glare maps to synthesize realistic veiling glare datasets, and DeVeiler, a restoration network that leverages these maps to effectively remove veiling glare from simplified optical systems.

Xiaolong Qian, Qi Jiang, Lei Sun, Zongxi Yu, Kailun Yang, Peixuan Wu, Jiacheng Zhou, Yao Gao, Yaoguang Ma, Ming-Hsuan Yang, Kaiwei WangMon, 09 Ma🔬 physics.optics