Reasoning as Representation: Rethinking Visual Reinforcement Learning in Image Quality Assessment

This paper shows that reinforcement learning-based image quality assessment models generalize well because they convert visual data into compact text representations. Building on this finding, it proposes RALI, a lightweight algorithm that aligns images directly with these representations and achieves comparable performance at significantly lower computational cost.

Shijie Zhao, Xuanyu Zhang, Weiqi Li + 4 more · 2026-03-04 · 💻 cs

CASR-Net: An Image Processing-focused Deep Learning-based Coronary Artery Segmentation and Refinement Network for X-ray Coronary Angiogram

This paper introduces CASR-Net, a three-stage deep learning pipeline that combines a novel multichannel preprocessing strategy with a Self-ONN-based UNet architecture. It achieves state-of-the-art coronary artery segmentation and refinement on X-ray angiograms, which in turn improves the accuracy of coronary artery disease diagnosis.

Alvee Hassan, Rusab Sarmun, Muhammad E. H. Chowdhury + 4 more · 2026-03-04 · 🤖 cs.AI

PrismAudio: Decomposed Chain-of-Thoughts and Multi-dimensional Rewards for Video-to-Audio Generation

PrismAudio is a video-to-audio generation framework that addresses objective entanglement and human preference alignment. It combines a decomposed Chain-of-Thought reasoning structure with multi-dimensional rewards and a computationally efficient Fast-GRPO algorithm, achieving state-of-the-art performance across semantic, temporal, aesthetic, and spatial dimensions.

Huadai Liu, Kaicheng Luo, Wen Wang + 6 more · 2026-03-04 · ⚡ eess