Unsupervised Semantic Segmentation in Synchrotron Computed Tomography with Self-Correcting Pseudo Labels

This paper presents a novel unsupervised framework for segmenting large-scale synchrotron computed tomography datasets that generates initial pseudo labels via voxel clustering and refines them using an Unbiased Teacher approach, thereby eliminating the need for manual annotation while significantly improving segmentation accuracy.

Austin Yunker, Peter Kenesei, Hemant Sharma + 3 more2026-03-03💻 cs

DiffSOS: Acoustic Conditional Diffusion Model for Speed-of-Sound Reconstruction in Ultrasound Computed Tomography

DiffSOS is a novel acoustic conditional diffusion model that achieves high-fidelity, near real-time Speed-of-Sound reconstruction in Ultrasound Computed Tomography by leveraging a physics-grounded ControlNet and stochastic sampling to overcome the oversmoothing and computational limitations of existing methods while providing pixel-wise uncertainty estimates.

Yujia Wu, Shuoqi Chen, Shiru Wang + 3 more2026-03-03💻 cs

PointAlign: Feature-Level Alignment Regularization for 3D Vision-Language Models

To overcome the scarcity of 3D-text data and the resulting loss of geometric information in existing 3D Vision-Language Models, PointAlign introduces a lightweight feature-level alignment regularization that explicitly supervises intermediate point cloud tokens to preserve fine-grained 3D geometric-semantic details, significantly improving performance on classification and captioning tasks.

Yuanhao Su, Shaofeng Zhang, Xiaosong Jia + 1 more2026-03-03💻 cs

Improved Adversarial Diffusion Compression for Real-World Video Super-Resolution

This paper proposes an improved adversarial diffusion compression method that distills a heavy 3D diffusion Transformer into a lightweight 2D-based model with 1D temporal convolutions and a dual-head adversarial scheme, achieving a 95% reduction in parameters and 8×\times speedup while effectively balancing spatial detail and temporal consistency for real-world video super-resolution.

Bin Chen, Weiqi Li, Shijie Zhao + 4 more2026-03-03💻 cs

OPGAgent: An Agent for Auditable Dental Panoramic X-ray Interpretation

This paper introduces OPGAgent, a multi-tool agentic system that enhances the accuracy and audibility of dental panoramic X-ray interpretation by coordinating specialized perception modules through a hierarchical evidence gathering process and a consensus mechanism, while also proposing the OPG-Bench benchmark for comprehensive evaluation beyond standard VQA metrics.

Zhaolin Yu, Litao Yang, Ben Babicka + 7 more2026-03-03🤖 cs.AI