PointAlign: Feature-Level Alignment Regularization for 3D Vision-Language Models

To overcome the scarcity of 3D-text data and the resulting loss of geometric information in existing 3D Vision-Language Models, PointAlign introduces a lightweight feature-level alignment regularization that explicitly supervises intermediate point cloud tokens to preserve fine-grained 3D geometric-semantic details, significantly improving performance on classification and captioning tasks.

Yuanhao Su, Shaofeng Zhang, Xiaosong Jia + 1 more2026-03-03💻 cs

Improved Adversarial Diffusion Compression for Real-World Video Super-Resolution

This paper proposes an improved adversarial diffusion compression method that distills a heavy 3D diffusion Transformer into a lightweight 2D-based model with 1D temporal convolutions and a dual-head adversarial scheme, achieving a 95% reduction in parameters and 8×\times speedup while effectively balancing spatial detail and temporal consistency for real-world video super-resolution.

Bin Chen, Weiqi Li, Shijie Zhao + 4 more2026-03-03💻 cs

OPGAgent: An Agent for Auditable Dental Panoramic X-ray Interpretation

This paper introduces OPGAgent, a multi-tool agentic system that enhances the accuracy and audibility of dental panoramic X-ray interpretation by coordinating specialized perception modules through a hierarchical evidence gathering process and a consensus mechanism, while also proposing the OPG-Bench benchmark for comprehensive evaluation beyond standard VQA metrics.

Zhaolin Yu, Litao Yang, Ben Babicka + 7 more2026-03-03🤖 cs.AI

Benchmarking Few-shot Transferability of Pre-trained Models with Improved Evaluation Protocols

This paper introduces FEWTRANS, a comprehensive benchmark and the Hyperparameter Ensemble (HPE) evaluation protocol to rigorously assess few-shot transfer learning, revealing that pre-trained model selection and full-parameter fine-tuning often outperform sophisticated adaptation methods due to their ability to make distributed micro-adjustments without overfitting.

Xu Luo, Ji Zhang, Lianli Gao + 2 more2026-03-03🤖 cs.LG