Axial-Centric Cross-Plane Attention for 3D Medical Image Classification

This paper proposes an axial-centric cross-plane attention architecture that leverages a frozen MedDINOv3 foundation model and directional transformer encoders to align 3D medical image classification with clinical workflows, demonstrating superior performance over existing methods by prioritizing the axial plane while integrating complementary coronal and sagittal information.

Doyoung Park, Jinsoo Kim, Lohendran Baskaran2026-02-26💻 cs

HybridINR-PCGC: Hybrid Lossless Point Cloud Geometry Compression Bridging Pretrained Model and Implicit Neural Representation

HybridINR-PCGC is a novel point cloud geometry compression framework that bridges pretrained models and implicit neural representations by utilizing a Pretrained Prior Network to accelerate the convergence of a Distribution Agnostic Refiner, thereby achieving superior compression rates and encoding efficiency while mitigating the limitations of data dependency and bitstream overhead.

Wenjie Huang, Qi Yang, Shuting Xia + 3 more2026-02-26💻 cs

Learning spatially adaptive sparsity level maps for arbitrary convolutional dictionaries

This paper presents an enhanced image reconstruction method that embeds neural network-inferred spatially adaptive sparsity maps into a model-based convolutional dictionary framework, achieving filter-permutation invariance, inference-time dictionary flexibility, and improved robustness to data distribution shifts compared to purely black-box deep learning approaches.

Joshua Schulz, David Schote, Christoph Kolbitsch + 2 more2026-02-26⚡ eess

Assessing airborne laser scanning and aerial photogrammetry for deep learning-based stand delineation

This study demonstrates that deep learning-based forest stand delineation achieves comparable accuracy using temporally aligned digital photogrammetry-derived canopy height models and digital terrain models as it does with airborne laser scanning data, suggesting that large-scale, consistent datasets can be assembled without relying on the more complex and temporally misaligned ALS data.

Håkon Næss Sandum, Hans Ole Ørka, Oliver Tomic + 1 more2026-02-26💻 cs

TranX-Adapter: Bridging Artifacts and Semantics within MLLMs for Robust AI-generated Image Detection

To address the attention dilution caused by high intra-feature similarity in artifact detection, the paper proposes TranX-Adapter, a lightweight fusion module that integrates Task-aware Optimal-Transport and X-Fusion mechanisms to effectively combine semantic and artifact features within MLLMs, significantly boosting AI-generated image detection accuracy.

Wenbin Wang, Yuge Huang, Jianqing Xu + 5 more2026-02-26💻 cs

SigVLP: Sigmoid Volume-Language Pre-Training for Self-Supervised CT-Volume Adaptive Representation Learning

SigVLP introduces a self-supervised vision-language pre-training framework for CT volumes that utilizes Rotary Position Embeddings to handle variable input sizes without information loss and employs chunkwise volume-text alignment for finer-grained, more precise representation learning across diverse downstream medical imaging tasks.

Jiayi Wang, Hadrien Reynaud, Ibrahim Ethem Hamamci + 4 more2026-02-26💻 cs