Mask-Guided Attention Regulation for Anatomically Consistent Counterfactual CXR Synthesis

This paper proposes an inference-time attention regulation framework that utilizes anatomy-aware gating and pathology-guided latent corrections to achieve anatomically consistent and precisely localized counterfactual chest X-ray synthesis, effectively overcoming the structural drift and unstable pathology expression issues common in standard diffusion-based editing methods.

Zichun Zhang, Weizhi Nie, Honglin Guo + 1 more2026-03-05💻 cs

HBRB-BoW: A Retrained Bag-of-Words Vocabulary for ORB-SLAM via Hierarchical BRB-KMeans

This paper proposes HBRB-BoW, a refined hierarchical training algorithm that integrates global real-valued flows to preserve high-fidelity descriptor information before final binarization, thereby overcoming the precision loss of traditional binary clustering and significantly enhancing the discriminative power and performance of ORB-SLAM in loop closing and relocalization tasks.

Minjae Lee, Sang-Min Choi, Gun-Woo Kim + 1 more2026-03-05💻 cs

LISTA-Transformer Model Based on Sparse Coding and Attention Mechanism and Its Application in Fault Diagnosis

This paper proposes a LISTA-Transformer model that integrates Learnable Iterative Shrinkage Threshold Algorithm-based sparse coding with the Transformer architecture to overcome the limitations of CNNs and standard Transformers in local and global feature modeling, achieving a 98.5% fault recognition rate on the CWRU dataset through time-frequency signal analysis.

Shuang Liu, Lina Zhao, Tian Wang + 1 more2026-03-05💻 cs

Beyond Mixtures and Products for Ensemble Aggregation: A Likelihood Perspective on Generalized Means

This paper establishes a principled theoretical framework for density aggregation by demonstrating that normalized generalized means with order r[0,1]r \in [0,1] are the only rules guaranteeing systematic improvements in log-likelihood over individual distributions, thereby providing a unified justification for the widespread use of linear and geometric pooling in Deep Ensembles.

Raphaël Razafindralambo, Rémy Sun, Frédéric Precioso + 2 more2026-03-05🤖 cs.LG

A multi-center analysis of deep learning methods for video polyp detection and segmentation

This multi-center study evaluates deep learning methods for real-time video polyp detection and segmentation, demonstrating that integrating sequence data and temporal information significantly enhances diagnostic precision by addressing the challenges of variable polyp appearance and reducing missed detection rates in colonoscopy.

Noha Ghatwary, Pedro Chavarias Solano, Mohamed Ramzy Ibrahim + 24 more2026-03-05💻 cs

CubeComposer: Spatio-Temporal Autoregressive 4K 360° Video Generation from Perspective Video

CubeComposer is a novel spatio-temporal autoregressive diffusion model that overcomes the computational limitations of existing methods to natively generate high-quality, seam-free 4K-resolution 360° videos from perspective inputs by decomposing them into cubemap representations and employing efficient context management and continuity-aware techniques.

Lingen Li, Guangzhi Wang, Xiaoyu Li + 5 more2026-03-05🤖 cs.AI