HeroGS: Hierarchical Guidance for Robust 3D Gaussian Splatting under Sparse Views

HeroGS is a unified framework that enhances robust 3D Gaussian Splatting under sparse-view conditions by employing a hierarchical guidance strategy across image, feature, and parameter levels to regularize Gaussian distributions, refine high-frequency details, and ensure geometric consistency, thereby achieving superior reconstruction fidelity compared to state-of-the-art methods.

Jiashu Li, Xumeng Han, Zhaoyang Wei + 5 more2026-03-04💻 cs

Continuous Exposure-Time Modeling for Realistic Atmospheric Turbulence Synthesis

This paper introduces ET-Turb, a large-scale synthetic dataset and a novel exposure-time-dependent modulation transfer function (ET-MTF) framework that models atmospheric turbulence blur as a continuous function of exposure time, thereby enabling more realistic turbulence synthesis and significantly improving the generalization of vision models on real-world data compared to existing methods.

Junwei Zeng, Dong Liang, Sheng-Jun Huang + 2 more2026-03-04💻 cs

InterCoG: Towards Spatially Precise Image Editing with Interleaved Chain-of-Grounding Reasoning

This paper presents InterCoG, a novel text-vision interleaved chain-of-grounding reasoning framework that enhances fine-grained image editing in complex multi-entity scenes by explicitly deducing target locations through text-based spatial reasoning before performing visual grounding and outcome specification, supported by a new dataset and auxiliary training modules to ensure spatial precision.

Yecong Wan, Fan Li, Chunwei Wang + 3 more2026-03-04💻 cs

Forecasting as Rendering: A 2D Gaussian Splatting Framework for Time Series Forecasting

This paper introduces TimeGS, a novel time series forecasting framework that reframes prediction as 2D generative rendering by leveraging adaptive Gaussian kernels and a chronologically continuous rasterization mechanism to overcome the topological mismatches and resolution inefficiencies of existing 2D reshaping methods, thereby achieving state-of-the-art performance.

Yixin Wang, Yifan Hu, Peiyuan Liu + 3 more2026-03-04🤖 cs.AI

From Visual to Multimodal: Systematic Ablation of Encoders and Fusion Strategies in Animal Identification

This study presents a multimodal animal identification framework that leverages a massive dataset of 1.9 million images and synthetic textual descriptions to achieve an 84.28% Top-1 accuracy, representing an 11% improvement over unimodal baselines through systematic ablation of encoders and an optimal gated fusion strategy.

Vasiliy Kudryavtsev, Kirill Borodin, German Berezin + 3 more2026-03-04💻 cs

Beyond Prompt Degradation: Prototype-guided Dual-pool Prompting for Incremental Object Detection

This paper proposes PDP, a novel prompt-decoupled framework for Incremental Object Detection that utilizes a dual-pool prompting paradigm to separate task-general and task-specific knowledge while employing a prototypical pseudo-label generation module to mitigate prompt drift, thereby achieving state-of-the-art performance on MS-COCO and PASCAL VOC benchmarks.

Yaoteng Zhang, Zhou Qing, Junyu Gao + 1 more2026-03-04🤖 cs.AI

Loss Design and Architecture Selection for Long-Tailed Multi-Label Chest X-Ray Classification

This paper presents a systematic evaluation of loss functions, architectures, and post-training strategies for long-tailed multi-label chest X-ray classification on the CXR-LT 2026 benchmark, demonstrating that LDAM-DRW combined with a ConvNeXt-Large backbone and classifier re-training achieves a top-5 ranking with 0.3950 mAP while offering practical insights into the development-to-test performance gap.

Nikhileswara Rao Sulake2026-03-04⚡ eess