Explaining, Verifying, and Aligning Semantic Hierarchies in Vision-Language Model Embeddings

This paper introduces a post-hoc framework to explain, verify, and align the semantic hierarchies in vision-language model embeddings, revealing that while image encoders offer superior discriminative power, text encoders better align with human taxonomies, highlighting a trade-off between zero-shot accuracy and ontological plausibility.

Gesina Schwalbe, Mert Keser, Moritz Bayerkuhnlein, Edgar Heinert, Annika Mütze, Marvin Keller, Sparsh Tiwari, Georgii Mikriukov, Diedrich Wolter, Jae Hee Lee, Matthias Rottmann2026-03-31🤖 cs.LG

DSO: Dual-Scale Neural Operators for Stable Long-term Fluid Dynamics Forecasting

The paper proposes the Dual-Scale Neural Operator (DSO), a novel architecture that decouples local feature extraction and global trend aggregation to effectively address the long-term stability and precision challenges in fluid dynamics forecasting, achieving state-of-the-art results with over 88% error reduction compared to existing methods.

Huanshuo Dong, Hao Wu, Hong Wang, Qin-Yi Zhang, Zhezheng Hao2026-03-31🤖 cs.LG

Resolving the Robustness-Precision Trade-off in Financial RAG through Hybrid Document-Routed Retrieval

This paper proposes Hybrid Document-Routed Retrieval (HDRR), a two-stage architecture that resolves the robustness-precision trade-off in financial RAG systems by combining Semantic File Routing for document filtering with chunk-based retrieval, thereby achieving superior performance in accuracy, failure reduction, and perfect-answer rates on the FinDER benchmark.

Zhiyuan Cheng, Longying Lai, Yue Liu2026-03-31💬 cs.CL

PiCSRL: Physics-Informed Contextual Spectral Reinforcement Learning

The paper introduces PiCSRL, a physics-informed contextual spectral reinforcement learning framework that leverages domain knowledge and uncertainty-aware belief models to enable sample-efficient adaptive sensing and optimal station selection in high-dimensional, low-sample-size Earth observation tasks, as demonstrated by its superior performance in mapping cyanobacterial concentrations in Lake Erie using NASA PACE hyperspectral imagery.

Mitra Nasr Azadani, Syed Usama Imtiaz, Nasrin Alamdari2026-03-31🤖 cs.LG

Throughput Optimization as a Strategic Lever in Large-Scale AI Systems: Evidence from Dataloader and Memory Profiling Innovations

This paper argues that throughput optimization has evolved into a critical strategic lever for large-scale AI systems, demonstrating through evidence from dataloader frameworks like OVERLORD, memory techniques such as ZeRO-Offload, and compiler-centric tools like Triton-distributed that a holistic, system-level approach is essential to overcome computational bottlenecks and accelerate the development of next-generation foundation models.

Mayank Jha2026-03-31🤖 cs.LG