Explaining, Verifying, and Aligning Semantic Hierarchies in Vision-Language Model Embeddings
This paper introduces a post-hoc framework for explaining, verifying, and aligning the semantic hierarchies encoded in vision-language model embeddings. The analysis reveals that while image encoders offer superior discriminative power, text encoders align better with human taxonomies, exposing a trade-off between zero-shot accuracy and ontological plausibility.