VocSegMRI: Multimodal Learning for Precise Vocal Tract Segmentation in Real-time MRI

The paper introduces VocSegMRI, a multimodal framework that leverages cross-attention fusion and contrastive learning to integrate video, audio, and phonological signals, achieving state-of-the-art vocal tract segmentation in real-time MRI with a Dice score of 0.95 and robust performance even when audio is unavailable.

Daiqi Liu, Tomás Arias-Vergara, Johannes Enk, Fangxu Xing, Maureen Stone, Jerry L. Prince, Jana Hutter, Andreas Maier, Jonghye Woo, Paula Andrea Pérez-Toro
Wed, 11 Ma 💻 cs

Floating-Point Usage on GitHub: A Large-Scale Study of Statically Typed Languages

This paper presents the first large-scale empirical study of floating-point arithmetic usage in statically typed languages across millions of GitHub repositories, revealing that while existing benchmarks are partially representative, they do not fully capture real-world code patterns, and releasing a dataset of 10 million extracted functions to guide future reasoning techniques.

Andrea Gilot, Tobias Wrigstad, Eva Darulova
Wed, 11 Ma 💻 cs

CoRe-GS: Coarse-to-Refined Gaussian Splatting with Semantic Object Focus

CoRe-GS is a coarse-to-refined Gaussian Splatting framework that accelerates 3D reconstruction for robotic applications by selectively optimizing only task-relevant points of interest, significantly reducing training time and mitigating artifacts while maintaining high-quality semantic segmentation.

Hannah Schieber, Dominik Frischmann, Victor Schaack, Simon Boche, Angela Schoellig, Stefan Leutenegger, Daniel Roth
Wed, 11 Ma 💻 cs

Improving Large Vision-Language Models' Understanding for Flow Field Data

This paper introduces FieldLVLM, a novel framework that enhances Large Vision-Language Models' ability to interpret complex scientific field data by combining a specialized pipeline for extracting physical features into structured text with a data-compressed tuning strategy, resulting in superior performance on scientific benchmarks.

Xiaomei Zhang, Hanyu Zheng, Xiangyu Zhu, Jinghuan Wei, Junhong Zou, Zhen Lei, Zhaoxiang Zhang
Wed, 11 Ma 💻 cs

A Decade of News Forum Interactions: Threaded Conversations, Signed Votes, and Topical Tags

This paper introduces a large-scale, privacy-preserving dataset of ten years of user interactions on the Austrian newspaper DerStandard, comprising over 75 million comments and 400 million votes with anonymized identifiers and pre-computed vector embeddings to facilitate research on online discourse dynamics in the German language.

Emma Fraxanet, Vicenç Gómez, Andreas Kaltenbrunner, Max Pellert
Wed, 11 Ma 💻 cs

SpikeSMOKE: Spiking Neural Networks for Monocular 3D Object Detection with Cross-Scale Gated Coding

This paper proposes SpikeSMOKE, a low-power monocular 3D object detection framework based on Spiking Neural Networks. It introduces a Cross-Scale Gated Coding mechanism and a lightweight residual block to overcome information loss and computational inefficiency, achieving superior performance on KITTI and other datasets while significantly reducing energy consumption and model complexity compared to traditional ANN-based approaches.

Xuemei Chen, Huamin Wang, Jing Peng, Hangchi Shen, Shukai Duan, Shiping Wen, Tingwen Huang
Wed, 11 Ma 💻 cs

Evaluating Large Language Models for Multilingual Vulnerability Detection at Dual Granularities

This paper presents a comprehensive empirical study evaluating state-of-the-art pre-trained and large language models for multilingual vulnerability detection across seven programming languages at both function and line levels, revealing that instruction-tuned GPT-4o significantly outperforms other models, particularly in identifying high-severity and unique multilingual vulnerabilities.

Honglin Shu, Michael Fu, Junji Yu, Dong Wang, Chakkrit Tantithamthavorn, Junjie Chen, Yasutaka Kamei
Wed, 11 Ma 💻 cs

M4-SAR: A Multi-Resolution, Multi-Polarization, Multi-Scene, Multi-Source Dataset and Benchmark for Optical-SAR Object Detection

This paper introduces M4-SAR, a large-scale, multi-resolution, multi-polarization, and multi-source dataset with nearly one million labeled instances, alongside a unified benchmarking toolkit and a novel end-to-end fusion framework (E2E-OSDet) that collectively advance optical-SAR object detection by demonstrating significant performance gains over single-source methods in complex environments.

Chao Wang, Wei Lu, Xiang Li, Jian Yang, Lei Luo
Wed, 11 Ma 💻 cs

Scalable and Performant Data Loading

This paper introduces SPDL, an open-source, framework-agnostic library that significantly accelerates GPU data loading by leveraging concurrent thread pool execution with GIL release, achieving up to 74% faster iteration and reduced resource usage compared to PyTorch DataLoader while demonstrating further performance gains with Free-Threaded Python.

Moto Hira, Christian Puhrsch, Valentin Andrei, Roman Malinovskyy, Gael Le Lan, Abhinandan Krishnan, Joseph Cummings, Victor Bourgin, Olga Gerasimova, Miguel Martin, Gokul Gunasekaran, Yuta Inoue, Alex J Turner, Raghuraman Krishnamoorthi
Wed, 11 Ma 💻 cs

Zooming In on Fakes: A Novel Dataset for Localized AI-Generated Image Detection with Forgery Amplification Approach

This paper introduces BR-Gen, a large-scale dataset of 150,000 locally forged images with diverse scene-aware annotations, and proposes NFA-ViT, a noise-guided Vision Transformer that amplifies subtle forgery traces to significantly improve the detection and generalization of localized AI-generated image forgeries.

Lvpan Cai, Haowei Wang, Jiayi Ji, Yanshu Zhoumen, Shen Chen, Taiping Yao, Xiaoshuai Sun
Wed, 11 Ma 💻 cs

Modeling Concurrency Control as a Learnable Function

This paper introduces NeurCC, a novel learned concurrency control algorithm that utilizes Bayesian optimization and a graph reduction search to efficiently learn a high-performance function mapping database states to control actions, thereby consistently outperforming state-of-the-art algorithms across diverse and dynamic workloads.

Hexiang Pan, Shaofeng Cai, Tien Tuan Anh Dinh, Yuncheng Wu, Yeow Meng Chee, Gang Chen, Beng Chin Ooi
Wed, 11 Ma 💻 cs