HSG-12M: A Large-Scale Benchmark of Spatial Multigraphs from the Energy Spectra of Non-Hermitian Crystals

This paper introduces Poly2Graph, an automated pipeline for generating HSG-12M, a pioneering 16.7-million-scale dataset of spatial multigraphs derived from non-Hermitian crystal energy spectra, which bridges condensed matter physics and geometry-aware graph learning by preserving vital geometric information often discarded in existing benchmarks.

Xianquan Yan, Hakan Akgün, Kenji Kawaguchi + 2 more2026-03-06🔬 cond-mat.mes-hall

EDINET-Bench: Evaluating LLMs on Complex Financial Tasks using Japanese Financial Statements

This paper introduces EDINET-Bench, a challenging open-source benchmark derived from ten years of Japanese financial reports to evaluate LLMs on complex tasks like fraud detection and earnings forecasting, revealing that current models struggle significantly without specialized scaffolding and highlighting the need for more realistic evaluation frameworks.

Issa Sugiura, Takashi Ishida, Taro Makino + 4 more2026-03-06💻 cs

From Bandit Regret to FDR Control: Online Selective Generation with Adversarial Feedback Unlocking

This paper proposes ExSUL, a novel online learning framework that enables selective generation for large language models to robustly control the False Discovery Rate (FDR) and achieve optimal regret bounds in non-stationary and adversarial environments by converting bandit regret into FDR guarantees and unlocking additional learning signals from partial user feedback.

Minjae Lee, Yoonjae Jung, Sangdon Park2026-03-06💻 cs

Structured Kolmogorov-Arnold Neural ODEs for Interpretable Learning and Symbolic Discovery of Nonlinear Dynamics

This paper introduces Structured Kolmogorov-Arnold Neural ODEs (SKANODEs), a framework that combines structured state-space modeling with Kolmogorov-Arnold Networks to accurately recover interpretable physical latent states and discover compact symbolic governing equations for nonlinear dynamical systems, outperforming black-box neural ODEs and classical identification methods across synthetic and real-world datasets.

Wei Liu, Kiran Bacsa, Loon Ching Tang + 1 more2026-03-06🔬 physics

Learning Physical Systems: Symplectification via Gauge Fixing in Dirac Structures

This paper introduces Presymplectification Networks (PSNs), a novel framework that restores non-degenerate symplectic geometry for constrained and dissipative mechanical systems by learning a symplectification lift via Dirac structures, thereby enabling accurate, structure-preserving long-term prediction of complex multibody dynamics like those of the ANYmal quadruped robot.

Aristotelis Papatheodorou, Pranav Vaidhyanathan, Natalia Ares + 1 more2026-03-06💻 cs

MuRating: A High Quality Data Selecting Approach to Multilingual Large Language Model Pretraining

MuRating is a scalable framework that transfers high-quality English data-quality signals to a unified multilingual evaluator via pairwise comparisons and translation, enabling the selection of balanced, high-quality datasets that significantly improve the performance of multilingual large language models on both English and non-English benchmarks.

Zhixun Chen, Ping Guo, Wenhan Han + 10 more2026-03-06💻 cs

Some Super-approximation Rates of ReLU Neural Networks for Korobov Functions

This paper establishes nearly optimal super-approximation error bounds of order 2m2m and 2m22m-2 in LpL_p and Wp1W^1_p norms, respectively, for ReLU neural networks approximating Korobov functions by leveraging sparse grid finite elements and bit extraction, thereby demonstrating that neural network expressivity effectively overcomes the curse of dimensionality.

Yuwen Li, Guozhi Zhang2026-03-06💻 cs

Kernel Based Maximum Entropy Inverse Reinforcement Learning for Mean-Field Games

This paper proposes a kernel-based maximum causal entropy inverse reinforcement learning framework for infinite-horizon stationary mean-field games that models unknown rewards in a reproducing kernel Hilbert space to capture nonlinear structures, proves the algorithm's theoretical consistency via Fréchet differentiability, and demonstrates superior policy recovery performance over linear baselines in traffic routing scenarios while extending the approach to finite-horizon non-stationary settings.

Berkay Anahtarci, Can Deha Kariksiz, Naci Saldi2026-03-06🔢 math