Factuality Matters: When Image Generation and Editing Meet Structured Visuals

This paper addresses the limitations of current image generation models in handling structured visuals by introducing a comprehensive framework that includes a 1.3-million-pair dataset, a unified VLM-FLUX.1 model trained with a three-stage curriculum and external reasoning, and the StructBench benchmark with StructScore metric to evaluate and improve factual fidelity in chart and diagram generation and editing.

Le Zhuo, Songhao Han, Yuandong Pu + 8 more2026-03-05💻 cs

A Geometry-Based View of Mahalanobis OOD Detection

This paper reveals that the reliability of Mahalanobis-based out-of-distribution detection is highly dependent on the geometric properties of the feature space, specifically within-class spectral structure and local intrinsic dimensionality, and proposes a radially scaled 2\ell_2 normalization method that dynamically adjusts feature radii to optimize detection performance based on these geometric signals.

Denis Janiak, Jakub Binkowski, Tomasz Kajdanowicz2026-03-05🤖 cs.LG