Factuality Matters: When Image Generation and Editing Meet Structured Visuals

This paper addresses the limitations of current image generation models in handling structured visuals by introducing a comprehensive framework that includes a 1.3-million-pair dataset, a unified VLM-FLUX.1 model trained with a three-stage curriculum and external reasoning, and the StructBench benchmark with StructScore metric to evaluate and improve factual fidelity in chart and diagram generation and editing.

Le Zhuo, Songhao Han, Yuandong Pu + 8 more2026-03-05💻 cs

A Geometry-Based View of Mahalanobis OOD Detection

This paper reveals that the reliability of Mahalanobis-based out-of-distribution detection is highly dependent on the geometric properties of the feature space, specifically within-class spectral structure and local intrinsic dimensionality, and proposes a radially scaled 2\ell_2 normalization method that dynamically adjusts feature radii to optimize detection performance based on these geometric signals.

Denis Janiak, Jakub Binkowski, Tomasz Kajdanowicz2026-03-05🤖 cs.LG

Weakly Supervised Concept Learning with Class-Level Priors for Interpretable Medical Diagnosis

This paper introduces Prior-guided Concept Predictor (PCP), a weakly supervised framework that leverages class-level concept priors and regularization to enable reliable, interpretable medical diagnosis without costly concept annotations, significantly outperforming zero-shot baselines while matching fully supervised models.

Md Nahiduzzaman, Steven Korevaar, Alireza Bab-Hadiashar + 1 more2026-03-05💻 cs