Factuality Matters: When Image Generation and Editing Meet Structured Visuals
This paper addresses the limitations of current image generation models in handling structured visuals by introducing a comprehensive framework that includes a 1.3-million-pair dataset, a unified VLM-FLUX.1 model trained with a three-stage curriculum and external reasoning, and the StructBench benchmark with StructScore metric to evaluate and improve factual fidelity in chart and diagram generation and editing.