Graph-GRPO: Training Graph Flow Models with Reinforcement Learning
This paper introduces Graph-GRPO, an online reinforcement learning framework that enhances Graph Flow Models through analytical transition probabilities and a localized refinement strategy, achieving state-of-the-art performance in graph generation and molecular optimization tasks.