SafeGen-LLM: Enhancing Safety Generalization in Task Planning for Robotic Systems
This paper introduces SafeGen-LLM, a two-stage post-training framework combining supervised fine-tuning and GRPO with formal verification rewards to enhance the safety satisfaction and generalization of robotic task planning across diverse domains and input formats.