Decoupling Task and Behavior: A Two-Stage Reward Curriculum in Reinforcement Learning for Robotics
This paper proposes a two-stage reward curriculum that decouples task-specific objectives from behavioral terms to improve exploration and training stability in robotic reinforcement learning, demonstrating superior performance and robustness across multiple environments compared to direct full-reward training.