Black Box Meta-Learning Intrinsic Rewards
This paper introduces a black-box meta-learning approach that optimizes intrinsic rewards to enhance data efficiency and generalization in sparse-reward continuous control environments, demonstrating its effectiveness compared to extrinsic rewards and meta-learned advantage functions.