Boosting deep Reinforcement Learning using pretraining with Logical Options
This paper proposes Hybrid Hierarchical RL (H^2RL), a two-stage framework that leverages logical option-based pretraining to inject symbolic structure into deep reinforcement learning agents, effectively mitigating reward misalignment and improving long-horizon decision-making while outperforming existing neural, symbolic, and neuro-symbolic baselines.