Deep reinforcement learning with spatial and temporal awareness for active boundary control of buoyancy-driven convection
This paper proposes a deep reinforcement learning framework that overcomes the degenerate actuation issues of prior methods by integrating convolutional networks, recurrent memory, off-policy training, and action-smoothness constraints, successfully achieving significant heat transfer reduction in Rayleigh–Bénard convection and adaptive mixing enhancement in double-diffusive convection without requiring full-field data augmentation.