Hybrid Energy-Aware Reward Shaping: A Unified Lightweight Physics-Guided Methodology for Policy Optimization
본 논문은 완전한 동역학 모델 없이도 작업 성능과 에너지 효율을 동시에 최적화하며 선형 복잡도로 수렴을 보장하는 경량 물리 기반 강화학습 방법론인 '하이브리드 에너지 인식 보상 형성 (H-EARS)'을 제안하고 실험을 통해 그 유효성을 입증합니다.
Qijun Liao (School of Mechanical Engineering, University of Science and Technology Beijing, China), Jue Yang (School of Mechanical Engineering, University of Science and Technology Beijing, China), Yiting Kang (School of Mechanical Engineering, University of Science and Technology Beijing, China), Xinxin Zhao (School of Mechanical Engineering, University of Science and Technology Beijing, China), Yong Zhang (Jiangsu XCMG Construction Machinery Research Institute Co., Ltd., China), Mingan Zhao (Jiangsu XCMG Construction Machinery Research Institute Co., Ltd., China)2026-03-13🤖 cs.LG