TY - GEN
T1 - Optimal Energy Scheduling for Microgrids with a Hybrid Battery-Hydrogen Storage System Based on Reinforcement Learning
AU - Jia, Jinhua
AU - Cui, Feifei
AU - An, Dou
N1 - Publisher Copyright:
© 2024 IEEE.
PY - 2024
Y1 - 2024
N2 - This paper explores the integration of battery and hydrogen storage in a Microgrid (MG), combining the high-power capabilities of battery with the high-capacity characteristics of hydrogen storage to manage instantaneous load variations and balance peak-valley energy differences. The scheduling method employs the Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm, recognizing the complexity of the integrated energy system. 24-hour simulations are conducted for representative days across the four seasons - spring, summer, autumn, and winter. The TD3 algorithm minimizes fluctuations in power exchanges with the main grid by effectively synchronizing the operations of battery and Hydrogen Storage System (HSS), thus enhancing grid stability. It learns the patterns between load demands and renewable energy generation, favoring battery charging and hydrogen production during low demand periods and discharging or utilizing hydrogen for electricity during peak demand. This strategy promotes more effective utilization of renewable resources. Additionally, a comparison with the Deep Deterministic Policy Gradient (DDPG) algorithm shows that while the cost-related performance for both TD3 and DDPG is nearly identical, TD3 demonstrates significantly better stability, with standard deviations consistently lower than those of DDPG. In particular, under winter conditions, the standard deviation of TD3 is only 45.5% of that of DDPG.
AB - This paper explores the integration of battery and hydrogen storage in a Microgrid (MG), combining the high-power capabilities of battery with the high-capacity characteristics of hydrogen storage to manage instantaneous load variations and balance peak-valley energy differences. The scheduling method employs the Twin Delayed Deep Deterministic Policy Gradient (TD3) algorithm, recognizing the complexity of the integrated energy system. 24-hour simulations are conducted for representative days across the four seasons - spring, summer, autumn, and winter. The TD3 algorithm minimizes fluctuations in power exchanges with the main grid by effectively synchronizing the operations of battery and Hydrogen Storage System (HSS), thus enhancing grid stability. It learns the patterns between load demands and renewable energy generation, favoring battery charging and hydrogen production during low demand periods and discharging or utilizing hydrogen for electricity during peak demand. This strategy promotes more effective utilization of renewable resources. Additionally, a comparison with the Deep Deterministic Policy Gradient (DDPG) algorithm shows that while the cost-related performance for both TD3 and DDPG is nearly identical, TD3 demonstrates significantly better stability, with standard deviations consistently lower than those of DDPG. In particular, under winter conditions, the standard deviation of TD3 is only 45.5% of that of DDPG.
KW - TD3 algorithm
KW - battery
KW - hydrogen storage system
KW - microgrids
UR - https://www.scopus.com/pages/publications/85207451081
U2 - 10.1109/CCSSTA62096.2024.10691805
DO - 10.1109/CCSSTA62096.2024.10691805
M3 - 会议稿件
AN - SCOPUS:85207451081
T3 - Proceedings of 2024 IEEE 25th China Conference on System Simulation Technology and its Application, CCSSTA 2024
SP - 845
EP - 850
BT - Proceedings of 2024 IEEE 25th China Conference on System Simulation Technology and its Application, CCSSTA 2024
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 25th IEEE China Conference on System Simulation Technology and its Application, CCSSTA 2024
Y2 - 21 July 2024 through 23 July 2024
ER -