TY - GEN
T1 - Multi-Agent Q-Value Mixing Network with Covariance Matrix Adaptation Strategy for the Voltage Regulation Problem
AU - Wang, Yiwen
AU - Zhang, Senlin
AU - Liu, Meiqin
AU - Dong, Shanling
AU - Zheng, Ronghao
N1 - Publisher Copyright:
© 2023 Technical Committee on Control Theory, Chinese Association of Automation.
PY - 2023
Y1 - 2023
N2 - Control and optimization of power systems typically involves schemes that utilize optimal power flow techniques and comprehensive modeling of various electrical components. However, the extensive integration of renewable energy sources and distributed energy resources makes it difficult to obtain accurate models, making the traditional model-based approaches more challenging. In this paper, we propose a Covariance Matrix Adaptation Q-value network mixing method (CMAQMIX), which is a novel model-free multi-agent reinforcement learning method that combines the advantages of Covariance Matrix Adaptation Evolution Strategy and the Q-value network mixing method to solve the problem of continuous action space. We establish a new multi-agent voltage regulation environment based on CityLearn framework to test the CMAQMIX method. The results show that our proposed method outperforms the Independent Proximal Policy Optimization method and can give immediate action response without the complete domain knowledge of the power system.
AB - Control and optimization of power systems typically involves schemes that utilize optimal power flow techniques and comprehensive modeling of various electrical components. However, the extensive integration of renewable energy sources and distributed energy resources makes it difficult to obtain accurate models, making the traditional model-based approaches more challenging. In this paper, we propose a Covariance Matrix Adaptation Q-value network mixing method (CMAQMIX), which is a novel model-free multi-agent reinforcement learning method that combines the advantages of Covariance Matrix Adaptation Evolution Strategy and the Q-value network mixing method to solve the problem of continuous action space. We establish a new multi-agent voltage regulation environment based on CityLearn framework to test the CMAQMIX method. The results show that our proposed method outperforms the Independent Proximal Policy Optimization method and can give immediate action response without the complete domain knowledge of the power system.
KW - Battery energy storage system
KW - Demand Response
KW - Evolution Strategy
KW - Heat and Air Conditioning
KW - Multi-Agent Reinforcement Learning
KW - Voltage Regulation
UR - https://www.scopus.com/pages/publications/85175524059
U2 - 10.23919/CCC58697.2023.10240322
DO - 10.23919/CCC58697.2023.10240322
M3 - 会议稿件
AN - SCOPUS:85175524059
T3 - Chinese Control Conference, CCC
SP - 7170
EP - 7175
BT - 2023 42nd Chinese Control Conference, CCC 2023
PB - IEEE Computer Society
T2 - 42nd Chinese Control Conference, CCC 2023
Y2 - 24 July 2023 through 26 July 2023
ER -