Integrated Intelligent Energy ›› 2024, Vol. 46 ›› Issue (10): 32-39.doi: 10.3969/j.issn.2097-0706.2024.10.005

• Power Grid and AI • Previous Articles     Next Articles

Decentralized voltage control of distribution network based on multi-agent reinforcement learning

MA Gang1,2(), MA Jian2, YAN Yunsong1, CHEN Yonghua1, LAI Yening1, LI Zhukun1, TANG Jing1   

  1. 1. State Key Laboratory of Technology and Equipment for Defense Against Power System Operational Risks, Nari Technology Company Limited, Nanjing 211106, China
    2. School of Electrical and Automation Engineering,Nanjing Normal University, Nanjing 210023, China
  • Received:2024-07-15 Revised:2024-08-27 Accepted:2024-09-13 Published:2024-10-25
  • Supported by:
    Science and Technology Project of Jiangsu Province(BK20232026);State Key Laboratory of Smart Grid Protection and Operation Control(SGNR0000KTTS2302147)

Abstract:

The integration of large-scale decentralized resources into the distribution network has changed the traditional power flow distribution, resulting in frequent voltage violations. Model-based voltage control methods require a detailed knowledge of power system network topology and have long computation time, making them unsuitable for real-time voltage control.To address this, this paper proposes a multi-agent online learning strategy for decentralized voltage control in distribution networks, considering asynchronous training. The method considered each photovoltaic (PV) inverter as an agent. First, the agents were partitioned and adjusted, then the voltage reactive power control problem of distribution network was modelled as a Markov decision process. Based on distributed system constraints, a multi-agent reinforcement learning decentralized control framework was used, and agents were trained with a multi-agent deep deterministic policy gradient(MADDPG) algorithm. Once trained, the agents could make decentralized decisions using local information without real-time communication, enabling real-time voltage control and reducing network losses by determining the output plan for the PV inverters. Finally, the effectiveness and robustness of the method were verified through simulation.

Key words: distribution network, multi-agent, voltage decentralized control, multi-agent deep deterministic policy gradient, Markov decision process

CLC Number: