Integrated Intelligent Energy ›› 2023, Vol. 45 ›› Issue (7): 61-69.doi: 10.3969/j.issn.2097-0706.2023.07.007

• Optimal Operation and Control • Previous Articles     Next Articles

Optimized control method for flexible load of a building complex based on MADDPG reinforcement learning

BAO Yixin1(), XU Luoyun1,2(), YANG Qiang1,*()   

  1. 1. College of Electrical Engineering,Zhejiang University,Hangzhou 310027 China
    2. Zhejiang Baima Lake Laboratory Company Limited,Hangzhou 310056, China
  • Received:2023-06-05 Revised:2023-07-03 Accepted:2023-07-25 Online:2023-07-25 Published:2023-07-25
  • Supported by:
    The Young Elite Scientists Sponsorship Program by CSEE(CSEE-YESS-2021020)

Abstract:

The power grid dispatch environment and information organization environment have become more complex, and the difficulty of power grid regulation has gradually increased. Since deep reinforcement learning technology is of effective perception on complex system operation statuses,strong adaptability and good scalability,a distribution network optimization scheduling method based on deep reinforcement learning is proposed. Based on the simulated source-network-load-storage integrated distribution network model of a building complex,Multi-Agent Deep Deterministic Policy Gradient(MADDPG)algorithm was statically optimized from its principle.The model and real data were input into a multi-agent reinforcement learning framework suitable for grid-level objectives,and the optimized algorithm was tried to regulate the voltage of the distribution network system. The results show that the algorithm basically eliminates the abnormal peak voltages and reduces the overall voltage deviation.The optimized multi-objective oriented algorithm reduces the load-generated power difference while levelling the voltage off at a low level. The optimized control method for building complex flexible load based on reinforcement learning is proven to be effective.

Key words: microgrid regulation, energy management, deep reinforcement learning, deterministic policy gradient, multi-objective optimization, source-grid-load- storage, flexible load of buildings

CLC Number: