综合智慧能源 ›› 2023, Vol. 45 ›› Issue (7): 61-69.doi: 10.3969/j.issn.2097-0706.2023.07.007

• 优化运行与控制 • 上一篇    下一篇

基于MADDPG算法的建筑群柔性负荷优化调控方法

包义辛1(), 徐椤赟1,2(), 杨强1,*()   

  1. 1.浙江大学 电气工程学院,杭州 310027
    2.浙江省白马湖实验室有限公司,杭州 310056
  • 收稿日期:2023-06-05 修回日期:2023-07-03 接受日期:2023-07-25 出版日期:2023-07-25 发布日期:2023-07-25
  • 通讯作者: *杨强(1979),男,教授,博士,博士生导师,从事从事综合能源系统规划与运行控制研究,qyang@zju.edu.cn
  • 作者简介:包义辛(2001),男,在读硕士研究生,从事电力系统优化规划与运行调控等方面的研究,3190103975@zju.edu.cn
    徐椤赟(1993),男,博士,从事新能源微电网高效运行与稳定性等方面的研究,luoyun.xu@outlook.com
  • 基金资助:
    中国电机工程学会青年人才托举工程项目(CSEE-YESS-2021020)

Optimized control method for flexible load of a building complex based on MADDPG reinforcement learning

BAO Yixin1(), XU Luoyun1,2(), YANG Qiang1,*()   

  1. 1. College of Electrical Engineering,Zhejiang University,Hangzhou 310027 China
    2. Zhejiang Baima Lake Laboratory Company Limited,Hangzhou 310056, China
  • Received:2023-06-05 Revised:2023-07-03 Accepted:2023-07-25 Online:2023-07-25 Published:2023-07-25
  • Supported by:
    The Young Elite Scientists Sponsorship Program by CSEE(CSEE-YESS-2021020)

摘要:

随着电网调度环境和信息整理环境日趋复杂,电网调控的难度也随之增加。针对深度强化学习技术具有有效感知复杂系统运行状态、适应性强、可扩展性好等特点,提出了基于深度强化学习的配网优化调度方法。构建了考虑源-网-荷-储的模拟建筑体配网模型,从原理出发对多智能体深度确定性策略梯度(MADDPG)算法进行静态优化,将模型与真实数据导入适用于电网级目标的多智能体强化学习框架中,尝试用优化后的算法对配网系统进行电压调控。结果表明,所用算法基本消除了配网系统的违规峰值电压,降低了总体电压偏差;优化后的多目标导向算法在保持电压稳定的同时减小了负载-发电功率差,使负载功率损耗维持较低水平,表明基于深度强化学习的建筑群柔性负荷优化调控方法具有一定有效性。

关键词: 微电网调控, 能量管理, 深度强化学习, 确定性策略梯度, 多目标优化, 源网荷储, 建筑群柔性负荷

Abstract:

The power grid dispatch environment and information organization environment have become more complex, and the difficulty of power grid regulation has gradually increased. Since deep reinforcement learning technology is of effective perception on complex system operation statuses,strong adaptability and good scalability,a distribution network optimization scheduling method based on deep reinforcement learning is proposed. Based on the simulated source-network-load-storage integrated distribution network model of a building complex,Multi-Agent Deep Deterministic Policy Gradient(MADDPG)algorithm was statically optimized from its principle.The model and real data were input into a multi-agent reinforcement learning framework suitable for grid-level objectives,and the optimized algorithm was tried to regulate the voltage of the distribution network system. The results show that the algorithm basically eliminates the abnormal peak voltages and reduces the overall voltage deviation.The optimized multi-objective oriented algorithm reduces the load-generated power difference while levelling the voltage off at a low level. The optimized control method for building complex flexible load based on reinforcement learning is proven to be effective.

Key words: microgrid regulation, energy management, deep reinforcement learning, deterministic policy gradient, multi-objective optimization, source-grid-load- storage, flexible load of buildings

中图分类号: