Integrated Intelligent Energy ›› 2025, Vol. 47 ›› Issue (1): 18-25.doi: 10.3969/j.issn.2097-0706.2025.01.003

• New Power System Scheduling based on AI • Previous Articles     Next Articles

Multivariable integrated power control optimization of wind farms based on deep reinforcement learning

ZHANG Huaqin1,2(), LIU Wei3,*(), WANG Hui1,2, LI Leixiao1,2, Sharengaowa4   

  1. 1. School of Data Science and Applications,Inner Mongolia University of Technology,Hohhot 010080,China
    2. Inner Mongolia Autonomous Region Engineering & Technology Research Center of Big Data Based Software Service,Hohhot 010080, China
    3. Linhe District Meteorological Bureau, Bayannur 015000,China
    4. Public Security Department of Inner Mongolia Autonomous Region,Hohhot 010080,China
  • Received:2024-10-08 Revised:2024-11-13 Published:2025-01-25
  • Contact: LIU Wei E-mail:20221800730@imut.edu.cn;723674266@qq.com
  • Supported by:
    National Natural Science Foundation of China(62362055);Scientific and Technological Innovation Project of Meteorological Bureau of Inner Mongolia Autonomous Region(nmqxkjcx202312);Scientific and Technological Innovation Project of Meteorological Bureau of Inner Mongolia Autonomous Region(nmqxkjcx202435)

Abstract:

China has proposed the"dual carbon"strategy,aiming to build a new power system with renewable energy as the primary component.Based on real wind farm data,optimization control strategies were proposed to improve the wind farm's output power,thereby further enhancing wind energy utilization.The wake effects between wind turbines were the main focus,and a wind farm power multivariable optimization control strategy based on model-free deep reinforcement learning(DRL)was proposed.The strategy employed the Proximal Policy Optimization(PPO)algorithm to optimize multiple variables,including the yaw angle,tilt angle,blade pitch angle,and tip speed ratio(TSR) in a dynamic wind farm.Through intelligent agents learning from the data generated by the agent during operation,an optimal control strategy was obtained,overcoming the limitations of traditional mathematical optimization methods. Simulation results showed that,compared to existing wind turbine control algorithms,the model-free DRL-based multivariable optimization control strategy significantly improved computational efficiency,reduced the difficulty of parameter optimization,and optimized the direction and strength of the wake.The optimized average output power was increased by 37.08%.

Key words: wind farm, deep reinforcement learning, wake control, multivariable control, power optimization, PPO, "dual carbon" strategy, new power system

CLC Number: