Dynamic optimization modeling of multi-energy complementary energy system based on multi-agent reinforcement learning

doi:10.20250131

Integrated Intelligent Energy

CHEN Feng, LU Xiaomin, HU Ke, SHEN Bing, WANG Junpeng

School of Applied Engineering, Henan University of Science and Technology, Henan 472000, China
R&D Department, Zhengzhou Inspur Data Technology Co., Ltd., Henan 450000, China
School of Electrical Engineering, North China University of Water Resources and Electric Power, Henan 450018, China
School of Electrical Engineering, Zhengzhou University of Aeronautics, Henan 450018, China

Received: 2025-07-18 Revised: 2025-12-05
Supported by:
National Natural Science Foundation of China (72342104)

Abstract

Abstract: Multi-energy complementary energy systems with a high share of renewable generation pose a dynamic collaborative optimization problem, and traditional centralized methods are limited in coordinating the interests of multiple stakeholders and in responding in real time. To address both issues, this study develops a dynamic optimization model. A three-layer multi-agent reinforcement learning framework, consisting of a physical layer, a decision layer, and a coordination layer, is constructed, in which energy producers, consumers, and the system dispatcher are modeled as independent agents. Based on an improved proximal policy optimization (PPO) algorithm, a dynamic reward function integrating economic, environmental, and stability objectives is designed, and distributed decision-making with global coordination is achieved through a centralized-training, decentralized-execution mechanism. A case study on a typical park-level multi-energy complementary system shows that the proposed model raises the renewable energy accommodation rate to 95.3% and lowers the cost per kilowatt-hour by 18.7%. Under a 50% sudden load change, the time for the system to return to stable operation is shortened to 90 seconds, a 90% reduction compared with the traditional mixed-integer programming method. With wind and solar forecast errors of ±20%, the load satisfaction rate remains at 98.7%. The dynamic optimization model effectively addresses multi-agent coordination and adaptation to uncertainty in multi-energy complementary systems, and provides technical support for real-time optimal dispatch of energy systems with high renewable penetration.

Key words: multi-energy complementary energy system, multi-agent reinforcement learning, dynamic optimization modeling, source-grid-load-storage coordination, renewable energy accommodation, collaborative scheduling
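
The abstract mentions a dynamic reward function that combines economic, environmental, and stability objectives but does not give its form. The following is only a minimal sketch of how such a reward could be assembled, assuming a weighted sum of normalized penalties whose weights shift with the renewable share. All names (`StepOutcome`, `dynamic_weights`, `reward`), the weighting rule, and the scaling constants are hypothetical and are not taken from the paper.

```python
from dataclasses import dataclass


@dataclass
class StepOutcome:
    """Quantities observed after one dispatch step (illustrative fields only)."""
    operating_cost: float    # cost incurred in this step, in currency units
    carbon_emission: float   # emissions in this step, in kg CO2
    power_imbalance: float   # absolute supply-demand mismatch, in kW
    renewable_share: float   # renewable fraction of current supply, in [0, 1]


def dynamic_weights(renewable_share: float) -> tuple[float, float, float]:
    """Assumed rule: move weight from economy to stability as renewables grow.

    Returns (w_economy, w_environment, w_stability); the three always sum to 1.
    """
    share = min(max(renewable_share, 0.0), 1.0)  # clamp to [0, 1]
    w_stability = 0.2 + 0.3 * share
    w_environment = 0.3
    w_economy = 1.0 - w_stability - w_environment
    return w_economy, w_environment, w_stability


def reward(outcome: StepOutcome,
           cost_scale: float = 100.0,
           emission_scale: float = 50.0,
           imbalance_scale: float = 10.0) -> float:
    """Negative weighted sum of normalized penalties (higher is better)."""
    w_eco, w_env, w_stab = dynamic_weights(outcome.renewable_share)
    penalty = (w_eco * outcome.operating_cost / cost_scale
               + w_env * outcome.carbon_emission / emission_scale
               + w_stab * outcome.power_imbalance / imbalance_scale)
    return -penalty
```

Under a centralized-training, decentralized-execution setup, a scalar of this kind would typically feed a shared critic during training while each agent acts on its own local observation; how the paper actually distributes the reward among agents is not stated in the abstract.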

Cite this article:

CHEN Feng, LU Xiaomin, HU Ke, SHEN Bing, WANG Junpeng. Dynamic optimization modeling of multi-energy complementary energy system based on multi-agent reinforcement learning[J]. Integrated Intelligent Energy, doi: 10.20250131.

