Integrated Intelligent Energy ›› 2026, Vol. 48 ›› Issue (1): 43-58.doi: 10.3969/j.issn.2097-0706.2026.01.005

• Optimization and Scheduling of Integrated Intelligent Energy System • Previous Articles     Next Articles

Optimal scheduling strategy for REHMIS-IES based on SC-SAC algorithm

PAN Lei(), DING Yunfei(), PANG Yi*(), WANG Yuxuan(), CHEN Jianwei(), GAO Rui(), ZHANG Liyang()   

  1. School of Control and Mechanical Engineering, Tianjin Chengjian University, Tianjin 300384, China
  • Received:2025-08-12 Revised:2025-10-21 Published:2026-01-25
  • Contact: PANG Yi E-mail:panlei4089@163.com;819667068@qq.com;primepang@163.com;1442030308@qq.com;jianwei_chen1314@126.com;gaorui223@126.com;zlysghr@163.com
  • Supported by:
    Key Research and Development Project of Tianjin(25YFXTHZ00530)

Abstract:

The renewable energy-hydrogen-methanol integrated station(REHMIS) produces green hydrogen using electricity generated from renewable energy sources, and further synthesizes methanol from the green hydrogen and carbon dioxide, thereby achieving the substitution of green hydrogen for hydrogen produced from conventional fossil fuels. To simultaneously meet the methanol load demand of REHMIS and the multi-energy demand of its supporting buildings, a novel integrated energy system(IES) topology named REHMIS-IES was designed. To obtain an efficient operation strategy for REHMIS-IES, an execution framework based on the strictly constrained soft actor-critic(SC-SAC) algorithm was proposed. The established mathematical model was transformed into a Markov decision process, and a state constraint mechanism(SCM) was incorporated to prevent drastic fluctuations in the state of the energy storage system. In the execution stage of the SC-SAC algorithm, the trained Q-network and action constraints were transformed into a mixed-integer linear programming(MILP) model to ensure that scheduling decisions could comply with all operational constraints. The results from multi-scenario simulations showed that the proposed system could effectively reduce operating costs while meeting multi-energy demands. Compared with other deep reinforcement learning algorithms, the SC-SAC algorithm could lower the system energy imbalance by approximately 16.2% and reduce operating costs by at least 11.7%.

Key words: renewable energy-hydrogen-methanol integrated station, green hydrogen, energy storage, integrated energy system, deep reinforcement learning, state constraint mechanism, soft actor-critic algorithm, mixed-integer linear programming

CLC Number: