基于大数据平台和并行随机森林算法的能耗预测模型优化

摘要/Abstract

摘要：

利用Hadoop，Spark，Hbase等构建分布式大数据分析平台，在此基础上通过数据采集和预处理获得健康的数据集，建立并行随机森林算法的能耗回归预测模型，全面分析和比较基于随机森林预测模型的输入与模型参数、输出之间的关系。重点比较分析了决策树数量、决策树深度、最大分裂数等参数对训练模型精度、运行时效、复杂度的影响，得到该预测模型的最优化参数，实现供电煤耗的精准预测与软测量计算。

关键词:

">">大数据分析平台, 随机森林回归算法, Spark分布式计算, 供电煤耗, 预测

Abstract:

A healthy data set is acquired through data collection and preprocessing based on the construction of distributed big data analysis platform such as Hadoop, Spark and Hbase. Regression forecasting model of energy consumption based on the parallel random forest algorithm is built to comprehensively analyze and compare the relationship between input based on random forest prediction model, model parameters and output. The emphasis lies on comparative analysis of the decision tree number, depth of the decision tree and maximum number of split, which will affect the training model accuracy, running time and complexity. Optimization of the prediction model can achieve accurate prediction on the coal consumption for power supply and soft measurement calculation.

Key words:

big data analysis platform, random forest regression algorithm, Spark distributed computing, coal consumption, prediction

肖祥武，文雯，白全生，胡卫东，李志金，刘克勤. 基于大数据平台和并行随机森林算法的能耗预测模型优化[J]. 华电技术, 2018, 40(7): 1-4.

XIAO Xiangwu, WEN Wen, BAI Quansheng, HU Weidong, LI Zhijin, LIU Keqing.

Optimization of energy consumption forecast model based on big data platform and parallel random forest

[J]. Huadian Technology, 2018, 40(7): 1-4.

[1]	丛星亮, 谢红, 苏阳, 张骏, 程英捷. 660 MW超超临界二次再热机组深度调峰试验研究[J]. 华电技术, 2021, 43(5): 64-69.
[2]	吴思明，童家麟，吴跃森，齐勇，孙五一 . 某亚临界锅炉综合升级改造实践及其性能分析[J]. 华电技术, 2020, 42(6): 66-71.
[3]	王志永1，许贺2，朱慧敏3，葛俊沛1，许静1. 耗差分析法在超临界压力直流锅炉能耗分析中的应用[J]. 华电技术, 2020, 42(1): 58-62.
[4]	蔡雨晴1，杨平1，沈丛奇2，康英伟1，归一数2，徐春梅1. 单相受热管集总参数模型优化及现代控制工程应用[J]. 华电技术, 2019, 41(6): 1-5.
[5]	万立明1，梅振锋2，李德波3，苏朋1，邱家煜1. 中速磨煤机入口紧凑型圆形一次风道流场均流技术研究[J]. 华电技术, 2019, 41(6): 6-12.
[6]	胡美玲1，赵雪瑞2，赵耀丽2. 基于以太网的水电站卷扬式启闭机控制系统的设计[J]. 华电技术, 2019, 41(6): 13-16.
[7]	梁鹏威，康振全，黄浩然，马莉莉，刘洪星，朱朝磊. 基于ANSYS的齿轮模态特性研究[J]. 华电技术, 2019, 41(6): 17-22.
[8]	张巍1a，周国鹏1b，汪洋1a，倪浩1a，钟著辉2. 一起220kV主变压器保护装置发TV异常告警的原因分析[J]. 华电技术, 2019, 41(6): 23-26.
[9]	牛腾赟. 某电厂发电设备可靠性建模及状态预测[J]. 华电技术, 2019, 41(6): 27-32.
[10]	谢坤1,2，赵谦1,2，程成1,2，李帅1,2，蔡亮亮1,2. 具有异常分析功能的通用型数字量输入式合并单元[J]. 华电技术, 2019, 41(6): 33-37.
[11]	苏攀，沈阳，于鹏峰，韩静. 正平衡供电煤耗升高原因分析及煤耗修正计算[J]. 华电技术, 2019, 41(6): 38-41.
[12]	王晓华，谈志林，宫建，李连军. 燃煤电厂氟塑料换热系统的技术经济分析[J]. 华电技术, 2019, 41(6): 42-45.
[13]	徐克涛，何永兵，裴煜坤，张杨. 某燃煤电厂SCR脱硝装置堵塞问题分析及改进[J]. 华电技术, 2019, 41(6): 46-49.
[14]	叶罗，吴俊东，陈显. 低温省煤器系统运行实践分析[J]. 华电技术, 2019, 41(6): 50-52.
[15]	靳军，种西虎，李广伟. 空气预热器排烟温度偏差分析[J]. 华电技术, 2019, 41(6): 53-56.