基于并行计算的大数据挖掘技术及其在电站锅炉性能优化中的应用

刘炳含; 付忠广; 王永智; 王鹏凯; 高学伟

基于并行计算的大数据挖掘技术及其在电站锅炉性能优化中的应用

Big Data Mining Technology Based on Parallel Algorithm and Its Application in Power Plant Boiler Performance Optimization

摘要

摘要: 针对传统数据挖掘方法无法胜任与日俱增的海量数据挖掘工作的问题，引入大数据挖掘技术，以粗糙集属性约简方法为基础，对经典K-means聚类算法进行改进，实现其在Hadoop平台的MapReduce框架上的并行化计算，形成满足海量数据挖掘工作的新算法。以某600 MW燃煤发电机组海量运行数据为挖掘对象，采用新算法对典型负荷工况下影响锅炉效率的运行参数进行挖掘，挖掘出可调控机组运行参数的最优目标值。结果表明：新算法可用于锅炉海量运行数据优化目标值的确定，节能减排效果良好，其挖掘出的优化目标值代表了历史最优可达值，可指导锅炉优化运行。

Abstract: To meet the ever-increasing demands of massive data mining that traditional methods are not able to deal with, a new algorithm was proposed based on rough set attribute reduction method and improved classical K-meas clustering algorithm by applying big data mining technology, so as to realize parrallel computation on MapReduce framework of Hadoop platform. Taking the massive data of a 600 MW coal-fired power unit as the mining object, the new algorithm was applied to optimize the adjustable operation parameters that may affect the boiler efficiency under typical load conditions. Results show that the new method can be used to determine the optimal target values of massive operation data, with remarkable effects in energy saving and emission reduction; the optimal target values mined by the new algorithm represent historical optimal reachable values, which may serve as a reference for operation optimization of similar boilers.

HTML全文

参考文献(25)

施引文献

资源附件(0)