The invention discloses a method for determining a thermal
power unit operation optimization target value based on Apache Spark and an improved FP-Growth
algorithm. The method includes the following steps of: S1, selecting influence parameters of a power supply
coal consumption rate, collecting historical operation data according to the influence parameters, and performing data preprocessing and steady-state detection; S2, screening operation optimization parameters from the influence parameters of the power supply
coal consumption rate by using Pearson
correlation analysis; S3, improving an FP-Growth
algorithm based on a matrix technology; S4, parallelizing the improved FP-Growth
algorithm based on Apache Spark; S5, discretizing the data of the operation optimization parameters, mining afrequent pattern from a
data set by using the parallel improved FP-Growth algorithm, inversely discretizing a mining result, and sorting out the mining result to obtain the target value of the
unit operation optimization parameters under various working conditions. The method for determining the target value of the thermal
power unit operation optimization parameters disclosed by the invention isbased on the Apache Spark and the improved FP-Growth algorithm, and has the advantages of short time consumption, low memory usage and high efficiency of mining massive data.