Power generation big data preprocessing method and system based on big data analysis platform
A big data preprocessing technology, applied in the field of power informatization, that addresses the problems of inaccurate, inconsistent, and incomplete power generation big data, with the effects of improving efficiency and accuracy, reducing the amount of data processing, and improving data quality.
Examples
Embodiment 1
[0036] Referring to Figure 1, the method for preprocessing power generation big data based on the big data analysis platform in this embodiment includes the following steps:
[0037] S1: Extract the operating data of the power plant's generator set from the power plant's real-time database, generate TXT text data, and upload it to the big data analysis platform, so that data cleaning and data mining can be performed on the power generation big data stored in the platform;
[0038] S2: When the operating data of the generator set needs to be called, filter it according to the judging rules for generator set start-up and shutdown, and delete the start-up and shutdown data from the generator set operating data obtained from the big data analysis platform. The judging condition for shutdown data is that load ≤ 8 MW and speed ≤ 2900 r/min are satisfied at the same time. Start-up and shutdown ...
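The shutdown rule in step S2 can be sketched as a simple record filter. This is a minimal illustration, not the patented implementation: the Sample record, its field names, and the function names are hypothetical, and only the two thresholds (8 MW and 2900 r/min) come from the text above.

```python
from dataclasses import dataclass

# Thresholds taken from step S2; everything else here is an assumption.
LOAD_LIMIT_MW = 8.0
SPEED_LIMIT_RPM = 2900.0


@dataclass
class Sample:
    """One operating-data record (hypothetical layout)."""
    timestamp: str
    load_mw: float
    speed_rpm: float


def is_shutdown(sample: Sample) -> bool:
    """True when both conditions of the shutdown rule hold at the same time."""
    return sample.load_mw <= LOAD_LIMIT_MW and sample.speed_rpm <= SPEED_LIMIT_RPM


def drop_shutdown(samples: list[Sample]) -> list[Sample]:
    """Return only the records that do not match the shutdown rule."""
    return [s for s in samples if not is_shutdown(s)]
```

Both comparisons are inclusive, matching the "≤" in the rule, so a record at exactly 8 MW and 2900 r/min is treated as shutdown data. A record that meets only one of the two conditions is kept.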
Embodiment 2
[0044] Referring to Figure 2, the method for preprocessing power generation big data based on the big data analysis platform in this embodiment includes the following steps:
[0045] S1: Data collection and storage. The operating data of the power plant's generator set is extracted from the real-time database of the power plant's plant-level supervisory information system (SIS for short, a plant-level automation information system that integrates real-time process monitoring, optimization control, and production process management). TXT text data is generated and uploaded to the HDFS (Hadoop Distributed File System) distributed storage system of the big data analysis platform. After file merging and format conversion, the TXT data files are converted into Parquet format and stored for use by the big data analysis platform. The data files in the platform are basically stored in the HDFS file system. HDFS supports the s...
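The file merging part of step S1 can be sketched as follows. This is only an illustration under stated assumptions: the source does not describe the TXT layout, so the sketch assumes each export is line-oriented with an identical header line, and the function name merge_txt_files is hypothetical. The upload to HDFS and the conversion to Parquet are not shown, since they depend on platform tooling that the text does not specify.

```python
def merge_txt_files(input_paths, output_path):
    """Concatenate TXT exports into one file, writing the header only once.

    Assumes (not stated in the source) that every file starts with the same
    header line. Returns the number of data rows written.
    """
    header = None
    rows_written = 0
    with open(output_path, "w", encoding="utf-8") as out:
        for path in input_paths:
            with open(path, "r", encoding="utf-8") as src:
                lines = src.read().splitlines()
            if not lines:
                continue  # skip empty exports
            if header is None:
                header = lines[0]
                out.write(header + "\n")
            elif lines[0] != header:
                raise ValueError(f"header mismatch in {path}")
            for line in lines[1:]:
                if line.strip():  # ignore blank lines
                    out.write(line + "\n")
                    rows_written += 1
    return rows_written
```

Rejecting files whose header differs is a deliberate choice in this sketch: silently merging exports with different column layouts would corrupt the merged file before any cleaning step could detect it.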
Embodiment 3
[0072] Referring to Figure 2, this embodiment collects, on the big data analysis platform, the past year's historical data for 184 energy consumption indicators (such as load, main steam pressure, and power supply coal consumption) of a supercritical 600 MW unit at a power plant. The above big data preprocessing method is applied to the sample data: the data is cleaned and preprocessed, non-real data is eliminated, and the stability of the working conditions is judged, so as to obtain healthy data under stable working conditions for data mining analysis. The specific steps are as follows:
[0073] S1: Data collection and storage.
[0074] Based on the historical operation data of unit #3, the most recent year of data is collected, totaling 525,600 records and 4.5 GB. The data is collected as TXT files in two batches. Through data merging and format conversion, the data is merged into a single file and stored in the HDFS file storage system.
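As a quick sanity check on these figures, the record count can be compared with the number of minutes in a year. The source does not state the sampling interval; the sketch below only shows that 525,600 records over a 365-day year is consistent with one record per minute. The function names are hypothetical, and the size estimate assumes decimal gigabytes.

```python
MINUTES_PER_DAY = 24 * 60


def implied_interval_minutes(record_count: int, days: int = 365) -> float:
    """Average spacing between records, in minutes, over the given period."""
    return days * MINUTES_PER_DAY / record_count


def average_bytes_per_record(total_gb: float, record_count: int) -> float:
    """Average raw size of one record, assuming 1 GB = 10**9 bytes."""
    return total_gb * 10**9 / record_count


interval = implied_interval_minutes(525_600)      # 365 * 1440 / 525600 = 1.0
record_size = average_bytes_per_record(4.5, 525_600)
print(f"implied interval: {interval} min")
print(f"average record size: {record_size:.0f} bytes")
```

Under these assumptions each record averages roughly 8.5 KB of raw text. That is an estimate only, since the actual TXT layout is not given.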
[0075] S2: filter t...


