A data cleaning method based on two-dimensional probability density estimation and a quartile method

A quartile method and probability density technology, which is applied in the field of wind turbine data measurement and processing, can solve the problems of lack of universality, inability to effectively identify the data category of the transition area, and affect the integrity and correctness of data samples, etc., to achieve strong versatility Effect

Active Publication Date: 2019-06-21
NORTH CHINA ELECTRIC POWER UNIV (BAODING) +1
View PDF2 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Considering the complex and changeable characteristics of the actual power curve, traditional data cleaning methods generally lack versatility, especially for

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A data cleaning method based on two-dimensional probability density estimation and a quartile method
  • A data cleaning method based on two-dimensional probability density estimation and a quartile method
  • A data cleaning method based on two-dimensional probability density estimation and a quartile method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] The embodiments will be described in detail below in conjunction with the accompanying drawings.

[0026] The present invention relates to a data cleaning method based on two-dimensional probability density estimation and quartile method, such as figure 1 As shown, the specific implementation steps are as follows:

[0027] (1) Collect data such as wind speed, power, and pitch angle at the hub height of the wind turbine, and draw the measured wind speed-power scatter diagram of the wind turbine, as shown in the attached figure 2 shown; draw the wind turbine measured power-pitch angle scatter diagram, as attached image 3 shown. The normalization formula (1) is used to preprocess the data to form a sample data set;

[0028]

[0029] (2) Using the two-dimensional non-parametric kernel density estimation method to calculate the joint probability density function of power and pitch angle, as shown in formula (2), the measured power-pitch angle two-dimensional probabil...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the technical field of wind turbine generator data measurement and processing, and particularly relates to a data cleaning method based on two-dimensional probability densityestimation and a quartile method, which comprises the following steps: collecting wind speed, power and pitch angle data at the height of a hub of a wind turbine generator, and performing normalization preprocessing to form a sample data set; Calculating a joint probability density function of the power and the pitch angle by adopting a two-dimensional non-parameter kernel density estimation method, and determining a pitch angle boundary position through a grid division method; Dividing power and pitch angle data into normal data and abnormal data based on the pitch angle boundary line, and obtaining normal wind speed and power data by adopting a data time benchmarking method; And for omission of abnormal data after cleaning, adopting a quartering method to perform data recleaning. The method is high in universality, the data category of the transition area can be effectively identified, a large number of accumulation type power limiting data can be scientifically cleaned, and a reliable data basis is provided for efficiency evaluation, performance analysis, state diagnosis, health management, power prediction and the like of the wind turbine generator.

Description

technical field [0001] The invention belongs to the technical field of wind turbine data measurement and processing, and in particular relates to a data cleaning method based on two-dimensional probability density estimation and quartile method. Background technique [0002] The measured power curve of wind turbines can represent the actual operating conditions of wind turbines, and it is the premise and key to evaluate and predict the power characteristics of wind turbines. The measured power curve of wind turbines is dynamic, affected by a series of factors such as meteorological factors, environmental conditions, system control, equipment failure, etc., the collected wind speed power data has a large number of abnormal data, which cannot be directly used for power curve fitting. Abnormal data cleaning needs to be performed first. Considering the complex and changeable characteristics of the actual power curve, traditional data cleaning methods generally lack versatility,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/215
Inventor 韩爽乔延辉葛畅刘永前李莉阎洁褚景春
Owner NORTH CHINA ELECTRIC POWER UNIV (BAODING)
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products