Data preprocessing method
A data preprocessing, data point technology, applied in the direction of electrical digital data processing, special data processing applications, instruments, etc., can solve the problem of low efficiency in judging a large amount of data one by one, and achieve the effect of reliable removal, improved accuracy and efficiency
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0044] Such as figure 1 As shown, the data preprocessing method of this embodiment includes the following steps:
[0045] S 101 1. Selecting a plurality of data points as the first data group, each data point in the first data group includes a first coordinate value and a second coordinate value;
[0046] S 102 , removing data points whose first coordinate values are different from the first coordinate values of all other data points in the first data group as a second data group;
[0047] S 103 1. The data points with the same first coordinate value in the second data group are used as sub-point groups, and all sub-point groups are set to an uncalculated state, and the number threshold k of points in the same group is set;
[0048] S 104 , judging whether there are sub-point groups in the uncalculated state, and executing step S when the judging result is yes 105 , execute step S when the judgment result is negative 112 ;
[0049] S 105 1. Select an uncalculated ...
Embodiment 2
[0064] Compared with the data preprocessing method of embodiment 1, the difference of the data preprocessing method of this embodiment only lies in:
[0065] In this step S 113 and the step S 114 There is also a step S 1131 : Use all the data points of the denoised data set for curve fitting to obtain a second fitting curve and a second standard deviation, and make the distance from the second fitting curve greater than or equal to three times the second standard deviation All data points of are removed from the denoised data set.
[0066] In this step S 102 and the step S 103 There is also a step S 1021 : remove the data points with the largest and smallest second coordinate values from the second data group.
[0067] Through the above steps, the reliability of screening abnormal data points can be further improved.
[0068] Such as figure 2 shows the distribution of data points for the price and sales values of the raw data, figure 2 , image 3 , Figure 4 A...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com