Abnormal data detection method and device and data pre-processing method and system
An abnormal data detection and abnormal data technology, applied in the computer field, can solve problems such as high feature dimension, large difference in sample attributes, data limitation, etc., to avoid interference, ensure stability, strong reliability and versatility.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0043] An embodiment of the present invention proposes a method for detecting abnormal data, which is used to find out abnormal data in the data to be detected. Please refer to figure 1 , the method of this embodiment includes the following steps:
[0044] S101. Perform dimensionality reduction processing on the data set to be detected using a principal component algorithm to form a first data set.
[0045] S102. Reconstruct the first data set using a principal component algorithm to form a second data set, where the second data set has the same dimension as the data set to be detected.
[0046] S103. Calculate a correlation between the data set to be detected and data corresponding to the second data set.
[0047] S104. Obtain abnormal data that is greatly different from corresponding data in the second data set among the data to be detected.
[0048] In step S101, the data to be detected in this embodiment can be, for example, big data such as an image processing system, a...
Embodiment 2
[0076] In the present invention, an algorithm for simplifying a high-dimensional data set based on principal component matrix decomposition may be used, preferably using singular value decomposition (Singular value decomposition, SVD). See figure 2 , which is a flowchart of another abnormal data detection method according to an embodiment of the present invention, which includes the following steps:
[0077] S201. Calculate the covariance matrix of the data set to be detected.
[0078] S202. Decompose the covariance matrix of the data set to be detected through singular value decomposition to obtain a (k, k)-dimensional one-orthogonal matrix. The k is the dimension of the data set to be detected.
[0079] S203. Take the first j dimensions of the orthogonal matrix, and form the projection matrix.
[0080] S204. Calculate the first data set according to the acquired projection matrix and the data set to be detected.
[0081] S205. Reconstruct the first data set using a prin...
Embodiment 3
[0107] The embodiment of the present invention also proposes a data preprocessing method, which is used to find and filter out abnormal data in a large amount of data through the principal component analysis method, and is especially suitable for system input such as image processing, credit card fraud detection, and credit warning. Data preprocessing. In the data preprocessing method of this embodiment, the abnormal data in the data to be detected is obtained first by using the abnormal data detection method, and then the abnormal data in the data to be detected is filtered out. Wherein, the process of the abnormal data detection method is the same as that of the first embodiment and the second embodiment, and will not be repeated here.
[0108] The data preprocessing method in this embodiment can select out the abnormal sample points without assuming that the data to be processed obeys a certain distribution, and is suitable for cases where there are a large number of missin...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com