Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Dynamic grid optimization-based LOF (Local Outlier Factor) clustering data anomaly point detection method and detection system

A data anomaly and dynamic grid technology, applied in structured data retrieval, database model, relational database, etc., can solve the problems of limited application of abnormal point detection, high time complexity, large time and space, etc., to reduce the amount of calculation , reduce the calculation time, and the effect of small amount of calculation

Inactive Publication Date: 2017-10-17
THE 28TH RES INST OF CHINA ELECTRONICS TECH GROUP CORP
View PDF0 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, most of the existing outlier algorithms have high time complexity, and often spend a lot of time and space when dealing with large-scale data.
In addition, most of the current data mining software has a high dependence on the database environment to be detected, and the versatility is poor.
There is no software that can automatically mine data outliers for any basic database, which greatly limits the application of outlier detection

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Dynamic grid optimization-based LOF (Local Outlier Factor) clustering data anomaly point detection method and detection system
  • Dynamic grid optimization-based LOF (Local Outlier Factor) clustering data anomaly point detection method and detection system
  • Dynamic grid optimization-based LOF (Local Outlier Factor) clustering data anomaly point detection method and detection system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] The invention discloses a method for detecting abnormal points of LOF clustering data based on dynamic grid optimization and a detection system using the method. The following takes the detection of abnormal points of two-dimensional spatial data sets as an example and further clarifies the present invention in conjunction with the accompanying drawings.

[0046] Firstly, the present invention discloses a method for detecting abnormal points of LOF clustering data based on dynamic grid optimization, comprising the following steps:

[0047] (1) According to the initial cell grid vector M 0 (m 1 ,m 2 ) and growth vector △p(△p 1 ,△p 2 ), to get the optimal element grid vector M opt ;

[0048] The data space can be divided into multiple grids according to a selected cell grid vector, such as figure 1 Shown is the distribution map of a data set in two-dimensional space. The unit length division of each dimension in the data space is independent, take a smaller unit gr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a dynamic grid optimization-based LOF (Local Outlier Factor) clustering data anomaly point detection method and detection system. The detection method comprises the following steps: 1, obtaining an optimal unit grid vector Mopt according to an initial unit grid vector M0 and a growth vector delta p; 2, dividing data space into a dense region Rd, a sparse region Rs and a transition region Rt according to the optimal unit grid vector Mopt; 3. dividing all grids in the transition region Rt, which is obtained in the step 2, into dense region grids G'd and sparse region grids G's according to grid periphery density ratio vectors F, and adding the sparse region grids G's into the sparse region Rs to form a sparse region Regions of the data space; and 4, applying an LOF algorithm to detect anomaly points of the sparse region Regions of the data space. According to the method, the operation data amount of the LOF algorithm is reduced through the dynamic grids, and the computation time of the LOF algorithm is greatly reduced.

Description

technical field [0001] The invention belongs to the field of data mining, and mainly relates to a data abnormal point detection method and a detection system. Background technique [0002] With the advent of the big data era, how to effectively and efficiently analyze and mine the information contained in these large-scale data sets is of great significance. Among the information contained in large-scale database datasets, data outliers often contain potentially valuable information. Outlier detection is an important research branch of data mining. Its main function is to extract data that is very different from mainstream data and has a very small amount from huge and complex data. At present, researchers have proposed a large number of outlier detection algorithms, mainly including methods based on statistical distribution, methods based on distance, and detection methods based on clustering. However, most of the existing outlier algorithms have high time complexity, and...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/30
CPCG06F16/2465G06F16/285
Inventor 金鑫刘晓晖卢明许田丹叶健聪张硕戴楠
Owner THE 28TH RES INST OF CHINA ELECTRONICS TECH GROUP CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products