Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

DBSCAN abnormal data identification and detection method based on core point reservation

A technology of abnormal data and detection method, applied in the field of abnormal data identification and detection of DBSCAN, to achieve the effect of reducing noise points and improving scalability

Pending Publication Date: 2020-03-31
HUAQIAO UNIVERSITY
View PDF0 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] Aiming at the deficiencies of the prior art, the present invention provides an abnormal data identification and detection method based on DBSCAN with core point retention, which solves the problem that the identification method of abnormal data has almost no multivariate inconsistency check

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • DBSCAN abnormal data identification and detection method based on core point reservation
  • DBSCAN abnormal data identification and detection method based on core point reservation
  • DBSCAN abnormal data identification and detection method based on core point reservation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0044] see Figure 1-6, the present invention provides a technical solution: a method for identifying and detecting abnormal data based on DBSCAN reserved by core points, comprising the following steps:

[0045] S1: Randomly divide the data set into training set and test set;

[0046] S2: Construct the nearest neighbor matrix Croe-M and inverse neighbor matrix Croe-MR of the training set, and use the inverse neighbor matrix Croe-MR as the basis for determinin...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a DBSCAN abnormal data identification and detection method based on core point reservation. The method comprises the following steps of S1, randomly dividing a data set into atraining set and a test set; s2, constructing a neighbor matrix Cre-M and an inverse neighbor matrix Cre-MR of the training set, and taking the inverse neighbor matrix Cre-MR as a basis for judging acore point. The invention relates to the technical field of anomaly identification and detection methods. According to the DBSCAN abnormal data identification and detection method based on core pointreservation, the density is redefined, the training set is clustered and labeled by using the density, and thus, data in a remaining test set is classified by using the label; a training set and a test set are divided through random sampling, so that the expansibility of the detection method is improved; meanwhile, only core points are adopted to establish a model, so that the influence of noise points, especially edge points, on a classification result is effectively reduced; through density definition, the weight of the sample points in the data set category can be well represented, and a better classification effect is achieved.

Description

technical field [0001] The invention relates to the technical field of abnormal identification and detection methods, in particular to an abnormal data identification and detection method based on DBSCAN with core point retention. Background technique [0002] Anomaly identification is a detection method for outlier sample points in a data set. Anomalies have rich connotations and may be noise, errors, or rare values. In the field of data mining, its generally accepted definition is a point that is produced by other mechanisms and deviates from the majority of observations. In this paper, the point opposite to the "outlier point" is called "normal point". [0003] As an important research direction, anomaly recognition has been widely used in real-world applications such as credit card fraud recognition, disease diagnosis and prevention, network intrusion, measurement error, and abnormal electricity consumption behavior. [0004] Statistical-Based Anomaly Identification M...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/2321G06F18/2433
Inventor 高振国胡凌岳陈丹杰蔡绍滨王田莫毓昌陈益峰张忆文
Owner HUAQIAO UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products