Unlock instant, AI-driven research and patent intelligence for your innovation.

An anomaly detection method and computer-readable storage medium for a heterogeneous data set

An anomaly detection and heterogeneity technology, applied in the computer field, can solve problems such as inability to detect anomalies in heterogeneous data sets

Active Publication Date: 2021-01-05
北京志翔科技股份有限公司
View PDF11 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The present invention provides an anomaly detection method for a heterogeneous data set and a computer-readable storage medium to solve the problem in the prior art that the anomaly detection of a heterogeneous data set cannot be performed well

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An anomaly detection method and computer-readable storage medium for a heterogeneous data set
  • An anomaly detection method and computer-readable storage medium for a heterogeneous data set

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] The embodiment of the present invention aims at the existing problem that it is impossible to accurately detect abnormalities in high-dimensional unlabeled heterogeneous data sets. By selecting several unused classification indicators from the preset classification indicator set, based on the selected classification indicators, the The heterogeneous data set performs index threshold segmentation processing, generates data subsets after segmentation and classification under each classification index, and performs anomaly detection on each data subset, so as to achieve accurate high-dimensional unlabeled heterogeneous data sets to test. The present invention will be described in further detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0027] The first embodiment of the present invention provi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an anomaly detection method of a heterogeneous data set and a computer-readable storage medium. The invention selects several unused classification indexes from a preset classification index set, and based on the selected classification indexes, detects heterogeneity The data set is subjected to index threshold segmentation processing to generate data subsets after segmentation and classification under the selected classification index, and to perform anomaly detection on each data subset, that is to say, the present invention classifies the classification index based on the selected classification index The data under the index threshold segmentation process is performed to obtain multiple data subsets under the selected classification index. By performing anomaly detection on the data subsets, it is possible to accurately perform high-dimensional unlabeled heterogeneous data sets. abnormal detection.

Description

technical field [0001] The invention relates to the field of computer technology, in particular to an anomaly detection method and a computer-readable storage medium for a heterogeneous data set. Background technique [0002] At present, the anomaly detection of data sets is mainly carried out through statistical hypothesis testing and isolation forest method. Specifically, statistical hypothesis testing needs to assume that the data obeys a certain distribution, which is only applicable to one-dimensional data, while isolation forest requires each random Select the dimension and threshold to split the data set until each set has only one piece of data, forming an isolated tree. The fewer the number of splits, the higher the outlier score. However, due to the different anomaly detection thresholds of heterogeneous data sets, the existing statistical hypothesis testing and isolation forest methods cannot detect anomalies in heterogeneous data sets. Contents of the invention...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/62
CPCG06F18/2433
Inventor 巩国栋严朝豪薛野宋洋孙凯
Owner 北京志翔科技股份有限公司