Data exception detection method, device, equipment and storage medium

A data anomaly and detection method technology, applied in the field of data processing, can solve the problems of low detection accuracy, high manpower, material and financial resources, and small proportion of abnormal data

Pending Publication Date: 2020-10-23
PING AN TECH (SHENZHEN) CO LTD
View PDF0 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] 1. Most of the massive data generated during the running of the program is unlabeled, and data labeling often requires professionals, so it takes a lot of manpower, material and financial resources to obtain enough data labels
[0006] 2. The proportion of abnormal data is small, and it is more cumbersome to find potential abnormal points and their corresponding classification from a large amount of data
[0007] The density-based method belongs to unsupervised learning, which can be completed without data labeling, but the detection accuracy is usually not high, and there is no theoretical support for the classification results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data exception detection method, device, equipment and storage medium
  • Data exception detection method, device, equipment and storage medium
  • Data exception detection method, device, equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0055] In order to make the purpose, technical solution and advantages of the present application clearer, the present application will be further described below in conjunction with the accompanying drawings and specific embodiments. It should be understood that the specific embodiments described here are only used to explain the present application, not to limit the present application. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the scope of protection of this application.

[0056] Such as figure 1 As shown, an embodiment of the present application provides a data anomaly detection method, including:

[0057] S1. Obtain unlabeled data.

[0058] In the actual intelligent operation process, the data generated by the computer system is often unbalanced, and most of the data is normal data. Therefore, the abnormal data detection in the operation process can be re...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a data exception detection method, a device, equipment and a storage medium. The method comprises the steps of obtaining unmarked data; extracting primary abnormal data from the unmarked data according to a preset query strategy; identifying and marking the primary abnormal data, storing the primary abnormal data into a marked first data set to form a second data set, and training a pre-trained hypersphere classification model through the second data set; identifying whether the hyper-sphere classification model meets a training termination condition or not; and when the training termination condition is reached, inputting the unmarked data into the hypersphere classification model under the training termination condition for classification screening to obtain target abnormal data. According to the method, a small amount of marked data is used for training the classification model, the classification model is used for classifying the unmarked data after the training termination condition is met, original distribution of the data is not limited, the data volume needing to be marked by an operator is reduced, and the classification result accuracy is high.

Description

technical field [0001] The present application relates to the technical field of data processing in artificial intelligence, and in particular to a data anomaly detection method, device, equipment and storage medium. Background technique [0002] The monitoring of computer systems is an important part of AIOps. In the process of monitoring computer systems, the CPU and disk of the computer system will generate a large amount of index data, which will also contain some abnormal values. Through the branching of abnormal points, the cause of system abnormality can be found out, and suggestions for subsequent operations can be provided. Therefore, anomaly detection technology plays an important role in the field of intelligent operation. [0003] Traditional anomaly detection includes statistical and density-based methods. [0004] Statistics-based methods often find out suspected outliers by training a large amount of labeled data, which belongs to supervised learning. Accor...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F11/30G06K9/62
CPCG06F11/3065G06F18/24143G06F18/214
Inventor 邓悦郑立颖徐亮
Owner PING AN TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products