Isolated forest-based binary classification abnormal point detection method and information data processing terminal

A detection method, a binary classification technology, applied in the direction of instrumentation, computing, character and pattern recognition, etc., can solve the problems of sparse expression of the original data set, loss of completeness of the data set, lack of abnormal detection standards, etc., to achieve enhanced robustness, The effect of reducing the amount of actual data and improving the efficiency of anomaly detection

Inactive Publication Date: 2019-07-23
CHENGDU UNIV OF INFORMATION TECH
View PDF3 Cites 29 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0023] (1) The existing technology does not sparsely express the original data set, resulting in redundant detection of normal data points in the process of anomaly detection
[0024] (2) The existing technology does not have a better treatment for the abnormal points on the surface where the edge point and the two subspaces intersect, resulting in the addition of normal data to the abnormal data set, and many data will produce fuzzy operations, lacking more specific and strict anomaly detection criteria; not very robust
[0025] (3) The traditional anomaly detection algorithm does not label the data
In the face of non-linearly separable data information, direct classification will cause some data at the boundary to be lost, and the completeness of the data set will be lost.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Isolated forest-based binary classification abnormal point detection method and information data processing terminal
  • Isolated forest-based binary classification abnormal point detection method and information data processing terminal
  • Isolated forest-based binary classification abnormal point detection method and information data processing terminal

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0064] In order to make the object, technical solution and advantages of the present invention more clear, the present invention will be further described in detail below in conjunction with the examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0065] The present invention is to solve the problem of abnormal point detection in the case of large data volume and high dimension; the isolated forest anomaly detection algorithm is a relatively complex algorithm with relatively high requirements for computing resources, whether it is for computing time or There are high requirements for memory space. With the development of modern computer technology, the amount of data has increased rapidly, and the outlier detection process often involves huge data arrays and large-scale data operations, which puts forward higher requirements for the efficiency of outlier detection al...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the technical field of communication control and communication processing, and discloses an isolated forest-based binary classification abnormal point detection method and aninformation data processing terminal. The method comprises the steps of carrying out initial static average blocking on an original data set, and calculating the density in the block and the mean density; after calculating the density in each block of the static block, reducing the data set by taking the mean density of the original data set as a threshold value; constructing an isolated forest byusing a node recursion method; performing corresponding feature extraction and datamation on the original data set, and calculating the spatial position distances between the clustering center pointand other points; adding the abnormal score calculated on the basis of the density and the distance and the abnormal score calculated on the basis of the proof information and comparing with a corresponding threshold value. According to the method, the accuracy of an abnormal point detection algorithm is effectively improved, the actual data size in the abnormal detection process can be greatly reduced, the calculation resources are saved, and the abnormal detection efficiency is improved, and the robustness of an abnormal detection algorithm is enhanced.

Description

technical field [0001] The invention belongs to the technical field of communication control and communication processing, and in particular relates to a method for detecting abnormal points based on isolated forest binary classification and an information data processing terminal. Background technique [0002] At present, the closest existing technology: Among the commonly used outlier detection algorithms, there are many classic algorithms, which cut into anomaly detection from different angles, and the anomaly detection of a class of support vector machines based on neural networks uses points and points The calculation of the space Euclidean distance between can get the minimum interval, so as to determine the corresponding support vector, and then maximize the distance between the two support vectors through the objective function under the constraints, so that the separation hyperplane can be determined to achieve Purpose of anomaly detection. Of course, the above met...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/214G06F18/2411G06F18/24323
Inventor 李孝杰李俊良史沧红吕建成吴锡周激流刘书樵张宪
Owner CHENGDU UNIV OF INFORMATION TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products