Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Metric space outlier detection method based on various supporting points

A detection method and space measurement technology, which can be used in other database retrieval, special data processing applications, instruments, etc., can solve the problems of long index establishment time, reduce the number of distance calculations, improve the speed of outlier detection, and avoid data space distortion Effect

Inactive Publication Date: 2017-12-15
FOSHAN UNIVERSITY
View PDF5 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The above-mentioned HIOD algorithm overcomes the shortcomings of the iORCA algorithm, but this algorithm only selects one support point to achieve the goal of dense support points and edge support points at the same time, and it takes a long time to build an index

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Metric space outlier detection method based on various supporting points
  • Metric space outlier detection method based on various supporting points
  • Metric space outlier detection method based on various supporting points

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0039] The concept, specific structure and technical effects of the present invention will be clearly and completely described below in conjunction with the embodiments and accompanying drawings, so as to fully understand the purpose, features and effects of the present invention. Apparently, the described embodiments are only some of the embodiments of the present invention, rather than all of them. Based on the embodiments of the present invention, other embodiments obtained by those skilled in the art without creative efforts belong to The protection scope of the present invention.

[0040] In order to better illustrate the technical solution created by the present invention, the technical terms in the technical field that appear in the description are now explained:

[0041] Outlier degree: The outlier degree of an object indicates the degree of its outlier. The average value of its distance from the k-nearest neighbor is often used as the outlier degree, or the distance v...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a metric space outlier detection method based on various supporting points. The method comprises the steps of selecting a distance function, the distance function is selected according to the data type of a data set; the step of selecting the supporting points, wherein the data set is read, and the non-repeated intensive supporting points and the edge supporting points are selected; the step of establishing an index, wherein the distances between all objects and the dense supporting points are calculated and sequenced to form the one-dimensional index, the distances between all the objects and the edge supporting points are calculated and marked as second distances, first distances and the second distances serve as coordinates, and a supporting point space is formed; the step of conducting outlier detection, wherein the one-dimensional index is divided into a plurality of data blocks, and outlier detection is performed on the data blocks one by one. Due to the fact that the dense supporting points and the edge supporting points are selected at the same time, the technical problem that the data space is distorted due to a single supporting point is avoided, meanwhile, the time and cost for establishing the index are reduced, the distance calculation frequency is reduced, and the outlier detection speed is increased. The method is used for detecting outlier points in the data set.

Description

technical field [0001] The invention relates to the field of data mining, in particular to a metric space outlier detection method based on multiple support points. Background technique [0002] Outliers are unusual data points in a data set that behave so differently from other points that one suspects that these data are not random deviations, but are produced by an entirely different mechanism. Outliers are also called abnormal points or abnormal objects. Outlier detection is also called anomaly detection, deviation detection or outlier mining, which is to detect outliers in the data set according to a certain algorithm. In other words, outlier detection is to mine the very few points in the massive data that are significantly different from the mainstream data. [0003] Most of the traditional outlier detection techniques are oriented to multi-dimensional space, which are only applicable to multi-dimensional data, and are helpless for complex data types such as images,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/901G06F16/90335
Inventor 许红龙戎海武何敏藩文翰杨勇
Owner FOSHAN UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products