On-line attribute abnormal point detecting method for supporting dynamic update

A technology of dynamic updating and detection methods, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problems of lack of technical solutions, inability to provide corresponding processing and storage capabilities, and achieve the effect of ensuring result errors

Inactive Publication Date: 2010-12-08
ZHEJIANG UNIV
View PDF2 Cites 43 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Since streaming data systems often need to detect a large amount of dynamic data in real time, and in most cases the actual deployment machine cannot provide corresponding processing and storage capabilities, it is necessary to provide an efficient detection method
[0005] Outlier detection in streaming data systems is a new research field and has important practical value, but there is still a lack of a mature technical solution. Therefore, it is necessary to provide an online attribute outlier detection method that can support dynamic updates. Under the premise of accuracy, it can process massive dynamic data in streaming data systems through approximate methods

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • On-line attribute abnormal point detecting method for supporting dynamic update
  • On-line attribute abnormal point detecting method for supporting dynamic update
  • On-line attribute abnormal point detecting method for supporting dynamic update

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] The technical solution of the present invention will be further described in conjunction with specific implementation and examples.

[0039] 1. If figure 1 and figure 2 Shown, the specific implementation process and working principle of the present invention are as follows:

[0040] 1) Select a data model that meets the requirements of the streaming data system in terms of data generation, detection methods, and user needs to maintain continuously and dynamically updated streaming data;

[0041] 2) Use the online clustering method to continuously and dynamically cluster the stream data, and realize the cluster division based on the correlation of data attributes;

[0042] 3) Dynamically maintain the cluster division under flow data update, and continuously update the neighbor relationship and distance information between data points, maintain cluster related information online, and maintain the relevant summary information structure with the dynamic update of data; ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an on-line attribute abnormal point detecting method for supporting dynamic update. The method comprises the following steps of: providing brand-new attribute abnormal point definition by analyzing practical application and user requirement, detecting an abnormal point on the premise of taking an attribute correlation between data points in a data set into consideration, providing more effective abnormal information relative to the traditional definition, and supporting on-line attribute abnormal point detection for dynamically updated stream data by combining practical stream data system application and using a sliding window and an on-line clustering method so as to provide real-time detection result feedback for a user. Aiming at the practical system overload condition in the stream data system application, a method for effectively reducing the load is provided so that the detecting method still can feed back the detection result in real time under the mass stream data update condition, the result error is in a user controlled range and effective balance of the detecting method between the running efficiency and the result precision is achieved.

Description

technical field [0001] The invention relates to data mining, outlier point detection and stream data system management technology, in particular to an online attribute outlier point detection method supporting dynamic update. Background technique [0002] Outlier detection is one of the most important application technical methods in the field of data mining. Different from other common data mining methods, outlier detection is to find relatively isolated and outlier abnormal points and abnormal patterns in massive data. Most of the early data mining systems only regard the abnormal points in the data as noise, and the detection of abnormal points is mainly used to clean the noise. However, with the generation of massive data, abnormal data can often reflect greater value, so the application of abnormal point detection in reality is becoming more and more extensive, such as network intrusion detection, bank credit fraud, etc. [0003] One of the first prerequisites for outl...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 陈刚寿黎但胡天磊陈珂曹晖
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products