Method and device for detecting abnormal subsequence in data sequences

A technology for detecting data and sub-sequences, which is applied in the direction of error detection/correction, electrical digital data processing, and special data processing applications. The effect of small calculation amount, reduced time required, and improved accuracy

Active Publication Date: 2015-07-29
IBM CORP
View PDF4 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In these traditional methods, when detecting whether some data in a time data sequence is abnormal, all the data of the sequence need to be used, and the entire time data sequence needs to be scanned multiple times during the detection period, which makes the calculation of the detection operation very large. and it takes a long time
In addition, since the distribution density of all data in the time data series (especially the time data series generated in a long period of time) often varies greatly in the mapping space, if some data are detected based on the distribution density of all data Whether it is abnormal, the normal data whose distribution density is quite different from other data may be identified as abnormal data, making the result inaccurate
Moreover, traditional methods can only perform offline (non-real-time) detection of time series data, rather than online (real-time) detection, which is unacceptable for some scenarios that hope to obtain detection results as soon as possible

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for detecting abnormal subsequence in data sequences
  • Method and device for detecting abnormal subsequence in data sequences
  • Method and device for detecting abnormal subsequence in data sequences

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] Some preferred embodiments of the present disclosure are shown in the drawings and will be described in more detail below with reference to the drawings. However, the present disclosure can be implemented in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the scope of the disclosure to those skilled in the art.

[0019] Those skilled in the art know that various aspects of the present invention can be implemented as a system, method or computer program product. Therefore, various aspects of the present invention can be embodied in the following forms, that is: a complete hardware implementation, a complete software implementation (including firmware, resident software, microcode, etc.), or a combination of hardware and software implementations, These may collectively be referred to herein as "circuits," "modules," or "systems....

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A method for detecting abnormal subsequences in data sequence includes constructing a hierarchical data structure of a target subsequence, each node in a bottommost layer of the data structure storing corresponding data of the target subsequence, and each node in a layer above the bottommost layer storing values based on data stored in corresponding nodes in a lower layer next to the layer above the bottommost layer; determining a second number of neighbors of the target subsequence based on the data structure of the target subsequence and of the first number of reference subsequences constructed in advance, the second number of neighbors having minimum Euclidean distances from the target subsequence; determining a third number of neighbors of each reference subsequence in the second number of reference subsequences, which have minimum Euclidean distances from each reference subsequence and determining whether the target subsequence is an abnormal subsequence.

Description

technical field [0001] The present invention relates to abnormal data detection, and more particularly to a method and device for detecting abnormal subsequences in a data sequence. Background technique [0002] In scenarios such as the Internet of Things (IOT) or Smart Earth, through a certain data generation mechanism, data can be continuously generated over time to form a time data sequence. For example, in a scenario where a detector is used to detect air pollutants, the detector continuously outputs data over time, thereby forming a time data sequence reflecting the air pollution level at each moment. In time data series, some data may deviate greatly from other data, such data can be called abnormal data. Since abnormal data can reflect certain problems in the data generation mechanism or some important states of the objects associated with the data, it is very important to detect abnormal data in time data series. [0003] Currently, various methods have been propos...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F11/0721G06F11/079G05B23/0232G06F11/0751
Inventor 刘凯陈垚亮陈晓艳黄胜王晨
Owner IBM CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products