Unlock instant, AI-driven research and patent intelligence for your innovation.

Outlier data discovery method and system based on low water level sliding time window

A sliding time window and outlier data technology, which is applied in digital transmission systems, transmission systems, data exchange networks, etc., can solve the problems of not giving outlier data judgment method and inability to strictly distinguish outlier data, so as to reduce data The effect of dealing with the number of misjudgments of fault recovery, improving reliability, and accelerating fault recovery

Active Publication Date: 2020-05-05
UNIV OF JINAN
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This method provides a judgment method for missing data, but does not give a judgment method for outlier data, and only using time points to indicate low water levels cannot strictly distinguish outlier data
The existing technology Trident avoids outlier data through the strict and orderly requirements of the data to be processed. This method relies on the transaction framework and generates a lot of additional overhead.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Outlier data discovery method and system based on low water level sliding time window
  • Outlier data discovery method and system based on low water level sliding time window
  • Outlier data discovery method and system based on low water level sliding time window

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047] The present invention will be further described below in conjunction with the accompanying drawings and embodiments.

[0048] Such as figure 1 As shown, the stream processing network topology includes data distribution 101 , various data processing 102 , 103 , 104 , 105 and data aggregation 106 . The data distribution 101 is used to receive external data streams and forward them to various subsequent data processing. The data processing 102, 103, 104, and 105 are calculation units of stream processing. The data aggregation 106 is to summarize and output the results of data processing. Data streams from different keywords can be executed concurrently on different data processing nodes.

[0049]The low water level is defined based on the data flow between data processing, and identifies the timestamp of the earliest unprocessed data packet in the current data processing, so as to ensure that the current data processing will no longer generate data packets with earlier ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an outlier data discovery method and system based on a low-water mark sliding time window. The method comprises the following steps: (1) data distribution, that is, external data streams are received and then distributed to various data processing nodes; (2) data processing, that is, the data processing nodes process the received external data streams; the low-water mark sliding time window is defined, a time stamp is used as a horizontal coordinate axis, and the low-water mark sliding time window constantly moves from left to right on the time stamp horizontal coordinate axis as time passes by; at any time point, data in the low-water mark sliding time window above the horizontal coordinate axis is unprocessed data, and data in the low-water mark sliding time window below the horizontal coordinate axis is processed data; and whether currently processed data is outlier data or not is discovered according to a position of a current data processing time stamp in the low-water mark sliding time window; and (3) data aggregation, that is, data processing results are aggregated and output. The method and system has the advantages that discardable data, the outlier data and data to be normally processed can be distinguished, so that the data processing reliability is improved, and failure recovery is accelerated.

Description

technical field [0001] The invention relates to a method for discovering outlier data, in particular to a method and system for discovering outlier data based on a low water level sliding time window. Background technique [0002] Stream processing is the real-time computation of constantly changing streams of data. In order to meet the challenges brought by users' real-time processing of massive data, and to solve the bottleneck problem of real-time processing represented by traditional MapReduce batch processing, the emerging stream processing method can be used in risk management, marketing management, advertising, social recommendation, etc. All aspects have important application value. [0003] The source of data for stream processing is due to network delays, internal concurrency of the system, etc., and the same type of data cannot be guaranteed to arrive at the data processing node in strict order of time stamps, and there are outlier data that are inconsistent in t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): H04L12/26H04L12/24
CPCH04L41/06H04L43/0852H04L43/16
Inventor 马坤周劲于自强纪科
Owner UNIV OF JINAN