Tuple tracking method and tuple tracking system for data flow processing

A technology for tracking systems and data streams, applied in electrical digital data processing, special data processing applications, digital transmission systems, etc. Reliability, load balancing, and memory overhead savings

Active Publication Date: 2013-10-09
INST OF INFORMATION ENG CAS
View PDF3 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The time overhead of upstream backup is less, only the upstream server needs to back up the output tuple in the output queue, but the recovery time of failure is also longer
[0007] However, the above backup technology is only applicable to the case where the failure granularity is the server. When the server does not fail, but the tuples processed by it are lost due to memory constraints and o

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Tuple tracking method and tuple tracking system for data flow processing
  • Tuple tracking method and tuple tracking system for data flow processing
  • Tuple tracking method and tuple tracking system for data flow processing

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0069] The principles and features of the present invention are described below in conjunction with the accompanying drawings, and the examples given are only used to explain the present invention, and are not intended to limit the scope of the present invention.

[0070] Tuple: The basic data structure that makes up a data stream. A tuple is a list composed of some Values. Value can be of any type, such as integer, byte, character, bit array, floating point, double, bit, short integer, long integer, Boolean Etc., also can be a custom serializable type.

[0071] Tuple state: The state of the tuple processing process is divided into the following three types: the processing state pending, the processing failure state failure, and the processing success state finish.

[0072] Numeric ring: from 0 to 2 32 An end-to-end numeric space consisting of -1 values.

[0073] Data Stream: A data stream is an unbounded sequence of tuples.

[0074] Spring: tuple generator, a component th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a tuple tracking method and a tuple tracking system for data flow processing. The tuple tracking system comprises a tuple generator, a tupler tracker and a plurality of tuple processors. The tuple generator generates root tuples, processes the root tuples, and generates new tuples, and then transmits the new tuples to the different tuple processors. The tuple processors process the received tuples and generate new tuples. Each root tuple can generate a tuple tree after being processed, in the generation process of each tuple tree, the tuple generator transmits relevant information of the root tuple to the tuple tracker, then the tuple tracker can construct a tracking record of the root tuple, and the tuple tracker chooses one tuple tracking unit for each root tuple. When each tuple processor processes the tuple, relevant information of the tuple is sent to the tuple tracker to update identification positions of the tracking records. The tuple tracking method and the tuple tracking system for data flow processing can greatly reduce internal storage cost, achieve load balance of the tuple tracking units, and improve reliability of tuple processing.

Description

technical field [0001] The invention relates to the field of distributed data flow processing, in particular to a data flow processing-oriented tuple tracking method and system that ensures that each tuple that needs to be processed in the data flow will not be lost and cannot be processed. Background technique [0002] With the rise of technologies such as cloud computing and the Internet of Things, data is growing and accumulating at an unprecedented rate, and more and more appear in applications in the form of large-scale and continuous streams. The most typical application Monitoring applications, such as financial market monitoring, network monitoring, mobile object monitoring, intrusion detection, and ecosystem monitoring, etc., because these applications monitor real-time data, so the value of the data will continue to decrease over time , so low-latency processing is a key requirement for this type of application. For this reason, many data stream processing systems ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): H04L12/24H04L29/08G06F17/30
Inventor 张鹏杜华明徐克付张闯谭建龙
Owner INST OF INFORMATION ENG CAS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products