Data out-of-order arrival processing method and system

A processing method and processing system technology, applied in the field of data out-of-order arrival processing methods and systems, to achieve the effect of improving processing capacity, improving effectiveness and timing

Active Publication Date: 2017-08-01
CHENGDU SEFON SOFTWARE CO LTD
View PDF5 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The purpose of the present invention is to overcome the deficiencies in the prior art, and provide a method and system for processing data out of order. The present invention uses the principle of time slice and batch distribution of real-time streaming data to make the Spark processing node store Redis process Logical allocation is carried out to solve the problem of out-of-order arrival of real-time streaming data and improve the validity and timing of data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data out-of-order arrival processing method and system
  • Data out-of-order arrival processing method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0061] According to the principle of Redis storage, the present invention stores data in batches for invoking and querying through external interfaces. At present, a SDC Sream big data real-time streaming product has adopted the present invention to process data out of order, and has passed actual tests and applications. , the effect is very ideal. The technical storage solution of the present invention is further described in detail below in conjunction with the accompanying drawings:

[0062] Such as figure 1As shown, in the flow chart of the storage steps in the present invention, it mainly includes several steps such as streaming data reception, data conversion, data time segment judgment, and data storage to Redis. In this embodiment, the logic judgment of data storage is completed through Java code combined with Spark scheduling and Redis storage:

[0063] Step 1: Receive streaming data and transmit it to the out-of-order arrival processing module;

[0064] Step 2: Pr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a data out-of-order arrival processing method and system. The method comprises the following steps of extracting a current time window field, and processing the time window field into date type data; judging whether user stream data has a specified time window field or not, and performing corresponding processing; marking a time window field where a time slicing field is located, and extracting a data set before the time window field from a Redis storage library; judging whether time slicing data of the marked time window field is in the extracted data set or not, and performing corresponding processing; and storing the user stream data in the Redis storage library, and updating the Redis storage library. The system comprises a data processing module, a first judgment module, a marking module, a second judgment module and a Redis storage library module. According to the method and the system, the problem of out-of-order arrival of real-time stream data is solved; the method and the system are especially suitable for solving the problem of a non-serialization scene of a data source; and the validity and time sequence of the data are improved.

Description

technical field [0001] The invention relates to the technical field of big data analysis and processing, in particular to a data out-of-order arrival processing method and system. Background technique [0002] In the context of the current big data industry, real-time streaming technology is a data processing technology that pushes batched, orderly, and neat serialized data to the analyzer in a fixed manner. Since the analyzer has strict requirements on the data format, this directly leads to the fact that in most cases, the data format is single and the serialization requirements are strict. However, in real-time streaming data sources, the data often does not come from highly serialized scenarios. Due to the out-of-order arrival of data, the data cleaning results are often inconsistent with the original data results, the data timing is poor, and the data quality is low. Contents of the invention [0003] The purpose of the present invention is to overcome the deficienci...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/22
Inventor 李广王纯斌曹洹太覃进学刘旻哲
Owner CHENGDU SEFON SOFTWARE CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products