
Real-time ETL dataflow conversion processing technical method and system

A data flow conversion processing technology, applied in electrical digital data processing, special data processing applications, instruments, and similar fields. It addresses problems such as the inability to standardize and convert data effectively in real time and the inability to adjust resources dynamically, and it achieves flexible configuration and stable data conversion.

Inactive Publication Date: 2018-04-13
上海中畅数据技术有限公司
4 Cites, 2 Cited by

AI Technical Summary

Problems solved by technology

[0004] The technical problem to be solved by the present invention is as follows. In the field of big data conversion, data cannot currently be standardized and converted effectively in real time, and a single point problem wastes resources; in addition, because JStorm tasks are hard-coded, they cannot be changed in real time and resources cannot be adjusted dynamically. To solve these problems, the present invention provides a real-time ETL data stream conversion processing method. The method comprises a series of processing steps and provides a topology graph configuration that an internal converter turns into SQL-like statements, so configuration is flexible. It can still work when the network is interrupted or a machine goes down, which ensures the stability of data conversion. In addition, the internal processing module provides a dynamically configurable conversion mode.
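The idea of turning a topology graph configuration into SQL-like statements can be illustrated with a minimal sketch. The source does not give the configuration format or the SQL-like dialect, so everything here is an assumption made for illustration: the node types (`source`, `filter`, `project`, `sink`), the key names, the topic names, and the function `topology_to_sql` are all hypothetical, and the sketch handles only a linear chain of nodes.

```python
def topology_to_sql(nodes: list) -> str:
    """Compile a linear chain of topology nodes into one SQL-like statement.

    The node schema used here is hypothetical; the patent text does not
    specify one.
    """
    source = None
    sink = None
    columns = ["*"]
    predicates = []
    for node in nodes:
        kind = node["type"]
        if kind == "source":
            source = node["topic"]
        elif kind == "filter":
            predicates.append(node["condition"])
        elif kind == "project":
            columns = node["fields"]
        elif kind == "sink":
            sink = node["topic"]
        else:
            raise ValueError(f"unknown node type: {kind}")
    if source is None or sink is None:
        raise ValueError("topology needs a source node and a sink node")
    sql = f"INSERT INTO {sink} SELECT {', '.join(columns)} FROM {source}"
    if predicates:
        sql += " WHERE " + " AND ".join(f"({p})" for p in predicates)
    return sql


# Hypothetical topology: read raw metrics, drop negative values,
# keep three fields, write to a cleaned topic.
topology = [
    {"type": "source", "topic": "metrics_raw"},
    {"type": "filter", "condition": "value >= 0"},
    {"type": "project", "fields": ["timestamp", "host", "value"]},
    {"type": "sink", "topic": "metrics_clean"},
]
print(topology_to_sql(topology))
```

Because the topology is plain data rather than hard-coded task logic, it could in principle be edited and recompiled while the system runs, which is the kind of flexibility the paragraph above describes.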



Examples

Embodiment Construction

[0022] The present invention is now described in further detail with reference to the accompanying drawings. The drawings are simplified schematic diagrams that illustrate only the basic structure of the present invention, so they show only the configurations related to it.

[0023] A real-time ETL data stream conversion processing method comprises the following specific steps:

[0024] 1) Data is standardized first: the various kinds of log data are standardized into log templates, and system performance indicators and business indicators are standardized into indicator templates. A template is a time-series representation that mainly includes a timestamp, dimensions, measurement values, additional values, and so on. Standardizing the diverse data in this way provides a basis for conversion;

[0025] 2) The standardized data is serialized with Avro and put into a Kafka message queue;

[0026] 3) The ...
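Steps 1) and 2) above can be sketched roughly as follows. This is an illustration under stated assumptions, not the patented implementation: the class `MetricTemplate`, the raw field names (`time`, `host`, `name`, `value`), and the function names are hypothetical. To keep the sketch self-contained, JSON encoding stands in for Avro serialization and an in-memory `queue.Queue` stands in for the Kafka message queue; a real pipeline would use an Avro schema and a Kafka producer instead.

```python
from dataclasses import dataclass, field, asdict
from datetime import datetime, timezone
import json
import queue


@dataclass
class MetricTemplate:
    """Hypothetical indicator template: one point of a time series."""
    timestamp: int                               # epoch milliseconds (UTC)
    dimensions: dict                             # e.g. {"host": "web-01"}
    measures: dict                               # numeric measurement values
    extras: dict = field(default_factory=dict)   # additional values


KNOWN_FIELDS = {"time", "host", "name", "value"}


def standardize_metric(raw: dict) -> MetricTemplate:
    """Map a raw record (assumed field names) onto the indicator template."""
    # The raw time string is assumed to be a naive ISO timestamp in UTC.
    ts = datetime.fromisoformat(raw["time"]).replace(tzinfo=timezone.utc)
    return MetricTemplate(
        timestamp=int(ts.timestamp() * 1000),
        dimensions={"host": raw["host"], "metric": raw["name"]},
        measures={"value": float(raw["value"])},
        extras={k: v for k, v in raw.items() if k not in KNOWN_FIELDS},
    )


def encode(record: MetricTemplate) -> bytes:
    """Serialize a standardized record (JSON here, as a stand-in for Avro)."""
    return json.dumps(asdict(record), sort_keys=True).encode("utf-8")


# In-memory stand-in for a Kafka topic.
buffer = queue.Queue()

raw = {"time": "2018-04-13T00:00:00", "host": "web-01",
       "name": "cpu", "value": "0.75", "unit": "ratio"}
point = standardize_metric(raw)
buffer.put(encode(point))
```

Once every source is mapped onto the same template shape, downstream conversion steps only need to understand timestamps, dimensions, and measures, regardless of where a record came from.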


Abstract

The invention relates to the technical field of data processing, and in particular to a real-time ETL data stream conversion processing method and system. The processing method comprises a series of steps and provides a topology graph configuration that an internal converter turns into SQL-like statements, so configuration is flexible. The system can still work when the network is interrupted or a machine goes down, which ensures the stability of data conversion. In addition, an internal processing module provides a dynamically configurable conversion mode. Together these solve the current problems in the field of big data conversion: data cannot be standardized and converted effectively in real time, resources are wasted because of a single point problem, and JStorm tasks are hard-coded, so they cannot be changed in real time and resources cannot be adjusted dynamically.

Description

Technical field

[0001] The invention relates to the technical field of big data real-time stream processing, and in particular to a real-time ETL data stream conversion processing method and system.

Background

[0002] ETL describes the process of extracting data from a source, transforming it, and loading it into a target; it is used to build data warehouses. JStorm is a distributed real-time computing engine, a system similar to Hadoop MapReduce. The user implements a task according to a specified programming specification and submits it to the JStorm system, which then runs the task around the clock (7×24). If a worker fails unexpectedly, the scheduler immediately assigns a new worker to replace the failed one. Therefore, from an application point of view, a JStorm application is a distributed application that complies with a certain programming specification. From a system perspective, JStorm has a scheduling system similar to MapReduce. From the persp...


Application Information

IPC(8): G06F17/30
CPC: G06F16/254
Inventor: 朱志刚, 朱明磊
Owner: 上海中畅数据技术有限公司