Unlock instant, AI-driven research and patent intelligence for your innovation.

Data processing method, apparatus and system

A data processing device and data processing technology, applied in the field of data processing, can solve problems such as dependency deadlock, performance degradation of big data analysis, frequent read and write operations, etc.

Active Publication Date: 2017-03-08
HUAWEI TECH CO LTD
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] Embodiments of the present invention provide a data processing method, device, and system, so as to avoid the problem of dependency deadlock when analyzing and processing big data based on ETL technology, resulting in too frequent IO read and write operations on the disk or memory, resulting in large The problem that the performance of data analysis is greatly reduced

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data processing method, apparatus and system
  • Data processing method, apparatus and system
  • Data processing method, apparatus and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0050] The following will clearly and completely describe the technical solutions in the embodiments of the present invention in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0051] The embodiment of the present invention provides a data processing method and device to avoid the dependency deadlock problem in the prior art when analyzing and processing big data based on ETL technology, which causes too much IO read and write operations on disk or memory. Frequently, the performance of big data analysis is greatly reduced. Wherein, the method and the device are based on the same inventive concept, and since the princ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a data processing method, apparatus and system to avoid the problem of excessively frequent IO reading-writing operations on a disk or a memory caused by the problem of dependency on deadlock during big data analysis processing based on an ETL technology. The method comprises the steps of determining nodes, which meet the condition, in all nodes comprised in an ETL system; for each determined node which meets the condition, selecting part of non-blocking nodes from all non-blocking nodes existent on a transmission path passed by a non-blocking data source received by the node, and modifying the selected non-blocking nodes to blocking nodes; and / or storing the non-blocking data source received by the node in the local side of the node. By adopting the method, the problem of dependency on the deadlock state due to the data analysis processing based on the ETL technology can be solved with relatively low performance loss, so that the problem of greatly reduced big data analysis performance due to the excessively frequent IO reading-writing operations on the disk or the memory can be avoided.

Description

technical field [0001] The present invention relates to the technical field of data processing, in particular to a data processing method, device and system. Background technique [0002] Data extraction-transform-load (Extract-Transform-load, ETL) is used to implement the process of extracting, transforming, and loading the data to be analyzed from the source to the destination. ETL is more commonly used in data warehouses. As an important part of building a data warehouse, users extract the required data from the data source, after data cleaning, and finally load the data into the data warehouse according to the predefined data warehouse model. [0003] The system based on ETL technology includes three types of nodes for data extraction, data conversion and data loading. Each node is used to complete different functions, and each node is connected by a connection. Represents the specific data flow direction, and nodes with different functions are logical nodes used to com...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 朱玉麒
Owner HUAWEI TECH CO LTD