Data processing method and relevant equipment

A data processing and data technology, applied in the field of information processing, can solve the problems of not providing, the MapReduce architecture does not provide judgment methods, reduce the execution efficiency of slave node devices, etc., and achieve the effect of improving processing efficiency and optimizing the MapReduce architecture

Inactive Publication Date: 2015-06-24
HUAWEI TECH CO LTD +1
View PDF3 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The technical personnel of the present invention finds when realizing above-mentioned scheme, because no analytic function is provided in the MapReduce architecture, when analyzing the value of key-value pair, need to rely on the corresponding program written by the programmer; The size of the buffer for the correct value may not be consistent with the size of the buffer allocated by the GPU to store data, and the MapReduce architecture does not provide a corresponding judgment method, which also depends on the corresponding judgment function written by the programmer. Whether it is consistent to judge and reduce the execution efficiency of the slave node device

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data processing method and relevant equipment
  • Data processing method and relevant equipment
  • Data processing method and relevant equipment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0057] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings of the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0058] Embodiments of the present invention provide a data processing method and related equipment, which are applied to Hadoop clusters under the MapReduce architecture, realize Hadoop slave node equipment data format automatic conversion and data automatic splicing, simplify the programming work of programmers, and facilitate subsequent optimization of MapReduce architecture.

[0059] Such as figure 1 As shown, the present invention provides a data processing me...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A data processing method and a related device, implementing automatic conversion of data format and automatic splicing of data in a node device by a Hadoop. The method mainly comprises: a data preprocessor reads metadata from a first buffer area of a CPU, reads data of a data collection from the first buffer area on the basis of a memory address indicated by the metadata, converts, on the basis of a preset analytic function, the data of the data collection into a data format indicated by the preset analytic function, and stores data blocks generated with the converted data collection in a second buffer area of the CPU, thus allowing a data splicer to read from the second buffer area the data blocks and to splice same to a GPU.

Description

technical field [0001] The invention relates to the technical field of information processing, in particular to a data processing method and related equipment. Background technique [0002] Together with cloud computing, big data has brought a new revolution to information technology (IT, Information Technology). Cloud computing has powerful big data computing capabilities and very fast computing speed, but the transmission of big data has become a major problem. [0003] MapReduce (there is no unified Chinese translation in this field) is a famous cloud computing architecture provided by Google search engine Google, which is used for parallel computing on large-scale data sets (greater than 1TB). Hadoop (there is no unified Chinese translation in this field) Chinese translation) is the specific implementation of the MapReduce architecture, which is divided into master node devices and slave node devices in the Hadoop cluster. Among them, the Map function provided by MapRed...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F9/44G06F9/50H04L29/08
CPCG06F9/541
Inventor 崔慧敏谢睿阮功杨文森
Owner HUAWEI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products