Data parallel processing method, device and system
A parallel processing and data technology, applied in the field of data processing, can solve the problems of increasing storage space, increasing the extra overhead of starting and stopping tasks multiple times, increasing the difficulty of data consistency check, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0037] In order to enable those skilled in the art to better understand the solution of the present invention, the present invention will be further described in detail below with reference to the accompanying drawings and embodiments.
[0038] The data parallel processing method, device and system of the embodiments of the present invention are applied to Hadoop's parallel computing framework MapReduce. In order to better understand the scheme of the present invention, first, a brief description of the processing flow of MapReduce in the prior art is given.
[0039] In the description of the following embodiments, the file stored on the collection server is referred to as a local file.
[0040] Such as figure 1 As shown, it is a typical processing flow of MapReduce in the prior art, where:
[0041] The Map task reads the data to be processed through the corresponding input source class, and after the data is converged / aggregated, the Reduce task outputs the data through the correspond...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 