Distributed data processing method, device and system
A technology of distributed data and processing methods, applied in the field of data processing, can solve problems such as large network traffic, achieve the effects of avoiding system resources, improving the effect of distributed data processing, and avoiding data migration
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0028] figure 1 It is a flow chart of the distributed data processing method provided by Embodiment 1 of the present invention. Such as figure 1 As shown, the distributed data processing method provided in this embodiment is specifically applied to the data processing process of the Map-Reduce system, and the Map-Reduce system may specifically include a master node and at least two working nodes. The distributed data processing method provided in this embodiment may be executed by a distributed data processing device, and the distributed data processing device may be a master node, and may be implemented by means of software and / or hardware.
[0029] The distributed data processing method provided in this embodiment specifically includes:
[0030] Step 10, the master node generates a mapping Map task according to the obtained upload node indication information and the task acquisition request sent by the work node, wherein the upload node indication information includes addr...
Embodiment 2
[0039] image 3 It is a flow chart of the distributed data processing method provided by Embodiment 2 of the present invention. Such as image 3 As shown, the distributed data processing method provided by this embodiment is in figure 1 On the basis of the illustrated embodiment, in step 10, before the master node generates a mapping Map task according to the obtained upload node indication information and the task acquisition request sent by the working node, the following steps may be specifically included:
[0040] Step 30, the master node generates file division indication information and the upload node indication information according to the file information sent by the client, and sends the file division indication information and the upload node indication information to the client, so that the client divides the file to be processed into a plurality of data blocks according to the file division indication information, and sends each data block to a corresponding wor...
Embodiment 3
[0060] Figure 7 It is a schematic structural diagram of a distributed data processing device provided in Embodiment 3 of the present invention. Such as Figure 7 As shown, the distributed data processing device provided in this embodiment can specifically implement each step of the distributed data processing method provided in any embodiment of the present invention, which will not be repeated here.
[0061] The distributed data processing device provided in this embodiment specifically includes a task generating unit 11 and a task allocating unit 12 . The task generating unit 11 is configured to generate a mapping Map task according to the obtained upload node indication information and the task acquisition request sent by the working node, wherein the upload node indication information includes the addresses of the work nodes corresponding to the plurality of data blocks respectively, and the The data blocks corresponding to the Map task are distributed on the working node...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 