Spark partition load balancing method
A load balancing and sub-partitioning technology, which is applied in the field of big data, can solve the problems of long application program running time and shorten application completion time, etc., and achieve the effects of shortening completion time, alleviating data skew, and partition load balancing
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0028] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.
[0029] see figure 1 , the embodiment of the present invention provides a load balancing method, including
[0030] S1. After starting the Map task, obtain and count the operation information through the partition monitor, and obtain the operation statistics;
[0031] S2. After obtaining the operation statistics information, use the partition size predictor to calculate the amount of intermediate data generated by each partition after 100% of the mapping task is co...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


