A load balancing method and device for solving spark data skew problem
A load balancing and data technology, applied in multi-programming devices, electrical digital data processing, program control design, etc., can solve the problems of redundant task occupation and increase of the total completion time of the operation.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0060] The preferred embodiments of the present invention will be described in detail below with reference to the accompanying drawings.
[0061] Such as figure 2 As shown, a load balancing method for solving the Spark data skew problem described in this embodiment includes the following six steps:
[0062] S101. Monitor the average CPU utilization rate and memory utilization rate of the computing nodes, and initialize the weight information of the Executor after the Spark Executor process starts;
[0063] S102. Each computing node samples the local intermediate data according to the sampling ratio, which is set individually by the user, and then the computing node sends the local sampling information to the Master node through message communication;
[0064] S103. The Master node summarizes the sampling information of all computing nodes, and then establishes a histogram of data distribution according to the sampling ratio, and predicts the overall characteristics of the da...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More - R&D
- Intellectual Property
- Life Sciences
- Materials
- Tech Scout
- Unparalleled Data Quality
- Higher Quality Content
- 60% Fewer Hallucinations
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2025 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com



