Spark-based mass data sorting method, device, equipment, and storage medium
A Spark-based mass data sorting method, applied in the field of big data, addresses the problem of data skew and its adverse effect on server performance, thereby avoiding data skew and improving performance.
Embodiment Construction
[0069] The embodiments of the present invention provide a Spark-based mass data sorting method, device, equipment, and storage medium. Samples of any group are distributed to each partition with equal probability, and global group sorting is performed with the assistance of an external storage medium, thereby avoiding data skew.
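As an illustration of why equal-probability distribution avoids skew, the following minimal sketch compares it with routing by group key. It is a plain Python simulation, not Spark code and not the claimed implementation: the function names, the group labels, the record counts, and the number of partitions are all assumptions chosen for the example, and the global group sorting step assisted by an external storage medium is not modelled here.

```python
import random
import zlib


def key_hash_partition_sizes(records, num_partitions):
    """Baseline: route each record by a hash of its group key.

    Every record of a group lands on the same partition, so one very
    large group makes one partition very large (data skew).
    """
    sizes = [0] * num_partitions
    for group, _value in records:
        # crc32 gives a hash that is stable across Python processes.
        sizes[zlib.crc32(group.encode("utf-8")) % num_partitions] += 1
    return sizes


def equal_probability_partition_sizes(records, num_partitions, rng):
    """Assumed illustration: each record picks a partition uniformly at random.

    Samples of any group are spread over all partitions with equal
    probability, regardless of how large the group is.
    """
    sizes = [0] * num_partitions
    for _record in records:
        sizes[rng.randrange(num_partitions)] += 1
    return sizes


# Hypothetical workload: one hot group with 9000 records, ten small groups
# with 100 records each (10000 records in total).
records = [("hot", i) for i in range(9000)]
for g in range(10):
    records += [(f"group{g}", i) for i in range(100)]

NUM_PARTITIONS = 4
rng = random.Random(42)  # fixed seed so the run is repeatable

hash_sizes = key_hash_partition_sizes(records, NUM_PARTITIONS)
uniform_sizes = equal_probability_partition_sizes(records, NUM_PARTITIONS, rng)

print("partition sizes, routed by group key:   ", hash_sizes)
print("partition sizes, equal probability:     ", uniform_sizes)
```

With key-based routing, the partition that receives the hot group holds at least 9000 of the 10000 records, while the equal-probability assignment leaves every partition close to 2500 records. The price of this balance is that a group's records no longer sit on one partition, which is consistent with the description's reliance on a separate global group sorting step.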
[0070] The terms "first", "second", "third", "fourth", etc. (if any) in the description and claims of the present invention and in the above drawings are used to distinguish similar objects and are not necessarily used to describe a specific order or sequence. It is to be understood that the terms so used are interchangeable under appropriate circumstances, such that the embodiments described herein can be practiced in sequences other than those illustrated or described herein. Furthermore, the terms "comprising" and "having", and any variations thereof, are intended to cover a non-exclusive inclusion, for example, a process, method, system, product or devic...


