Data parallel sequencing method and system
Patent Information
- Authority / Receiving Office
- CN Β· China
- Current Assignee / Owner
- BEIJING QIHOO TECH CO LTD
- Publication Date
- 2014-01-22
- Estimated Expiration
- Not applicable Β· inactive patent
Smart Images
Figure 1 Figure 2
Abstract
Description
technical field
[0001] The invention relates to a data processing method and system, in particular to a data parallel sorting method and system. Background technique
[0002] Global sorting of data in large-scale data processing is a common operation, such as PageRank calculation. Traditional sorting algorithms can be divided into inner sorting and outer sorting. Among them, internal sorting includes insertion sorting, quick sorting, etc., and all data needs to be loaded into the memory for calculation. When the data to be sorted is large-scale data, the stand-alone memory becomes the bottleneck. Outer sorting is mainly based on the sorting algorithm of multi-way merge, which can handle large-scale data, but the speed is slow. At present, in PageRank calculation, a global sorting of the final calculation results is required, and the data scale is hundreds of GB. Considering the calculation scale, a parallel mechanism needs to be introduced. However, the existing parallel ...