HBase database-based data batch loading method and device
A technology of batch warehousing and database, which is applied in the field of HBase database, can solve the problems of long time-consuming and low efficiency of batch data warehousing, and achieve the effect of improving the efficiency of HBase batch warehousing, increasing the generation speed and
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0043] The core idea of the present invention is: aiming at the key link in the existing HBase data that restricts the batch storage of the HBase database, by using the source data to be stored to extract the row key and carry out the average partition according to the number of partitions specified, the data partition The scope is effectively divided to avoid data skew and cross-partition problems in the process of generating HFile files.
[0044] figure 1 It is a flow chart of a method for batch data storage based on HBase database provided by an embodiment of the present invention, see figure 1 , when there is no HBase table in the HBase database, this data batch storage method of the present invention comprises:
[0045] Step S110, extracting and sorting the row keys of the source data to be put into storage, and performing average partitioning of the sorted row keys according to the specified number of partitions, and determining the row keys corresponding to the end v...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 