An optimization method for large and small table association in hive
An optimization method and correlation analysis technology, applied in the field of big data processing, can solve problems such as low efficiency, and achieve the effect of improving efficiency and reducing the amount of data
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0030] Such as figure 1 with figure 2 An optimization method for large and small table association in hive is shown, including the following steps:
[0031] Step 1: Establish a server cluster composed of multiple servers, and establish a Hadoop framework structure on the basis of the server cluster;
[0032] Step 2: build the hive data warehouse tool on the Hadoop framework structure, the Hive data warehouse tool provides an HQL interface externally, and the Hive data warehouse tool maps large-scale data sets stored on HDFS or other storage media into data tables, and the data tables According to the size of the data volume, it is divided into large data table and small data table;
[0033] Step 3: The Hive client completes the analysis of the data table with the help of Mapreduce at the bottom layer of the Hive data warehouse tool;
[0034] Step 4: Using the MapReduce computing framework as the execution engine of hive, the hive client executes multi-table association tas...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


