A Hive-based hierarchical design method of a university data warehouse
A data warehouse and design method technology, applied in the database field, can solve the problems of poor flexibility and the inability to realize incremental + full data synchronization in the three-tier design framework, and achieve the effect of strong scalability
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0048] Such as figure 1 , is a university data warehouse framework, the whole framework is divided into four layers, namely data source, data storage layer, data analysis layer and data application layer.
[0049] The data sources include data from various systems of the school, and the format includes structured tables and unstructured log data;
[0050] ETL tools, such as Sqoop tools or open source kettle, clean, convert, and load data from data sources to the Hadoop distributed platform, use Hdfs (distributed file system) distributed storage, and Hive distributed processing;
[0051] Through the Hive tool, the data of the data storage layer is established as a data warehouse, that is, a data analysis layer. The data warehouse is divided into an ODS data storage layer, a DWD data detail layer, a DW data summary layer, and a DWA data application layer;
[0052] Among them, the ODS data storage layer is a data cache layer, which is used to store the acquired original data, re...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 
