Dimension modeling method, dimensional modeling apparatus, device and storage medium

A dimensional modeling method, apparatus, device and storage medium, applied in the field of data modeling. The technology addresses the excessive joins required by existing modeling methods and the reduced query efficiency caused by the extra map-reduce jobs those joins create, and it avoids the field expansion that redundancy would otherwise cause.

Inactive Publication Date: 2018-10-12
WUBA

AI Technical Summary

Problems solved by technology

[0003] However, once data volumes rise to the TB/PB level, the change in data volume and the change in computing and storage methods under the Hadoop system magnify a shortcoming of these modeling methods: they require too many joins (associated queries).
Specifically, take a join on Hive of table A and table B through the a1 field of table A and the b1 field of table B as an example. This query is converted into a Hadoop map-reduce job. Every additional table that is joined, for example joining the a2 field of table A with the c1 field of table C, adds one more map-reduce job, and the growing number of map-reduce jobs directly reduces query efficiency.
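The cost described in [0003] can be illustrated with a small in-memory sketch. This is only an analogy and not Hive's actual execution: each call to the hypothetical `hash_join` helper stands in for one map-reduce job, and the table contents and the extra column names (`amount`, `b_name`, `c_name`) are invented for illustration. Only the join fields a1/b1 and a2/c1 come from the text.

```python
def hash_join(left, right, left_key, right_key):
    """One join pass: index the right table by key, then probe it with each left row."""
    index = {}
    for row in right:
        index.setdefault(row[right_key], []).append(row)
    return [{**l, **r} for l in left for r in index.get(l[left_key], [])]


# Illustrative rows only; the patent text does not give sample data.
table_a = [{"a1": 1, "a2": 10, "amount": 5}, {"a1": 2, "a2": 20, "amount": 7}]
table_b = [{"b1": 1, "b_name": "x"}, {"b1": 2, "b_name": "y"}]
table_c = [{"c1": 10, "c_name": "p"}, {"c1": 20, "c_name": "q"}]

passes = 0

# Join A with B on a1 = b1: the first pass over the data.
result = hash_join(table_a, table_b, "a1", "b1")
passes += 1

# Join the intermediate result with C on a2 = c1: one more pass.
result = hash_join(result, table_c, "a2", "c1")
passes += 1

print(passes, "passes for", len(result), "result rows")
```

In this sketch every additional joined table adds one more full pass over the intermediate result, which mirrors the growth in job count that the paragraph describes.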


Examples

Embodiment Construction

[0051] Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.

[0052] As shown in Figure 1, traditional dimensional modeling theory divides the data model into fact tables and dimension tables: the fact table records the events of a business process, and the dimension table records the environment in which the business process occurs. In the dimension table, it is necessary to generate a unique surrogate key for each dimension value, and realize the processing of slowl...
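As a minimal sketch of the surrogate-key requirement mentioned in [0052], the snippet below assigns one integer key per distinct dimension value. The function name `assign_surrogate_keys`, the sequential integer scheme, and the sample city values are assumptions made for illustration; the source does not say how the keys are generated.

```python
from itertools import count


def assign_surrogate_keys(natural_values):
    """Map each distinct dimension value to a unique integer surrogate key."""
    next_key = count(1)  # assumed scheme: sequential integers starting at 1
    keys = {}
    for value in natural_values:
        if value not in keys:
            keys[value] = next(next_key)
    return keys


# Hypothetical dimension values; repeated values reuse the same key.
city_keys = assign_surrogate_keys(["beijing", "shanghai", "beijing"])
print(city_keys)
```

A fact table built on this scheme would store the integer key rather than the dimension value, which is why a query in the traditional model needs a join to recover the dimension attributes.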

Abstract

The invention discloses a dimensional modeling method, a dimensional modeling apparatus, a device and a storage medium. The method comprises the following steps: constructing a logical model of a fact table and a dimension table according to business needs; and constructing a physical model of the fact table and the dimension table according to the logical model, wherein, when the physical model of the fact table is constructed, specified dimension data is denormalized into the fact table through a set composite field. With the method, apparatus, device and storage medium provided by the invention, the dimension data is denormalized into the fact table, so a data query can read the fact table directly without performing an associated query (join). Moreover, because the denormalized dimension data is stored and recorded in the set composite field, the field expansion that redundancy would otherwise cause is avoided.
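The idea in the abstract can be sketched as follows, under stated assumptions. The source does not specify the type of the "composite field", so this sketch assumes a map-like field and models it as a Python dict nested in each fact row. The helper `denormalize`, the field name `user_dim`, and the sample tables are all hypothetical.

```python
def denormalize(fact_rows, dim_rows, fact_key, dim_key, composite_name):
    """Copy each fact row, embedding its dimension attributes in one composite field."""
    dim_index = {d[dim_key]: d for d in dim_rows}
    wide_rows = []
    for fact in fact_rows:
        dim = dim_index.get(fact[fact_key], {})
        # All dimension attributes go into a single map-like field,
        # so the fact table gains one column, not one per attribute.
        attrs = {k: v for k, v in dim.items() if k != dim_key}
        wide_rows.append({**fact, composite_name: attrs})
    return wide_rows


# Hypothetical fact and dimension tables.
orders = [{"order_id": 1, "user_id": 7, "amount": 30}]
users = [{"user_id": 7, "city": "Beijing", "level": "gold"}]

wide_orders = denormalize(orders, users, "user_id", "user_id", "user_dim")

# A query can now read the dimension attribute straight from the fact row.
cities = [row["user_dim"]["city"] for row in wide_orders]
print(cities)
```

The design point this illustrates is that the fact row grows by exactly one field however many attributes the dimension carries, which is one plausible reading of how a composite field avoids field expansion.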

Description

Technical field

[0001] The present invention relates to the field of data modeling, and in particular to a dimensional modeling method, apparatus, device and storage medium.

Background

[0002] Data warehouse modeling methods are usually divided into four types: 3NF (third normal form), dimensional modeling, data vault, and anchor modeling. In the traditional RDBMS (relational database management system) scenario, these four model design methods can cover most application scenarios, and each one makes data storage and analysis more convenient from a different angle.

[0003] However, once data volumes rise to the TB/PB level, the change in data volume and the change in computing and storage methods under the Hadoop system magnify a shortcoming of these modeling methods: they require too many joins (associated queries). Specifically, take the join of table A and table B through...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
Inventor: 黄世钱, 吴培培
Owner: WUBA