Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and system for intelligently dividing data themes, equipment and storage medium

A data and topic technology, applied in the field of intelligently dividing data topics, can solve time-consuming and labor-intensive problems, and achieve the effects of reducing development time, quickly locating, and intuitively mining model information

Pending Publication Date: 2021-11-26
BEIJING XUEZHITU NETWORK TECH
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] Aiming at the time-consuming and laborious technical problem of manually writing sql to complete dimension modeling, the present invention proposes a method, system, device and storage medium for intelligently dividing data subjects

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for intelligently dividing data themes, equipment and storage medium
  • Method and system for intelligently dividing data themes, equipment and storage medium
  • Method and system for intelligently dividing data themes, equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0054] figure 2 It is a schematic diagram of the steps of a method for intelligently dividing data subjects provided by the present invention. Such as figure 2 As shown, this embodiment discloses a specific implementation of a method for intelligently dividing data subjects (hereinafter referred to as "method").

[0055] Specifically, the method disclosed in this embodiment mainly includes the following steps:

[0056] Step S1: Create a page through front-end and back-end interaction, and connect to hive, and obtain all tables and table structure information in the database through the page;

[0057] Step S2: Select a fact table as a business line, and select business fields in the fact table to deduplicate and clean;

[0058] Specifically, add deduplication rules and cleaning rules, use the deduplication rules to deduplicate the business fields in the fact table according to business needs, and use the cleaning rules to remove fields with empty key fields, dirty fields, ...

Embodiment 2

[0075] In combination with a method for intelligently dividing data subjects disclosed in Embodiment 1, this embodiment discloses a specific implementation example of a system for intelligently dividing data subjects (hereinafter referred to as "system").

[0076] refer to Figure 4 As shown, the system includes:

[0077] Data source acquisition unit 1: Create pages through front-end and back-end interactions, and connect to hive, and obtain all tables and table structure information in the database through the pages;

[0078] Fact table selection unit 2: select a fact table as a business line, and select business fields in the fact table for deduplication and cleaning;

[0079] Dimension table degradation unit 3: Select the dimension table, and check the fields that must be retained after degradation in the degradable dimension table, check the business fields after saving, and perform deduplication and cleaning, and generate dimensions after all checks are completed wide t...

Embodiment 3

[0086] combine Figure 5 As shown, this embodiment discloses a specific implementation manner of a computer device. The computer device may comprise a processor 81 and a memory 82 storing computer program instructions.

[0087] Specifically, the processor 81 may include a central processing unit (CPU), or an Application Specific Integrated Circuit (ASIC for short), or may be configured to implement one or more integrated circuits in the embodiments of the present application.

[0088]Among them, the memory 82 may include mass storage for data or instructions. For example without limitation, the memory 82 may include a hard disk drive (Hard Disk Drive, referred to as HDD), a floppy disk drive, a solid state drive (SolidState Drive, referred to as SSD), flash memory, optical disk, magneto-optical disk, magnetic tape or universal serial bus (Universal Serial Bus, referred to as USB) drive or a combination of two or more of the above. Storage 82 may comprise removable or non-re...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method and system for intelligently dividing data themes, equipment and a storage medium, and the method comprises the steps: making a page through interaction of a front end and a rear end, connecting the page to hive, and obtaining all tables and table structure information in a database through the page; selecting a fact table as a business line, and selecting business fields in the fact table to perform duplicate removal and cleaning; selecting a dimension table, checking fields which need to be reserved after degradation in the dimension table which can be degraded, checking business fields after storage, performing duplicate removal and cleaning, and generating a dimension wide table after all checking is completed; and checking the dimension wide table needing to be analyzed of the business line through a page, and selecting the associated fields of the fact table one by one for association. The development time can be greatly shortened, and dimension modeling can be simply and efficiently completed.

Description

technical field [0001] The invention relates to the technical field of databases, in particular to a method, system, device and storage medium for intelligently dividing data subjects. Background technique [0002] Data warehouse is a new database technology developed rapidly in the information field in recent years. The establishment of a data warehouse can make full use of existing data resources, convert data into information, dig out knowledge from it, refine it into wisdom, and finally create benefits. Through the analysis of data in the data warehouse, it can help enterprises improve business processes and control cost, quality improvement, etc. The layering of the data warehouse can simplify complex problems. Decomposing tasks into multiple layers can quickly read and locate problems, avoid repetitive development, and enhance reusability. [0003] The current mainstream modeling methods include ER model and dimensional model. Among them, ER model is usually used for...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/28G06F16/21G06F16/215G06F16/22G06F16/242
CPCG06F16/283G06F16/215G06F16/2282G06F16/212G06F16/2433
Inventor 高源
Owner BEIJING XUEZHITU NETWORK TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products