Data partitioning method for distributed parallel database system

A data partitioning and database technology, applied in the field of distributed parallel database systems, can solve problems affecting query efficiency, increase query response time, and consume network bandwidth, so as to improve query efficiency, reduce query response time, and avoid network transmission consumption. when the effect

Active Publication Date: 2010-12-15
BORQS BEIJING +2
View PDF3 Cites 95 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This kind of data transfer between nodes during query is also called dynamic repartitioning (Dynamic Repartitioning), which will not only con

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data partitioning method for distributed parallel database system
  • Data partitioning method for distributed parallel database system
  • Data partitioning method for distributed parallel database system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] The preferred embodiments of the present invention will be described below in conjunction with the accompanying drawings. It should be understood that the preferred embodiments described here are only used to illustrate and explain the present invention, and are not intended to limit the present invention.

[0025] When building a database system or a data warehouse based on a distributed database, the actual fact data and the data used to describe attributes are generally separated by different tables, and the actual fact data is stored in a class called fact Table (Fact table), and data describing attributes from different perspectives are placed in different dimension tables (Dimension table). For example, a Sales database or data warehouse can be designed like this. Each sales record should include the product sold, the customer sold, the supplier of the product, the time of sale, the quantity sold, and the income obtained. For specific digital data such as sales qu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a data partitioning method for a distributed parallel database system. The method comprises the following steps of: establishing a fact table and a dimension table according to the constructed distributed parallel database system; inserting records of the dimension table and the fact table on different nodes according to a partitioning rule; copying the records of the dimension table to the nodes of the fact table; and deleting and updating the data. When a data set or data stream is imported or inserted into the distributed database system in a partitioning way, the relation between tables defined by a database schema can be met on each node, particularly the primary key-foreign key restrictive condition, so the data on each node has local completeness of the data. For the query processing on the connection between the tables by using the primary key-foreign key restrictive condition, the data of each node has the local completeness on the query, so dynamic repartitioning of data between the nodes is not needed; and thus the method has the advantages of preventing time-consuming network transmission of the data, shortening the query response time and improving the query efficiency.

Description

technical field [0001] The invention relates to a distributed parallel database system, in particular to a data partition method of the distributed parallel database system. Background technique [0002] Storing data in databases is a common method of data management, especially in relational databases. We can choose a mature database management system (DBMS: Database Management System) according to the data needs to be managed, and use a standard data definition language (such as SQL DDL) to define data tables (Table) or relations (Relation), data structures , index, primary key (Primary Key) and foreign key (Foreign Key) and other information of the database data management program (Database Schema), deploy the database system. According to the data manipulation language (such as SQL DML) provided by the DBMS, the application program can perform data manipulation, such as inserting, querying, updating, importing and exporting, etc. [0003] In many current industry appli...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F17/30584G06F16/278
Inventor 张卫平张松波刘为怀
Owner BORQS BEIJING
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products