Data partition method and device
A data partitioning and data technology, applied in the database field, can solve problems such as low efficiency of join operation, waste of network and storage space, and large amount of data transmission
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0063] The embodiment of the present invention provides a data partition method, such as figure 1 Shown, including:
[0064] 101. The device partitions the dimension tables in the distributed database.
[0065] Among them, the device here can be a computer with its own disk and a central processing unit (CPU). The application scenario of the embodiment of the present invention may be a data distribution problem in a distributed massive parallel processing (Massive parallel processing, mpp) database.
[0066] Specifically, in a distributed mpp database, when there are two tables with a join (join query) relationship, the dimension table can be partitioned according to a general algorithm. The general algorithm here can be a hash algorithm, for example, dimension The table is order (order), the primary key is the O_PK order column, and the foreign key is the C_PK customer column. The dimension table can be partitioned according to C_PK. Because the default C_PK is assigned to differen...
Embodiment 2
[0075] The embodiment of the present invention provides a data partition method, such as figure 2 Shown, including:
[0076] 201. The device partitions the dimension tables in the distributed database.
[0077] Among them, the device here can be a computer, which is applied to a distributed mpp database to solve the problem of data distribution. The Mpp architecture can distribute my data to multiple nodes and process them in parallel by multiple nodes, which can increase the data processing speed.
[0078] When there is a join query operation between the two tables, the dimension table can be partitioned according to a general algorithm. The general algorithm here can be a hash algorithm. The dimension table is used to store the attributes of the object data in the fact table. .
[0079] For example, it can be the test standard of the half of the organization using the general benchmark test
[0080] BenchmarkTPC-H (TransactionProcessingPerformanceCouncil-H) introduces the data sche...
PUM

Abstract
Description
Claims
Application Information

- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com