Connection query system and method for distributed data warehouse

A distributed data and connection query technology, applied in the database field, can solve problems such as low resource utilization, and achieve the effects of improving efficiency, resource utilization, and performance

Inactive Publication Date: 2012-05-23
NEC (CHINA) CO LTD
View PDF6 Cites 113 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

This patented technology helps optimize use of resources by dividing up large amounts of data across multiple servers or storage devices instead of sending it all over one network. It uses specialized software that allows users to connect with specific files quickly without having any unnecessary connections being sent through other parts of their computer networks. Additionally, this technique increases the number of connected computers while reducing redundancy levels. Overall, these improvements help make more efficient and effective data management systems run better than traditional methods like storing them locally at each server individually.

Problems solved by technology

Technological Problem: Current solutions involve distributing computational resources across multiple servers or machines, leading to slower communication times and lower throughput rates compared to more efficiently distributed computation models. Additionally, current methods require significant amounts of space allocation and may lead to poor overall performance if too much connections occur simultanously.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Connection query system and method for distributed data warehouse
  • Connection query system and method for distributed data warehouse
  • Connection query system and method for distributed data warehouse

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] In the following, the principle and implementation of the present invention will become apparent by describing specific embodiments of the present invention in conjunction with the accompanying drawings. It should be noted that the present invention should not be limited to the specific examples described below. Also, detailed descriptions of well-known elements are omitted for brevity.

[0042] image 3 A block diagram of a shard join query system for a distributed data warehouse according to an embodiment of the present invention is shown. As an example, in image 3 1 master node 50, 3 map worker nodes 60, and 2 reducer worker nodes 70 are shown in , and like numbers in the figures denote like elements. However, it should be understood that the present invention can be applied to a distributed system including any number of mapping working nodes and any number of reducing working nodes. Generally speaking, the number of mapping working nodes is greater than the nu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a connection query system and a connection query method for a distributed data warehouse. The system comprises a master node, mapping work nodes and reduction work nodes, wherein the master node calculates a fragment size according to the size of a data table and system performance, allocates a data block to the mapping work node based on the calculated fragment size, and formulates fragmentation mapping rules and summarization rules in the mapping work node; the mapping work node maps a query keyword in the data block to a corresponding fragment number according to the fragmentation mapping rules, and transmits data with the same fragment number to a specified reduction work node according to the summarization rules; and the reduction work node receives the data from the mapping work node, combines the data with the same fragment number and establishes connection according to the query keyword to obtain a final connection query result. By the system and the method, data transmission in a distributed system is reduced, the data volume and program complexity of the reduction work node are decreased, and the performance of the distributed data warehouse is improved.

Description

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Owner NEC (CHINA) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products