Parallel data processing method based on distributed structure

A distributed structure and data technology, applied in the direction of electrical digital data processing, special data processing applications, instruments, etc., can solve problems such as defects in basic queries, difficulties in query combination, and inability to support queries, so as to improve local query performance and ensure Safe, reduce the effect of storage throughput

Active Publication Date: 2013-11-27
INST OF SOFTWARE - CHINESE ACAD OF SCI
View PDF3 Cites 78 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

From a global point of view, it cannot support the unified key value of all table objects corresponding to the data ontology query; this contradiction makes it difficult to combine RDB query and KV query;
[0009] 3) The challenge of massive data analysis to high-performance query: service computing involves complex access to a large number of complex elements, object attributes, monitoring data, multimedia data, remote sensing data, and various unstructured data
However, in the data management system that the traditional GIS platform relies on, due to the inherent deficiencies in the architecture, there is an insurmountable mechanism limitation in terms of high-performance query
This causes the GIS platform to have defects in basic queries in the face of increasingly expanding geographic data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Parallel data processing method based on distributed structure
  • Parallel data processing method based on distributed structure
  • Parallel data processing method based on distributed structure

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0057] The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. It should be understood that the described embodiments are only some of the embodiments of the present invention, not all of them. example. Based on the embodiments of the present invention, all other embodiments obtained by those skilled in the art without making creative efforts belong to the protection scope of the present invention.

[0058] The technical scheme adopted in the present invention is as follows:

[0059] Step 1: According to the characteristics of cloud storage environment, provide a parallel data processing method. The entire data cluster processing system adopts a two-tier and three-tier data organization and management structure. In this system, the organization process of data goes through two basic steps of global distribution and local storag...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a parallel data processing method based on a distributed structure. The storing comprises steps as follows: (1) a data master key value is extracted from master nodes according to types of master key values, directed slave nodes distributed by data are determined according to data attribute values and a section comparison result in the master nodes, and simultaneously, a global keyword B+ tree index is established; (2), the data are distributed to the slave nodes corresponding to the master key values according to the global keyword B+ tree index on the basis of a share-nothing principle; and (3), the slave nodes receive a data distributing request, and the data are stored in child nodes locally on the basis of the share-nothing principle. According to the method, an effective index mechanism is combined, and the storage and management efficiency of system data is improved; on one hand, the reasonable data distribution is guaranteed, the storage throughput of the slave nodes is reduced, the local query performance is improved, and the system flexibility is guaranteed by utilizing high expandability of the slave nodes; and on the other hands, local transcript safety is guaranteed through local duplication of multiple transcripts.

Description

technical field [0001] The invention is oriented to the fields of geographic information system, spatio-temporal data management, location-related services, large-scale sensor stream data management, etc., and proposes a set key-value The RDB-KV parallel cloud database storage and retrieval method with the advantages of both the database (Key-Value Store) and the relational database realizes a mass data storage technology that combines the efficient access characteristics of key-value storage and the integrity of the database. Background technique [0002] Cloud computing is an important direction of current information technology development. Computing and storage services based on cloud platforms have undergone major changes in application modes, application scope, and technical requirements due to changes in the underlying infrastructure. Cloud storage is a new concept extended and developed on the concept of cloud computing. It refers to the integration of a large numbe...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
Inventor 郭皓明丁治明刘奎恩许佳捷徐怀野李亚光张天为
Owner INST OF SOFTWARE - CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products