Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Data storage system, metadatabase synchronization method and data cross-domain calculation method

A data storage system and database technology, applied in the field of information processing, can solve the problems of high-cost investment, instability, and inability of data to flow out, and achieve the effect of improving performance and reducing network overhead.

Active Publication Date: 2019-05-07
TRANSWARP INFORMATION TECH SHANGHAI
View PDF10 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Each data center is equivalent to a domain. The network within the domain is fast, but the network between domains is much slower and unstable than the network within the domain. Therefore, if a large amount of network overhead is generated during joint computing, There will be a relatively large performance problem
[0003] At present, there are strongly consistent and scalable global distributed databases on the market. The above-mentioned global distributed databases have two main defects. One is that it requires high-cost investment, and the other is that it does not meet the data compliance requirements. Regulatory requirements, that is, the requirement that data in a data center cannot be outflowed to other data centers
[0004] The reason for the first defect is that in order to meet the performance requirements of available scenarios, it is necessary to reduce the delay between data centers to a very low standard, which inevitably requires relatively high investment and investment in the network between data centers. Optimization; the reason for the second defect is that, from a business perspective, the data centers of the same company may not be used when performing cross-data center calculations, because the possibility of all data centers using the same database at the same time is relatively small , and the existing global distributed databases are all calculated nearby through multiple copies, that is, when the data is written, it will be written to other data centers, so this method does not meet the data compliance requirements

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data storage system, metadatabase synchronization method and data cross-domain calculation method
  • Data storage system, metadatabase synchronization method and data cross-domain calculation method
  • Data storage system, metadatabase synchronization method and data cross-domain calculation method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0036] figure 1 The structural diagram of the data storage system provided for Embodiment 1 of the present invention, such as figure 1 As shown, the data storage system includes: at least two data centers (as an example without limitation, in figure 1 Three data centers are shown in , namely: data center A, 110, data center B, 120 and data center C, 130).

[0037] Wherein, a communication connection is established between different data centers (for example, a public network or a private network is used for connection). Typically, each data center adopts a distributed database system as a whole.

[0038] Each data center (in figure 1 Take data center A, 110 as an example) including: access layer 1101, compilation layer 1102, computing layer 1103, storage layer 1104 and underlying container cloud platform 1105;

[0039] The compilation layer 1102 includes a metabase 11021 and at least one compilation node 11022 (in figure 1 Take three compilation nodes as an example), the ...

Embodiment 2

[0066] figure 2 It is a flow chart of a metadata database synchronization method provided by Embodiment 2 of the present invention. This embodiment is applicable to the case of performing data synchronization on the metadata database stored in each data center in the data storage system described in the embodiment of the present invention. The method can be executed by the metadata database synchronization device provided by the embodiment of the present invention. The device can be implemented in the form of software and / or hardware, and can generally be integrated in the data storage system. One or more compilation layers in the data storage system Execution, for example, is performed by the cooperation of each server integrated with the compilation layer of each data center in the data storage system.

[0067] Such as figure 2 As shown, the method of the embodiment of the present invention includes:

[0068] S210. In all metadata databases of the data storage system, de...

Embodiment 3

[0078] Figure 3a It is a flowchart of a method for synchronizing metadata databases provided by Embodiment 3 of the present invention. This embodiment is optimized based on the above embodiments. In this embodiment, the master database will be determined among all metadata databases in the data storage system And from the database, and the operation of establishing the cascaded topology diagram between the metadata databases is embodied. Correspondingly, the method of the embodiment of the present invention specifically includes:

[0079] S310. In all the metadata databases included in the data storage system, make statistics on the communication delay between any two metadata databases.

[0080] In this embodiment, all metadata databases included in the data storage system are connected in pairs. Correspondingly, the communication delay between the two meta-databases can be counted by sending and receiving test information between the two meta-databases. Furthermore, the p...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a data storage system, a metadatabase synchronization method and a data cross-domain calculation method. The data storage system comprises at least two data centers, wherein each data center comprises an access layer, a compiling layer, a computing layer, a storage layer and a bottom container cloud platform; The access layer is used for providing a unified data access interface; The compiling node is used for inquiring the metadatabase according to the received SQL statement, generating a matched execution plan and distributing the job task to the computing node for execution according to the execution plan; The computing nodes are used for acquiring data from data nodes of a data center where the computing nodes are located for computing according to the operationtasks and sending computing results to the computing nodes serving as summarizing nodes; The data node is used for storing data; And the bottom container cloud platform is used for carrying out containerization management on all services of the same data center. According to the technical scheme provided by the embodiment of the invention, the input cost is saved while the cross-domain computingservice is provided with high quality, and the data compliance requirement is met.

Description

technical field [0001] Embodiments of the present invention relate to information processing technologies, and in particular to a data storage system, metadata database synchronization, and data cross-domain computing methods. Background technique [0002] With the increasing amount of data and the need for business expansion, more and more enterprises have begun to deploy their own data centers. Because of the needs of some special industries, sometimes it is necessary to combine the data of multiple enterprises or organizations for joint computing, that is to say, it is necessary to solve the problem of data joint computing across data centers. Each data center is equivalent to a domain. The network within the domain is fast, but the network between domains is much slower and unstable than the network within the domain. Therefore, if a large amount of network overhead is generated during joint computing, There will be a relatively large performance problem. [0003] At p...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/27G06F16/25
Inventor 李光跃边雨刘汪根
Owner TRANSWARP INFORMATION TECH SHANGHAI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products