Data synchronization method, apparatus, device and storage medium thereof between clusters

A data synchronization and clustering technology, applied in the field of big data processing, can solve problems such as uneven distribution of topic partition messages, achieve the effects of ensuring security and consistency, improving efficiency, and facilitating viewing of data

Active Publication Date: 2019-02-26
SF TECH
View PDF7 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Using existing synchronization tools between consumer clusters, there is an uneven distribution of topic partition messages of the target cluster and topic partition messages of the source cluster, for example, the MirrorMaker tool

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data synchronization method, apparatus, device and storage medium thereof between clusters
  • Data synchronization method, apparatus, device and storage medium thereof between clusters
  • Data synchronization method, apparatus, device and storage medium thereof between clusters

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] The application will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain related inventions, rather than to limit the invention. It should also be noted that, for ease of description, only parts related to the invention are shown in the drawings.

[0027] It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other. The present application will be described in detail below with reference to the accompanying drawings and embodiments.

[0028] Please refer to figure 1 , figure 1 A schematic flowchart of a method for synchronizing data between clusters provided by the embodiment of the present application is shown.

[0029] Such as figure 1 As shown, the method includes:

[0030] Step 110, read the target message offset of the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A method, apparatus, apparatus, and storage medium for data synchronization between clusters are disclosed. The method includes reading a target message offset of a first partition of a first topic ofa target cluster; Comparing a target message offset with an oldest message offset of a second partition of a second subject of the source cluster, the second subject having the same subject name as the first subject and the first partition having the same partition sequence number as the second partition; If the target message offset is less than the oldest message offset, filling the data into the first partition; And synchronizing the master copy of the second partition to the first partition. According to the technical proposal of the embodiment of the present application, the problem thatthe subject partition data of the source cluster and the target cluster are unevenly distributed in the prior art is overcome by data filling processing.

Description

technical field [0001] The present application generally relates to the technical field of big data processing, specifically relates to the technical field of kafka, and especially relates to a data synchronization method, device, equipment and storage medium between clusters. Background technique [0002] With the development of big data, such as large-scale parallel processing databases, data mining, distributed file systems, distributed databases, cloud computing platforms, etc. are constantly updated. [0003] As a high-throughput distributed publish-subscribe messaging system, Kafka supports the distinction of messages through Kafka servers and consumer clusters. There is an uneven distribution of topic partition messages of the target cluster and topic partition messages of the source cluster using existing synchronization tools between consumer clusters, for example, the MirrorMaker tool. Contents of the invention [0004] In view of the above defects or deficienci...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/27G06F9/54
CPCG06F9/546
Inventor 陈文彪林国峰曾宪成
Owner SF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products