Data governance method based on double message queues

A message queue and dual message technology, which is applied in the field of distributed computing and data processing, can solve the problems of untraceability and low security, and achieve the effects of easy code refactoring, reliable data, and strong scalability

Inactive Publication Date: 2019-01-04
GLOBAL TONE COMM TECH
View PDF3 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0011] In order to solve the problems of delay, untraceability, and low security in the existing data governance methods, the present invention provides a data governance method based on dual message queues. The method is inserted into the message queue b

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data governance method based on double message queues

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0034] Example

[0035] See attached figure 1 .

[0036] A data management method based on dual message queues, the method includes the following steps:

[0037] 1) Input data from user data sources through data access tools;

[0038] 2) Store the accessed data resources in the message queue kafka cluster;

[0039] 3) The data management program extracts data from the message queue for preprocessing operations such as cleaning;

[0040] 4) The data management program stores the preprocessed data into the original Hbase database, and at the same time submits the data to the Kafka message queue again;

[0041] 5) Various data management programs extract data from the message queue for governance, and then store the governance results in the message queue kafka cluster again;

[0042] 6) The last governance program extracts data from the message queue, and after the governance is completed, the governance results are stored in the result databases HBase, ElasticSearch, and RabbitMQ for use i...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a data governance method based on a double message queue, includes the following steps: 1) inputting data from a user data source through a data access tool, 2) storing the accessed data resources in a message queue, 3) extracting data from that message queue for preprocessing operation such as cleaning, 4) re-storing the pre-processed data into a message queue, 5) variousdata governance program respectively extract data from that message queue for governance, and then store the governance result into the message queue again; 6) the last governance program extracting data from the message queue, and after the governance is completed, the governance result being stored into a result database for use by subsequent processes. The method inserts the message queue before and after the data governance, buffers the data before and after the governance, realizes the streaming processing of the data, and optimizes the whole data processing link from the aspects of reliability, availability, scalability, data security and performance.

Description

technical field [0001] The invention belongs to the technical field of distributed computing and data processing, and in particular relates to a data management method based on dual message queues. Background technique [0002] Data governance is the process of reading data from one storage medium, going through a series of data governance links, and then storing it in another storage medium. For data governance with a large amount of data, there are two traditional methods: one is to read sequentially through a single thread, and then write to the target storage medium sequentially; the other is to read data in parallel through some rules and then write in parallel into the target stored procedure. However, in the process of governance, there will always be some problems: [0003] 1. There is a delay in data governance: the above two methods, either batch reading or writing, or timing reading and writing, cannot achieve real-time reading and writing, and are not applicabl...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/25G06F16/215G06F16/2455
Inventor 张宝华程国艮
Owner GLOBAL TONE COMM TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products