[0040] Example:
[0041] Referring to Figures 1-3, a datax-based data governance method according to an embodiment of the present invention includes:
[0042] S101. In datax, a single data synchronization job is called a Job. After datax receives a Job, it starts a process to carry out the entire synchronization. The dataxJob module is the central management node of a single Job and is responsible for functions such as data cleaning, sub-task division, and TaskGroup management;
[0043] S103. After dataxJob starts, the Job is divided into multiple small Tasks according to the segmentation strategy of the source end, so as to facilitate concurrent execution; each Task is responsible for synchronizing a portion of the data;
[0044] S105. After the Tasks are divided, dataxJob calls the Scheduler module, which recombines the divided Tasks according to the configured concurrency and assembles them into TaskGroups;
[0045] S107. Each Task is started by its TaskGroup. After a Task starts, it runs a fixed Reader->Channel->Writer thread pipeline to complete the task synchronization;
[0046] S109. After a datax Job starts running, the Job monitors and waits for the tasks of the multiple TaskGroup modules to complete; once all TaskGroup tasks are finished, the Job exits successfully, otherwise the process exits abnormally with a non-zero exit value.
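The fixed Reader->Channel->Writer pipeline of step S107 can be sketched as a minimal producer/consumer example (a hypothetical Python illustration; the real datax Channel, implemented in Java, additionally performs rate limiting and statistics collection):

```python
import queue
import threading

def reader(source, channel):
    # Reader thread: pulls records from the source and pushes them into the Channel.
    for record in source:
        channel.put(record)
    channel.put(None)  # end-of-data marker

def writer(channel, sink):
    # Writer thread: drains the Channel and lands records at the destination.
    while (record := channel.get()) is not None:
        sink.append(record)

source = list(range(10))
sink = []
channel = queue.Queue(maxsize=4)  # the Channel buffers records in transit
t_r = threading.Thread(target=reader, args=(source, channel))
t_w = threading.Thread(target=writer, args=(channel, sink))
t_r.start(); t_w.start()
t_r.join(); t_w.join()
```

After both threads join, `sink` holds every record read from `source`, mirroring how a Task completes its portion of the synchronization.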
[0047] In a further embodiment, the above sub-task division converts the computation of a single Job into multiple sub-Tasks.
[0048] In a further embodiment, the above Task is called a sub-task; a Task is the smallest unit of a datax Job.
[0049] In a further embodiment, the above TaskGroup is called a task group; each TaskGroup is responsible for running all of its assigned Tasks with a certain concurrency, and the default concurrency of a single task group is 5 Tasks.
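The sub-task division of S103 and the TaskGroup assembly of S105, with the default concurrency of 5 Tasks per task group, can be sketched as follows (a hypothetical illustration; `split_job` and `group_tasks` are illustrative names, and real datax plug-ins choose their own source-end splitting strategies):

```python
import math

def split_job(record_count, task_count):
    """Split a Job's record range into roughly equal sub-Tasks
    (hypothetical splitting strategy for illustration only)."""
    size = math.ceil(record_count / task_count)
    return [(i * size, min((i + 1) * size, record_count))
            for i in range(task_count)
            if i * size < record_count]

def group_tasks(tasks, channels_per_group=5):
    """Reassemble Tasks into TaskGroups, each running up to
    channels_per_group Tasks concurrently (default 5, as in the text)."""
    return [tasks[i:i + channels_per_group]
            for i in range(0, len(tasks), channels_per_group)]

tasks = split_job(record_count=100, task_count=12)
groups = group_tasks(tasks)
```

Here 100 records split into 12 range Tasks, which the default concurrency of 5 packs into three TaskGroups of sizes 5, 5, and 2.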
[0050] Through the above-mentioned scheme of the present invention, the present invention has the following beneficial effects:
[0051] 1. Completely solves the problem of transmission distortion for individual data types: through function optimization, datax supports all strong data types, and each plug-in has its own data type conversion strategy, so that data is transmitted to the destination completely and losslessly;
[0052] 2. Provides runtime monitoring of the traffic and data volume of the entire job link: while datax is running, the monitoring module comprehensively displays the job status, data traffic, data speed, execution progress, and other information, so that users can understand the job status in real time; it can also intelligently compare the speeds of the source end and the destination end during job execution, providing users with more information for performance troubleshooting;
[0053] 3. Provides dirty data detection: when a large amount of data is transmitted, transmission errors (such as type conversion errors) are inevitable for various reasons, and datax regards such data as dirty data. By configuring the data cleaning process, datax accurately filters, identifies, collects, and displays dirty data, and provides users with multiple dirty data handling modes, allowing users to precisely control data quality;
[0054] 4. Rich data conversion functions: as an ETL tool serving big data, datax provides not only a data snapshot migration function but also rich data conversion functions, so that data desensitization, completion, filtering, and other conversions can be performed easily during transmission; it also supports groovy functions, allowing users to customize conversion logic;
[0055] 5. Precise speed control: after optimization, datax provides three flow control modes, including channel (concurrency), record stream, and byte stream, allowing users to control the job speed so that the job achieves the best synchronization speed within the range that the database can bear;
[0056] 6. Robust fault-tolerance mechanism: datax jobs are easily disturbed by external factors; network interruptions, unstable data sources, and similar factors can easily cause a half-synchronized job to stop with an error. Stability is therefore a basic requirement of datax. The design of datax 3.0 focuses on improving the stability of the framework and the plug-ins; at present, datax 3.0 provides multi-level local/global retries at the thread level and the job level to ensure stable operation of users' jobs.
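The three flow control modes of effect 5 (channel, record stream, byte stream) can be sketched as a job-level speed setting apportioned across concurrent channels (`per_channel_limits` is a hypothetical helper for illustration, not a datax API; the field names mirror the three modes named in the text):

```python
# Job-level speed setting: 4 concurrent channels, capped at
# 10,000 records/s and 1 MiB/s for the job as a whole.
speed = {"channel": 4, "record": 10000, "byte": 1048576}

def per_channel_limits(speed):
    """Derive per-channel record and byte limits from the job-level
    setting by dividing evenly among the configured channels."""
    channels = speed.get("channel", 1)
    return {
        "record_per_channel": speed["record"] // channels,
        "byte_per_channel": speed["byte"] // channels,
    }

limits = per_channel_limits(speed)
```

With 4 channels, the job-level caps translate into 2,500 records/s and 256 KiB/s per channel, which is how a job-wide limit can stay within what the database can bear.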
[0057] In order to facilitate understanding of the above technical solutions of the present invention, the working principle or mode of operation of the present invention in practice is described in detail below.
[0058] In practical application:
[0059] 1. datax is an open-source offline synchronization tool for heterogeneous data sources from Alibaba, dedicated to realizing stable and efficient data synchronization between various heterogeneous data sources, including relational databases (MySQL, Oracle, etc.), HDFS, Hive, ODPS, HBase, and FTP. The data synchronization function is based on the open-source datax framework, with optimization and encapsulation in our internal framework.
[0060] 2. The datax offline data synchronization framework is encapsulated and built on a Framework + plugin architecture: reading from and writing to data sources are abstracted into Reader/Writer plug-ins and incorporated into the overall synchronization framework.
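The Framework + plugin architecture can be sketched as follows: the framework depends only on abstract Reader/Writer interfaces, and each concrete data source supplies its own plug-in (`ListReader`/`ListWriter` are hypothetical in-memory plug-ins for illustration; datax itself is implemented in Java):

```python
from abc import ABC, abstractmethod

class Reader(ABC):
    """Abstract source plug-in: a new data source only implements read()."""
    @abstractmethod
    def read(self):
        ...

class Writer(ABC):
    """Abstract destination plug-in: a new sink only implements write()."""
    @abstractmethod
    def write(self, records):
        ...

class ListReader(Reader):
    def __init__(self, data):
        self.data = data
    def read(self):
        return iter(self.data)

class ListWriter(Writer):
    def __init__(self):
        self.out = []
    def write(self, records):
        self.out.extend(records)

def framework_sync(reader, writer):
    # The framework sees only the Reader/Writer interfaces,
    # never the concrete data sources behind them.
    writer.write(reader.read())

w = ListWriter()
framework_sync(ListReader([1, 2, 3]), w)
```

Adding a new data source then means writing one Reader and one Writer plug-in; the synchronization framework itself is unchanged.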
[0061] datax 3.0 supports completing synchronization jobs in single-machine multi-threaded mode. The life cycle of a datax job is shown in the sequence diagram below, and the overall architecture design briefly explains the implementation logic of each datax module.
[0062] In order to solve the synchronization problem of heterogeneous data sources, datax turns complex mesh synchronization links into a star-shaped data link. As shown in Figure 3, datax, as an intermediate transmission carrier, is responsible for connecting the various data sources. When a new data source needs to be accessed, it only needs to be connected to datax to achieve seamless data synchronization with the existing data sources.
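The benefit of the star topology can be made concrete with a simple count: n mutually synchronized heterogeneous sources require on the order of n x (n - 1) directed point-to-point links, whereas connecting each source to datax requires only one Reader and one Writer per source (illustrative arithmetic, not part of the original specification):

```python
def mesh_links(n):
    # Directed point-to-point synchronization links among n sources.
    return n * (n - 1)

def star_plugins(n):
    # One Reader plus one Writer per data source connected to datax.
    return 2 * n

# For example, with 10 heterogeneous sources a full mesh needs 90
# directed links, while the star topology needs only 20 plug-ins.
mesh = mesh_links(10)
star = star_plugins(10)
```

The gap widens quadratically as more data sources are added, which is why accessing a new source only requires connecting it to datax.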
[0063] 1. After encapsulation and tuning, datax 3.0 already supports all strong data types, and each plug-in has its own data type conversion strategy, so that data is transmitted to the destination completely and losslessly;
[0064] 2. While datax 3.0 is running, the job status, data traffic, data speed, execution progress, and other information are comprehensively displayed, so that users can understand the job status in real time; it can also intelligently compare the speeds of the source end and the destination end during job execution, providing users with more information for performance troubleshooting.
[0065] When a large amount of data is transmitted, transmission errors (such as type conversion errors) are inevitable for various reasons, and datax regards such data as dirty data. At present, datax can accurately filter, identify, collect, and display dirty data, and provides users with multiple dirty data handling modes, allowing users to precisely control data quality.
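The dirty data handling described above can be sketched as collecting, rather than aborting on, records that fail conversion (a hypothetical Python illustration; datax implements this in Java through its configurable data cleaning process):

```python
def convert_int(value):
    """Hypothetical type-conversion step of a synchronization pipeline."""
    return int(value)

def sync_with_dirty_collection(records, convert):
    """Convert each record; records that fail are treated as dirty data
    and collected with the failure reason instead of failing the job."""
    clean, dirty = [], []
    for r in records:
        try:
            clean.append(convert(r))
        except (ValueError, TypeError) as err:
            dirty.append((r, str(err)))
    return clean, dirty

clean, dirty = sync_with_dirty_collection(["1", "2", "x", None], convert_int)
```

The clean records continue to the destination while the dirty ones are available for display and for whichever handling mode the user configures.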