Data management method and system based on data consanguinity analysis

A blood relationship and data technology, applied in the field of data processing, can solve the problems of difficulty in data traceability, verification, and correlation analysis, and achieve the effect of improving data governance efficiency and facilitating data analysis and utilization.

Pending Publication Date: 2021-05-14
SHANDONG LANGCHAO YUNTOU INFORMATION TECH CO LTD
View PDF11 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0013] The technical task of the present invention is to provide a data governance method and system based on data blood relationship analysis to solve the problems of how to overcome the difficulties of data traceability, verification and correlation analysis in the process of data governance

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data management method and system based on data consanguinity analysis
  • Data management method and system based on data consanguinity analysis
  • Data management method and system based on data consanguinity analysis

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0062]Appendfigure 1 As shown, the present invention is based on the data control method of data-based blood analysis. The method is to analyze the data family relationship mesh map, and confirm each other's data in the mesh spectrum, which helps data governance to complete Trace data, verify, supplement, and standardize data, improve data management efficiency; specifically as follows:

[0063]S1, scheduling and storing big data;

[0064]S2, the data is performed to form a data family map;

[0065]S3, construct a data map through an algorithm model.

[0066]In this embodiment, the maximum data of S1 is scheduled and stored as follows:

[0067]S101, the relevant data resource is scheduled to the database of the HBASE through NIFI data scheduler;

[0068]S102, during the scheduling process, standardize the field name, cleaning the key field, convenient for blood analysis.

[0069]In this embodiment, the data family map is made in this embodiment, and the data family is specifically as follows:

[0070]S201,...

Embodiment 2

[0090]Appendfigure 2As shown, a data governance system based on a data-based blood analysis, including a large data scheduling storage module for scheduling storage;

[0091]Data blood analysis module, used to analyze the data relationship, generate data family maps;

[0092]The algorithm model module is used to automatically analyze the data-related relationship to form a data map through each node key field index, and is also used to manage data quality and analyze data relationships, extract data value.

[0093]The large data scheduling storage modules in this embodiment include,

[0094]The library module is used to schedule the data;

[0095]Standardized submodules for normalization of data fields during scheduling;

[0096]Cleaning the child module for cleaning the key field.

[0097]The data blood analysis module in this embodiment includes,

[0098]Quroom submodule 1. Used to query master data nodes;

[0099]Subsequent submodulation 2 for querying data flow nodes;

[0100]The query submodule is used to q...

Embodiment 3

[0104]The embodiment of the present invention also provides a computer readable storage medium in which a plurality of instructions are stored, and the instruction is loaded by a processor, and the processor performs a data governance method based on data-based blood analysis based in either embodiment of the present invention. Specifically, a system or apparatus equipped with a storage medium can be provided, and a software program code that implements a function of any of the embodiments in the above embodiment, and a computer (or CPU or MPU) of the system or device (or MPU) ) Read and execute program code stored in the storage medium.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a data management method and system based on data consanguinity analysis, belongs to the technical field of data processing, and aims to solve the technical problem of how to overcome difficulty in data traceability, verification and association analysis in the data governance process. A data family relationship mesh map is constructed, and node data in the mesh map are mutually verified and expanded, so that data management personnel are helped to complete data tracing, verification, supplementation and standardization, and the data governance efficiency is improved; the method specifically comprises the following steps: scheduling and storing big data; performing blood relationship analysis on the data to form a data family map; and constructing a data graph through the algorithm model. The system comprises a big data scheduling storage module, a data consanguinity analysis module and an algorithm model module.

Description

Technical field[0001]The present invention relates to the field of data processing, and in particular, a data governance method and system based on data blood analysis.Background technique[0002]Big data era, data explosive growth, massive, and various types of data are rapidly generated. These huge complex data information have been called, converted, converted, and generated, generating new data, and gathering the ocean of data.[0003]Human blood relationships are interpersonal relationships generated by marriage or fertility, such as the relationship between parents and children, brothers and sisters, and other relatives that are derived. In the process of production, processing, flow and death, data will naturally form a relationship, learn from a similar relationship in human society to express this relationship between data, called the blood relationship of data. .[0004]Data blood has the following characteristics:[0005]1 Raising: Data is owned by a particular organization or in...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/28G06F16/215G06F16/26
CPCG06F16/284G06F16/215G06F16/26Y02D10/00
Inventor 王泽宇宋海涛尹曦萌于春蕾张正奇
Owner SHANDONG LANGCHAO YUNTOU INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products