Implementation method for data quality control

An implementation method for data quality control, applied in the field of data quality control. It addresses problems such as the inability to optimize data or to issue data alarms and warnings, and achieves optimized processing and optimized sequences.

Active Publication Date: 2020-10-02
BEIJING TONGTECH CO LTD +3
4 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

[0004] The invention provides an implementation method for data quality control. It addresses the situation in which, during data quality control, low-quality sequences exist, data optimization cannot be performed, and data alarms and warnings cannot be issued.



Examples


Embodiment 1

[0074] As shown in the flow chart of Figure 1, an implementation method for data quality control according to the present invention includes:

[0075] Step 100: Obtain the target attribute of the target data, perform sequence extraction on the target data according to the target attribute, and obtain sequence data;

[0076] Step 101: Determine the association relationship between the sequence data, perform quality supervision and measurement on the sequence data based on a quality control algorithm and the association relationship, and determine the low-quality sequence;

[0077] Step 102: Optimize the low-quality sequence according to a preset optimized sequence library to obtain an optimized sequence;

[0078] Step 103: Verify whether the optimized sequence meets the control standard, and give an alarm to the optimized sequence that does not meet the control standard.
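
Steps 100 to 103 can be sketched as a small pipeline. This is only an illustration under stated assumptions, not the patented implementation: the source does not specify the quality control algorithm, so a simple fill rate (fraction of non-missing values) stands in for it, the association relationship of Step 101 is omitted, and the "optimized sequence library" is modeled as a hypothetical dictionary of default fill values. All function names, the `"value"` field, and the `0.8` threshold are invented for the example.

```python
from collections import defaultdict


def extract_sequences(records, target_attribute):
    """Step 100: group record values into sequences keyed by the target attribute."""
    sequences = defaultdict(list)
    for record in records:
        sequences[record[target_attribute]].append(record.get("value"))
    return dict(sequences)


def fill_rate(sequence):
    """Assumed quality metric: fraction of entries that are not missing."""
    if not sequence:
        return 0.0
    return sum(v is not None for v in sequence) / len(sequence)


def find_low_quality(sequences, threshold):
    """Step 101 (simplified): flag sequences whose fill rate is below the threshold."""
    return [key for key, seq in sequences.items() if fill_rate(seq) < threshold]


def optimize(sequence, default):
    """Step 102 (simplified): fill gaps with a value from the preset library."""
    return [default if v is None else v for v in sequence]


def run_quality_control(records, target_attribute, library, threshold=0.8):
    """Run steps 100-103 and return (sequences, keys that raised an alarm)."""
    sequences = extract_sequences(records, target_attribute)
    alarms = []
    for key in find_low_quality(sequences, threshold):
        if key in library:
            sequences[key] = optimize(sequences[key], library[key])
        # Step 103: re-check against the control standard and alarm on failure.
        if fill_rate(sequences[key]) < threshold:
            alarms.append(key)
    return sequences, alarms


records = [
    {"sensor": "a", "value": 1},
    {"sensor": "a", "value": None},
    {"sensor": "b", "value": None},
    {"sensor": "b", "value": None},
    {"sensor": "c", "value": 5},
]
sequences, alarms = run_quality_control(records, "sensor", library={"a": 0})
print(sequences["a"], alarms)
```

In this toy run, sequence `a` is repaired from the library, while `b` has no library entry, still fails the check, and is reported as an alarm.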

[0079] The principle of the above technical solution is: in the process of data quality control...

Embodiment 2

[0082] As an embodiment of the present invention, obtaining the target attribute of the target data includes:

[0083] Determining the space complexity of various types of data in the target data (that is, the measurement of the storage space occupied by various types of data in the target data), and based on the space complexity, determining the space attribute of the target data;

[0084] Determining the information entropy of the various types of data in the target data (that is, a quantitative measure of the various types of data in the target data), dividing the entropy values into gradients, and determining the entropy value attribute of the target data based on the gradient of the entropy value;

[0085] Determining the degree of correlation of the various types of data in the target data (that is, the Mahalanobis distance between the various types of data in the target data), and determining the relationship attributes of the target data based on the degree...
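
The three measurements named above can be illustrated with a short sketch. It rests on several assumptions that are not in the source: storage space is approximated by the UTF-8 byte length of each value's string form, the entropy is Shannon entropy in bits, the gradient boundaries `(0.5, 1.5)` are arbitrary, and the Mahalanobis distance is shown only for two-dimensional points with a caller-supplied covariance matrix. All function names are hypothetical.

```python
import math
from collections import Counter


def storage_bytes(values):
    """Space attribute (assumed proxy): UTF-8 size of the values' string forms."""
    return sum(len(str(v).encode("utf-8")) for v in values)


def shannon_entropy(values):
    """Shannon entropy in bits of the empirical distribution of `values`."""
    total = len(values)
    counts = Counter(values)
    return sum((c / total) * math.log2(total / c) for c in counts.values())


def entropy_gradient(entropy, boundaries=(0.5, 1.5)):
    """Map an entropy value to a gradient index: the number of boundaries it reaches."""
    return sum(entropy >= b for b in boundaries)


def mahalanobis_2d(x, y, cov):
    """Mahalanobis distance between 2-D points x and y for a 2x2 covariance matrix."""
    (a, b), (c, d) = cov
    det = a * d - b * c
    if det == 0:
        raise ValueError("covariance matrix is singular")
    dx, dy = x[0] - y[0], x[1] - y[1]
    # (x - y)^T * inverse(cov) * (x - y), using the closed-form 2x2 inverse.
    q = (d * dx * dx - (b + c) * dx * dy + a * dy * dy) / det
    return math.sqrt(q)


column = ["a", "a", "b", "b"]
h = shannon_entropy(column)
print(storage_bytes(column), h, entropy_gradient(h))
print(mahalanobis_2d((0, 0), (3, 4), ((1, 0), (0, 1))))
```

With an identity covariance matrix the Mahalanobis distance reduces to the ordinary Euclidean distance, which is a quick sanity check for the formula.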

Embodiment 3

[0090] As an embodiment of the present invention, performing sequence extraction on the target data according to the target attribute to obtain sequence data includes:

[0091] Generating a corresponding sequence code in the target data based on the target attribute;

[0092] Counting the sequence codes, and generating a key-value sequence of the sequence codes through a key-value function;

[0093] Determining, according to the key-value sequence, the data in the target data that corresponds to the key-value sequence, to generate the sequence data.
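
One possible reading of these three sub-steps is sketched below. The source does not define the coding scheme or the key-value function, so both are assumptions here: the sequence code is a string built by joining the attribute values, and the key-value function maps each distinct code to its count. The names `sequence_code`, `key_value_sequence`, and `extract_sequence_data` are hypothetical.

```python
from collections import Counter


def sequence_code(record, attribute_names):
    """Assumed coding: join the record's target-attribute values into one string."""
    return "|".join(str(record[name]) for name in attribute_names)


def key_value_sequence(codes):
    """Count the codes and return (code, count) pairs in first-seen order."""
    return list(Counter(codes).items())


def extract_sequence_data(records, attribute_names):
    """Group the records under each key of the key-value sequence."""
    codes = [sequence_code(r, attribute_names) for r in records]
    keys = [key for key, _count in key_value_sequence(codes)]
    return {
        key: [r for r, code in zip(records, codes) if code == key]
        for key in keys
    }


records = [
    {"type": "int", "level": 1, "v": 10},
    {"type": "str", "level": 2, "v": "x"},
    {"type": "int", "level": 1, "v": 20},
]
codes = [sequence_code(r, ["type", "level"]) for r in records]
print(key_value_sequence(codes))
print(extract_sequence_data(records, ["type", "level"]))
```

Records that share the same attribute values receive the same code, so they end up in the same sequence.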

[0094] The principle of the above technical solution is: when determining the sequence data, the present invention can digitize the target data because the target attribute has already been determined; the digitized target data can then be sequence-coded, and the sequence code is expressed numerically in the form of computer language; finally, the key value of the sequence data is determined through the target data after di...


Abstract

The invention provides an implementation method for data quality control, which comprises the following steps: obtaining a target attribute of target data, and performing sequence extraction on the target data according to the target attribute to obtain sequence data; determining an association relationship between the sequence data, performing quality supervision and measurement on the sequence data based on a quality control algorithm and the association relationship, and determining a low-quality sequence; optimizing the low-quality sequence according to a preset optimized sequence library to obtain an optimized sequence; and verifying whether the optimized sequence meets the control standard, and issuing an alarm for any optimized sequence that does not meet the control standard. The beneficial effects are that the target data is effectively analyzed and divided by attribute; quality control of the target data determines the data quality, so that the data can be optimized and a better optimized sequence obtained; and control of, and alarms on, the sequence data ensure that the target data obtained is all high-quality data.

Description

Technical field

[0001] The invention relates to the technical field of data management, in particular to an implementation method for data quality control.

Background technique

[0002] At present, data management and control processing involves many links. Owing to factors such as the filtering method, the cleaning method, the original data extraction rules, whether the conversion process executes successfully, and whether the loading process type is correct, each link may suffer from lost data records, inaccurate data, failed conversion processes, timeouts, and so on. When locating problems in these links, because there are many links, many technologies in use, and many possible causes, maintenance personnel either cannot locate the problem or spend a great deal of time on data verification, which is laborious and unnecessary. It must be possible to locate the problem accurately. There are a series of problems, such as low filling rate of key fields, unreasonable analysis me...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/215, G06F11/32
CPC: G06F11/327, G06F16/215
Inventor: 张春林, 李利军, 李春青, 常江波, 尚雪松
Owner: BEIJING TONGTECH CO LTD