Data benchmarking method and device and storage device

A data matching and benchmarking technology, applied in the field of data governance, can solve problems such as easy mis-match, ineffective verification, lack of automatic addition of standard data items, etc., to achieve the effect of improving credibility and reducing the false-match rate

Active Publication Date: 2020-02-14
ZHEJIANG DAHUA TECH CO LTD
View PDF13 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] This application provides a data benchmarking method, device, and storage device, which can solve the problems in the prior art that are easy to match incorrectly, cannot be effectively verified, and lack the function of automatically adding standard data items

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data benchmarking method and device and storage device
  • Data benchmarking method and device and storage device
  • Data benchmarking method and device and storage device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] The following will clearly and completely describe the technical solutions in the embodiments of the present application with reference to the accompanying drawings in the embodiments of the present application. Obviously, the described embodiments are only part of the embodiments of the present application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the scope of protection of this application.

[0026] The terms "first", "second", and "third" in this application are used for descriptive purposes only, and cannot be understood as indicating or implying relative importance or implicitly specifying the quantity of indicated technical features. Thus, features defined as "first", "second", and "third" may explicitly or implicitly include at least one of these features. In the description of the present application, "plurality" means at least t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a data benchmarking method and device and a storage device. The data benchmarking method comprises the steps that original data information is extracted from a data table to bebenchmarked, and the original data information comprises field names and field annotations corresponding to the field names; identifying the field annotation based on a sequence annotation model of deep learning to obtain a characteristic word corresponding to the field name; carrying out first text matching on the characteristic words corresponding to the field names and standard data elements in a standard library; and verifying a result output after the first text is matched. By means of the mode, text matching is conducted on the basis that the feature words are recognized, the credibility of a text matching result is improved, and the mismatching rate in the benchmarking process is reduced.

Description

technical field [0001] The present application relates to the technical field of data governance, in particular to a data benchmarking method, device, and storage device. Background technique [0002] Data benchmarking is an important part of data governance, which is to benchmark non-standard data item representations to data item representations that meet standard specifications. Specifically, data item benchmarking can be split into two parts: data element (consisting of three major elements: object, characteristic word, and representation word) benchmarking and qualifier (object modifier) ​​benchmarking. In the prior art, there are many similarity matching methods based on field names. Due to the variety of actual non-standard field naming methods (usually including English, especially Chinese pinyin acronyms), it is easy to cause a mismatch for a large number of Chinese pinyin abbreviations. ; On the other hand, the existing technology does not identify the three eleme...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/25G06F16/33G06F16/35
CPCG06F16/258G06F16/3344G06F16/35
Inventor 戴泽林高圣兴朱明浩何林强
Owner ZHEJIANG DAHUA TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products