Data auditing method and device

A data and number technology, applied in the computer field, can solve problems such as the inability to guarantee the quality of label data review, and the inability to guarantee label data, etc., to achieve the effect of reducing data review costs, ensuring review quality, and improving return on investment

Inactive Publication Date: 2019-04-30
BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD +1
View PDF5 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] 2. According to the manual sampling review method to review the marked data, it cannot guarantee tha

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data auditing method and device
  • Data auditing method and device
  • Data auditing method and device

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0033] The following describes exemplary embodiments of the present invention with reference to the accompanying drawings, which include various details of the embodiments of the present invention to facilitate understanding, and should be regarded as merely exemplary. Therefore, those of ordinary skill in the art should realize that various changes and modifications can be made to the embodiments described herein without departing from the scope and spirit of the present invention. Likewise, for clarity and conciseness, descriptions of well-known functions and structures are omitted in the following description.

[0034] For supervised machine learning, it is usually necessary to use a large amount of labeled data to train the algorithm model, and the quality of the labeled data will directly affect the quality of the algorithm model, so the review of the labeled data has become extremely important. However, in order to ensure the quality of the data, the existing methods of rev...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a data auditing method and device, and relates to the technical field of computers. A specific embodiment of the method comprises the following steps: allocating a marking taskto a marker according to a predetermined rule, the marking task comprising a sample marking task, and each marker being allocated with a sample marking task; after the to-be-labeled person completesthe distributed labeling task, obtaining a labeling result of the labeling task; for each tagging person, obtaining a sample tagging result included in the tagging result of the tagging person, checking the sample tagging result, and then calculating the error rate of the tagging person; and if the error rate does not exceed the preset threshold value, the marking result of the marker is approved.According to the embodiment, the data auditing cost can be greatly reduced, the investment return rate is improved, and the auditing quality of the annotated data can be ensured.

Description

technical field [0001] The invention relates to the field of computer technology, in particular to a method and device for data verification. Background technique [0002] In today's era of rapid development of artificial intelligence, technologies such as speech recognition, image recognition, natural language processing, and video analysis have become the core competitiveness of artificial intelligence (AI) companies. At present, most companies generally use labeled data for algorithm model training when analyzing artificial intelligence, and optimize the model by continuously improving the algorithm to better simulate the information process of human consciousness and thinking. Among them, when using labeled data for model training, the higher the quality and the more the labeled data, the better the algorithm model can be trained. Therefore, the quality of labeled data needs to be strictly controlled. [0003] At present, when performing artificial intelligence analysi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06Q10/06G06Q10/10
CPCG06Q10/06311G06Q10/103
Inventor 刘愉
Owner BEIJING JINGDONG SHANGKE INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products