Annotation data processing method, device, and medium

A method for processing annotation data, applied in the field of big data. It addresses the low utilization of reviewed annotation data and the low efficiency of re-annotation, and improves both.

Status: Inactive. Publication Date: 2021-05-04
北京中关村科金技术有限公司
Cites: 3. Cited by: 0.

AI Technical Summary

Problems solved by technology

[0004] The above process has the following problems. First, annotation data that has already been reviewed and verified cannot be put to further use, so data utilization is low. Second, when annotation data that fails the accuracy check is returned to the annotator, the annotator does not know which items are wrong and has to re-annotate the entire batch, so re-annotation is inefficient.

Examples

Embodiment 1

[0032] This embodiment provides a method for processing annotation data. Note that the steps shown in the flowcharts of the drawings can be executed in a computer system, for example as a set of computer-executable instructions, and that although the flowcharts show a logical order, in some cases the steps shown or described may be performed in a different order.

[0033] The method embodiments provided here can be executed on mobile terminals, computer terminals, servers, or similar computing devices. Figure 1 shows a block diagram of the hardware structure of a computing device for implementing the method for processing annotation data. As shown in Figure 1, the computing device may include one or more processors (which may include, but are not limited to, processing devices such as a microprocessor (MCU) or a programmable logic device (FPGA)), memory for storing data, and memory fo...

Embodiment 2

[0088] Figure 3 is a schematic diagram of an apparatus for processing annotation data provided by an embodiment of the present disclosure; the apparatus 300 corresponds to the method for processing annotation data according to Embodiment 1. As shown in Figure 3, the apparatus 300 includes:

[0089] An annotation data acquisition module 301, configured to acquire the annotation data annotated by an annotator;

[0090] An erroneous data determination module 302, configured to determine, from the annotation data, the erroneous annotation data by using a pre-built annotation data audit model, wherein the audit model is trained using reviewed annotation data;

[0091] An annotation data sending module 303, configured to send the erroneous annotation data to the annotator.
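
The three modules above can be sketched roughly as follows. This is only an illustration, not the patent's implementation: the `Annotation` record, the function names, and the `AuditModel` callable interface are all hypothetical, and the audit model is left as a pluggable function because the source text does not specify its form here.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List


@dataclass(frozen=True)
class Annotation:
    """Hypothetical record for one annotated item."""
    item_id: str
    content: str
    label: str
    annotator: str


# Assumed interface: the audit model returns True when a label looks wrong.
AuditModel = Callable[[Annotation], bool]


def acquire_annotations(raw_rows: List[dict]) -> List[Annotation]:
    """Sketch of module 301: turn raw rows into Annotation records."""
    return [Annotation(**row) for row in raw_rows]


def determine_wrong_annotations(
    annotations: List[Annotation], audit_model: AuditModel
) -> List[Annotation]:
    """Sketch of module 302: keep only the annotations the model flags."""
    return [a for a in annotations if audit_model(a)]


def send_to_annotators(wrong: List[Annotation]) -> Dict[str, List[str]]:
    """Sketch of module 303: group flagged item ids by annotator.

    A real system would dispatch these lists; here we only build them.
    """
    outbox: Dict[str, List[str]] = {}
    for a in wrong:
        outbox.setdefault(a.annotator, []).append(a.item_id)
    return outbox
```

With this shape, each annotator receives only the items flagged as wrong rather than the whole batch, which is the efficiency gain the text describes.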

[0092] Optionally, the erroneous data determination module 302 is specifically configured to:

[0093] Substitute the annotation data into the pre-built annotation data audit model to obtain the...

Embodiment 3

[0114] Figure 4 is a schematic diagram of an apparatus for processing annotation data provided by another embodiment of the present disclosure; the apparatus 400 corresponds to the method according to the first aspect of Embodiment 1. As shown in Figure 4, the apparatus 400 includes: a processor 410; and a memory 420, connected to the processor 410, for providing the processor 410 with instructions for the following processing steps: acquiring the annotation data annotated by an annotator;

[0115] Determining, from the annotation data, the erroneous annotation data by using a pre-built annotation data audit model, wherein the audit model is trained using reviewed annotation data;

[0116] Sending the erroneous annotation data to the annotator.

[0117] Determining the erroneous annotation data by using the pre-built annotation data audit model includes:

[0118] Substituting the labeled data i...

Abstract

The invention discloses an annotation data processing method, a device, and a storage medium. The method comprises: obtaining annotation data annotated by an annotator; determining, from the annotation data, the erroneous annotation data by using a pre-built annotation data audit model, the audit model being trained on reviewed annotation data; and sending the erroneous annotation data to the annotator. Embodiments of the invention can improve the utilization of reviewed annotation data and the efficiency of re-annotating annotation data.
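
As a rough illustration of training an audit model on reviewed annotation data and then using it to flag suspect annotations, the sketch below uses a small word-count naive Bayes classifier with add-one smoothing. The choice of classifier, the class name `ToyAuditModel`, and the text-classification setting are assumptions made for this example only; the source does not state which model type is used.

```python
import math
from collections import Counter, defaultdict
from typing import Dict, List, Set, Tuple


class ToyAuditModel:
    """Assumed stand-in: word-count naive Bayes with add-one smoothing."""

    def __init__(self) -> None:
        self.word_counts: Dict[str, Counter] = defaultdict(Counter)
        self.label_counts: Counter = Counter()
        self.vocab: Set[str] = set()

    def fit(self, reviewed: List[Tuple[str, str]]) -> "ToyAuditModel":
        # reviewed: (text, label confirmed during review)
        for text, label in reviewed:
            words = text.lower().split()
            self.label_counts[label] += 1
            self.word_counts[label].update(words)
            self.vocab.update(words)
        return self

    def predict(self, text: str) -> str:
        total = sum(self.label_counts.values())
        best_label, best_score = "", -math.inf
        for label, n in self.label_counts.items():
            # log prior plus smoothed log likelihood of each word
            score = math.log(n / total)
            denom = sum(self.word_counts[label].values()) + len(self.vocab)
            for w in text.lower().split():
                score += math.log((self.word_counts[label][w] + 1) / denom)
            if score > best_score:
                best_label, best_score = label, score
        return best_label

    def find_wrong(self, annotated: List[Tuple[str, str]]) -> List[Tuple[str, str]]:
        # Flag annotations whose label disagrees with the model's prediction.
        return [(t, l) for t, l in annotated if self.predict(t) != l]


reviewed = [
    ("great product", "positive"),
    ("love it", "positive"),
    ("terrible service", "negative"),
    ("bad quality", "negative"),
]
model = ToyAuditModel().fit(reviewed)
suspect = model.find_wrong([
    ("great love", "positive"),
    ("terrible bad service", "positive"),
    ("bad quality", "negative"),
])
```

Here the reviewed data is reused as training material, and only the flagged pairs in `suspect` would be returned to the annotator. A disagreement between model and annotator is a hint, not proof, that the annotation is wrong.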

Description

Technical field

[0001] The present application relates to the field of big data, and in particular to a method, device, and medium for processing annotation data.

Background

[0002] With the development of communication technology, the demand for annotation data in artificial intelligence and other fields is increasing. Whether in image recognition or text classification, the accuracy requirements for annotation data are high.

[0003] The current approach is for annotators to label the data manually. A portion of each batch of annotation data is then sampled for review and inspection, and the accuracy of the sampled data is calculated. If the accuracy does not meet the standard, the whole batch is judged unqualified, and the annotator must re-annotate the batch until the accuracy is qualified.

[0004] The above process has the follo...
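
The sampling-based review described in [0003] can be sketched as follows. The function name, the `reviewer` callable, and the example sample size and threshold are hypothetical; the source gives no concrete values.

```python
import random
from typing import Callable, List, Tuple


def batch_passes_review(
    batch: List[Tuple[str, str]],
    reviewer: Callable[[str], str],
    sample_size: int,
    threshold: float,
    seed: int = 0,
) -> Tuple[bool, float]:
    """Sample part of a batch, compute its accuracy, and accept or reject the batch.

    batch holds (item, annotator_label) pairs; reviewer returns the label
    a reviewer would assign to an item.
    """
    if not batch:
        raise ValueError("batch must not be empty")
    rng = random.Random(seed)
    sample = rng.sample(batch, min(sample_size, len(batch)))
    correct = sum(1 for item, label in sample if reviewer(item) == label)
    accuracy = correct / len(sample)
    return accuracy >= threshold, accuracy


# Hypothetical data: one of four labels is wrong.
truth = {"a": "x", "b": "y", "c": "z", "d": "w"}
batch = [("a", "x"), ("b", "y"), ("c", "z"), ("d", "WRONG")]
passed, accuracy = batch_passes_review(batch, truth.get, sample_size=10, threshold=0.9)
```

The result is a single pass or fail verdict for the whole batch, so a rejected batch carries no information about which individual items were wrong.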

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06K9/62, G06F16/35
CPC: G06F16/35, G06F18/40, G06F18/2415, G06F18/241
Inventor: 刘睿, 靳丁南, 罗欢, 权圣
Owner: 北京中关村科金技术有限公司