Data labeling method and device and data processing equipment

A data labeling technology, applied in the field of data processing, that addresses the low efficiency and error-proneness of manual labeling and the resulting low model accuracy.

Pending Publication Date: 2020-06-26
BEIJING DIDI INFINITY TECH & DEV

AI Technical Summary

Problems solved by technology

[0003] At present, labeled data is mainly obtained by manually adding labels, which is inefficient and error-prone and lowers the accuracy of the final trained model.



Examples


Embodiment Construction

[0078] To make the purpose, technical solutions, and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the drawings. It should be understood that the appended figures are for illustration and description only and do not limit the protection scope of the application. Furthermore, the schematic drawings are not drawn to scale. The flowcharts used in this application illustrate operations implemented by some embodiments of the application. It should be understood that the operations of the flowcharts may be performed out of order, and steps that have no logical dependency on one another may be performed in reverse order or concurrently. In addition, those skilled in the art may add one or more other operations to the flowchart or remove one ...


Abstract

The invention provides a data labeling method, a data labeling device, and data processing equipment. The method comprises: performing at least one iteration of processing on a classification model until the accuracy of the classification model meets a preset condition; and processing at least part of a plurality of pieces of to-be-labeled data with the resulting classification model to obtain an automatic labeling result. Each iteration comprises the following steps: inputting the to-be-labeled data other than a target data set into the classification model to obtain classification results; selecting, from that other data, at least some items whose classification confidence falls within a preset range, and adding the selected items to the target data set; and training the classification model on the manual labeling results of the to-be-labeled data in the target data set. In this way, batch data can be labeled automatically while the quality of the labels is improved.
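The iteration described in the abstract can be sketched roughly as follows. This is a minimal illustration, not the applicant's implementation: the 1-D nearest-centroid classifier, the `oracle` function standing in for a human annotator, the manually labeled seed items, the held-out validation set used to measure accuracy, and all parameter names and default values (`conf_range`, `batch`, `target_acc`, `max_iters`) are assumptions introduced here for the example.

```python
import math
import random


def train(pool, manual):
    """Fit a 1-D nearest-centroid model on the manually labeled items."""
    centroids = {}
    for cls in (0, 1):
        xs = [pool[i] for i, y in manual.items() if y == cls]
        centroids[cls] = sum(xs) / len(xs)
    return centroids[0], centroids[1]


def predict(model, x):
    """Return (label, confidence) for one item."""
    c0, c1 = model
    score = abs(x - c0) - abs(x - c1)  # > 0 means x is closer to class 1
    p1 = 1.0 / (1.0 + math.exp(-score))
    return (1, p1) if p1 >= 0.5 else (0, 1.0 - p1)


def accuracy(model, items, labels):
    hits = sum(predict(model, x)[0] == y for x, y in zip(items, labels))
    return hits / len(items)


def auto_label(pool, oracle, seed, val_x, val_y, conf_range=(0.5, 0.8),
               batch=5, target_acc=0.95, max_iters=20):
    # Assumption: the target data set starts from a few manually labeled seeds.
    target = set(seed)
    manual = {i: oracle(pool[i]) for i in target}
    model = train(pool, manual)
    lo, hi = conf_range
    for _ in range(max_iters):
        # 1. Classify the to-be-labeled data outside the target data set.
        others = [i for i in range(len(pool)) if i not in target]
        # 2. Select items whose confidence lies within the preset range.
        picked = [i for i in others
                  if lo <= predict(model, pool[i])[1] <= hi][:batch]
        if not picked:
            break
        # 3. Add them to the target set and obtain their manual labels.
        for i in picked:
            target.add(i)
            manual[i] = oracle(pool[i])
        # 4. Retrain on the manual labels of the target data set.
        model = train(pool, manual)
        # Stop once accuracy meets the preset condition (assumed metric).
        if accuracy(model, val_x, val_y) >= target_acc:
            break
    # Label the remaining data automatically with the final model.
    auto = {i: predict(model, pool[i])[0]
            for i in range(len(pool)) if i not in target}
    return model, manual, auto


def oracle(x):
    """Hypothetical annotator: class 1 if x > 5, else class 0."""
    return 1 if x > 5.0 else 0


rng = random.Random(0)
pool = [rng.uniform(0.0, 10.0) for _ in range(200)]
# Seed with the smallest and largest items so both classes are represented.
seed = [pool.index(min(pool)), pool.index(max(pool))]
val_x = [0.25 + 0.5 * k for k in range(20)]
val_y = [oracle(x) for x in val_x]
model, manual, auto = auto_label(pool, oracle, seed, val_x, val_y)
print(len(manual), "manually labeled;", len(auto), "automatically labeled")
```

Only the items the model is least sure about are sent to the annotator in this sketch; everything outside the target data set receives its label from the final model.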

Description

Technical field

[0001] The present application relates to the technical field of data processing, and in particular, to a data labeling method, device, and data processing equipment.

Background

[0002] With the development of computer technology, machine learning algorithms are applied more and more widely, and supervised learning algorithms are among the most commonly used. Supervised learning algorithms usually need a large amount of labeled data to train a pre-established recognition model. The quantity and accuracy of the labeled data directly affect the accuracy of the trained recognition model.

[0003] At present, labeled data is mainly obtained by manually adding labels, which is inefficient and error-prone, resulting in low accuracy of the final trained model.

Summary of the invention

[0004] In view of this, the purpose of the embodiments of the present application is to provide a data labeling method, device...


Application Information

IPC(8): G06K9/62
CPC: G06F18/214, G06F18/24
Inventor: 冯浩, 徐江, 王鹏
Owner: BEIJING DIDI INFINITY TECH & DEV