Data set tagging method and related apparatus

This data set labeling technology, applied in the computer field, addresses the problem that labeling data the recognition model can already recognize reduces the overall training efficiency of supervised learning and keeps labeled test data from achieving its expected effect. It thereby improves the efficiency of model training and testing and the overall efficiency of supervised learning.

Inactive Publication Date: 2018-05-22
北京中关村科金技术有限公司

AI Technical Summary

Problems solved by technology

[0005] However, ordinary manual labeling inevitably labels a large amount of data that existing recognition models can already recognize. Such data does not test the recognition model any better; that is, the labeled test data fails to achieve its expected effect, which reduces the overall training efficiency of supervised learning.

Examples

Embodiment Construction

[0051] The core of this application is to provide a data set labeling method, a labeling device, a server, and a computer-readable storage medium. The data set is screened according to uncertainty to obtain an uncertain data set suitable for model processing. Labeling this data set improves the efficiency of model training and testing, achieves better results with less data, and improves the overall efficiency of supervised learning.
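The screening by uncertainty described in [0051] can be illustrated with a minimal sketch. The source does not say which uncertainty measure or cutoff is used, so the predictive entropy, the `threshold` value, and the sample ids below are all assumptions chosen for illustration.

```python
import math


def entropy(probs):
    """Shannon entropy (in nats) of a predicted class distribution."""
    return -sum(p * math.log(p) for p in probs if p > 0)


def filter_uncertain(predictions, threshold):
    """Return ids of samples whose predictive entropy is at least `threshold`.

    `predictions` maps a sample id to the class probabilities produced by an
    existing recognition model (hypothetical input format).
    """
    return [sid for sid, probs in predictions.items() if entropy(probs) >= threshold]


# Hypothetical model outputs for three unlabeled samples.
predictions = {
    "a": [0.98, 0.02],  # model is confident: little value in labeling it
    "b": [0.55, 0.45],  # model is unsure
    "c": [0.50, 0.50],  # model is maximally unsure for two classes
}

# The threshold is an assumed value, not one given in the source.
uncertain_ids = filter_uncertain(predictions, threshold=0.5)
print(uncertain_ids)
```

Samples the model already classifies confidently are dropped, so labeling effort goes to the samples the model is least sure about.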

[0052] To make the purposes, technical solutions, and advantages of the embodiments of the present application clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the drawings in the embodiments. Obviously, the described embodiments are only some of the embodiments of this application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art with...

Abstract

The invention discloses a data set tagging method. The method comprises the steps of: selecting untagged data from original data according to a preset rule to obtain a candidate data set; performing uncertainty analysis on the candidate data set and screening the data to be tagged according to the analysis result, thereby obtaining a to-be-tagged data set; and tagging the to-be-tagged data set according to received tagging information, thereby obtaining a tagged data set. Screening the data set according to uncertainty yields an uncertain data set suitable for model processing; tagging that data set improves model training and testing efficiency, achieves a better effect with less data, and improves the overall efficiency of supervised learning. The invention furthermore discloses a data set tagging apparatus, a server, and a computer-readable storage medium, which have the abovementioned beneficial effects.
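The three steps in the abstract can be sketched end to end as follows. This is an illustration under stated assumptions, not the patented implementation: the record layout, the length-based preset rule, the least-confidence score, and the `top_k` cutoff are all hypothetical, since the source does not specify them.

```python
def select_candidates(raw_data, rule):
    """Step 1: pick untagged records that satisfy a preset rule."""
    return [r for r in raw_data if r.get("label") is None and rule(r)]


def least_confidence(record):
    """Assumed uncertainty score: 1 minus the model's top class probability."""
    return 1.0 - max(record["probs"])


def screen_by_uncertainty(candidates, score_fn, top_k):
    """Step 2: rank candidates by uncertainty and keep the top_k most uncertain."""
    return sorted(candidates, key=score_fn, reverse=True)[:top_k]


def apply_labels(to_label, received):
    """Step 3: attach received tags (id -> label) to build the tagged data set."""
    return [dict(r, label=received[r["id"]]) for r in to_label if r["id"] in received]


# Hypothetical original data; "probs" stands in for an existing model's output.
raw_data = [
    {"id": 1, "text": "refund please", "probs": [0.9, 0.1], "label": None},
    {"id": 2, "text": "hm", "probs": [0.5, 0.5], "label": None},
    {"id": 3, "text": "card blocked again", "probs": [0.6, 0.4], "label": None},
    {"id": 4, "text": "late delivery", "probs": [0.55, 0.45], "label": "complaint"},
]

# Assumed preset rule: ignore texts shorter than five characters.
candidates = select_candidates(raw_data, rule=lambda r: len(r["text"]) >= 5)
to_label = screen_by_uncertainty(candidates, least_confidence, top_k=1)

# Tagging information as it might arrive from a human annotator.
labeled = apply_labels(to_label, received={3: "complaint"})
print(labeled)
```

Passing the scoring function as a parameter keeps the screening step independent of any particular uncertainty measure, so a different score can be swapped in without touching the other two steps.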

Description

Technical field

[0001] The present application relates to the field of computer technology, and in particular to a data set labeling method, labeling device, server, and computer-readable storage medium.

Background technique

[0002] With the development of information technology, machine learning has been applied in more and more fields to handle problems in different application scenarios more efficiently. Machine learning mainly trains on a large amount of data to obtain a more accurate recognition model. At the same time, the recognition model must be tested continuously against the original data to judge whether it meets the learning requirements.

[0003] The current mainstream form of machine learning is still supervised learning, in which labeled data is indispensable. With the further development of the Internet, a large amount of data is generated every day, and these data are messy, unlabeled...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
CPC: G06F16/215, G06F16/24573
Inventor: 李云彬, 权圣
Owner: 北京中关村科金技术有限公司