Extended category type data quick labeling method

A data and fast technology, applied in the computer field, can solve the problems of high labeling cost and reliance on labeling tools, and achieve the effect of expanding data categories and saving labor costs and time costs

Pending Publication Date: 2019-10-15
SHANDONG LINGNENG ELECTRONIC TECH CO LTD +2
View PDF6 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The method can label the original data set, which is more efficient than manual labeling, but still has limitations such as relying on labeling tools and high labeling costs

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Extended category type data quick labeling method
  • Extended category type data quick labeling method
  • Extended category type data quick labeling method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0027] A fast labeling method for expanded category data, such as figure 1 As shown, including the following steps:

[0028] (1) Obtain the initial data with the unified label A. The label A refers to a certain category; for example, the initial label is Vehicle; it means: directly download the public data set with the unified label A, or download based on the public Network big data training models to reason about private data sets. For example, a model based on public network big data training can label all the data as vehicle, and then use it directly to label all private data sets as vehicles to obtain the initial Data set to obtain the initial data set with uniform label A. In this embodiment, 10,000 pieces of data with a unified label of vehicle are acquired, and they are further divided into car, bus, and motobike according to requirements.

[0029] (2) According to requirements, combined with the existing data set, A is further divided into multiple sub-categories, and the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to an extended category type data rapid labeling method. The method comprises the following steps of (1) obtaining an initial data set with a unified label A; (2) further dividing A into a plurality of subclasses such as a, b, c and the like in combination with the existing data set; (3) selecting a part of the initial data set as a sample, manually modifying the annotation file, and quickly renaming the annotation as a, b, c and the like from A to obtain an annotation sample; (4) based on the annotation sample, training a model by using a neural network, and pre-annotating residual data in the initial data set by using the obtained model to obtain pre-annotated data; (5) performing manual fine adjustment on the pre-annotated data to ensure that all related data labels are accurate, performing model training optimization based on the initial data and the fine adjustment data, training a better model based on more annotation samples, and finally accurately annotating all the data to improve the annotation efficiency; the method can meet the requirements of quick and accurate labeling, and better serves the practical application.

Description

Technical field [0001] The present invention belongs to the field of computer technology, and more specifically relates to a method for rapid labeling of expanded category data. Background technique [0002] In recent years, the attention of artificial intelligence technology has continued to increase, and the collection of a large number of social capital, intelligence, and data resources has driven the continuous development of artificial intelligence technology research. The classic technology of artificial intelligence is machine learning technology, and the emergence of deep learning technology enables many applications of machine learning, expanding the scope of artificial intelligence, and its foundation is to add annotation information to a large amount of data. Fast and accurate completion of data labeling is a key step in the realization of topics and projects. [0003] At present, the common data labeling method is manual labeling with the help of data labeling tools, w...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06N3/04G06N3/08
CPCG06N3/08G06N3/045G06F18/241
Inventor 朱顺意范继辉瞿明军李广立刘雪健周莉巩志远陈建学杜来民邓国超白玥寅张松周雨晨
Owner SHANDONG LINGNENG ELECTRONIC TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products