Data annotation method

A data labeling and data technology, applied in the field of technology foresight, can solve problems such as incompetence in data labeling work, and achieve the effect of improving analysis efficiency, improving labeling accuracy, and improving capabilities

Inactive Publication Date: 2018-02-23
HUAZHONG UNIV OF SCI & TECH +1
View PDF7 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Based on the above reasons, the data labeling task of technical foresight has high domain knowledge requirements for the labeler, and the technology disclosed in the Chinese patent application with the publication number CN106489149A is not competent for the data labeling work in the field of technology foresight

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Data annotation method
  • Data annotation method
  • Data annotation method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] The present invention will be described below according to the embodiments shown in the accompanying drawings. It can be thought that embodiment disclosed this time is an illustration in every point, and is not restrictive.

[0045] figure 1 It is a schematic diagram of the architecture of the data labeling system in this embodiment. Such as figure 1 As shown, the multi-source heterogeneous data labeling system includes a terminal 1 for task publishers, a labeling platform 2 and a terminal 3 for labelers. The labeling platform 2 is communicatively connected to the task issuer terminal 1 and the labeler terminal 3 through the networks 4 and 5 respectively. The above-mentioned terminal 1 for the task issuer and the terminal 3 for the annotator may be terminal devices such as personal computers, Pads, and mobile phones. The above-mentioned labeling platform 2 may be a platform device such as a server. The above-mentioned networks 4 and 5 may be wired networks or wirel...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a data annotation method. The method comprises the step of data annotation task allocation, wherein according to a data identification code of to-be-annotated data and an identification code of an annotator, a to-be-annotated data annotation task is matched with the annotator, and according to a matching result, the to-be-annotated data annotation task is allocated to the annotator; the step of data annotation, wherein according to the required annotation form, the to-be-annotated data is annotated; the step of collection and integration, wherein after the annotation results of the to-be-annotated data annotation task are all submitted, according to the annotation scores of the annotator and the annotation results, the annotation result is integrated, and an accuratelabel is doped out.

Description

Technical field: [0001] The invention relates to the technical foresight field, in particular to a multi-source heterogeneous data labeling system based on swarm intelligence. technical background: [0002] In recent years, with the rapid development of computer technology and the Internet, various forms of big data have emerged. However, the increase in the amount of data has made it extremely difficult and expensive to manually label corpus. Labeling and application challenges, so the technology crowdsourcing platform came into being. However, crowdsourcing platforms have disadvantages such as large investment, low efficiency, small amount of data processing, and unguaranteed annotation quality. [0003] For the above technical problems, the Chinese patent application with publication number CN106489149A discloses a data labeling method and system based on data mining and crowdsourcing. This patent proposes a unique method to mark the labeling results during the labeling...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/2465
Inventor 陈吉红陈峥周源杨建中刘宇飞张凯林亨董放
Owner HUAZHONG UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products