Method for distributing crowdsourcing strategies based on optimal data grouping

A data grouping and distribution technique applied in the field of crowdsourcing data. It addresses problems such as differences in the difficulty of data labeling, and achieves the effects of improving labeling accuracy and reducing the financial budget.

Inactive Publication Date: 2017-08-18
EAST CHINA NORMAL UNIV
Cites: 4; Cited by: 11

AI Technical Summary

Problems solved by technology

[0003] However, in the process of crowdsourcing, data items differ in how difficult they are for annotators to label.
The traditional unified allocation crowdsourcing strategy, which assigns the same number of annotators to every sample to be labeled, has certain defects with respect to the budget.

Examples

Embodiment Construction

[0023] The present invention is described in further detail below with reference to specific embodiments and the accompanying drawings. Except for the content specifically mentioned below, the processes, conditions, and experimental methods used to implement the invention are common knowledge in this field, and the invention places no special limitation on them.

[0024] As shown in Figure 1, a crowdsourcing strategy allocation method based on optimal data grouping, according to an embodiment of the present invention, includes the following steps:

[0025] Step 1: Select an available grouping method based on crowdsourcing data;

[0026] Step 2: Use the coverage algorithm to extract samples from each group and hand them over to the crowdsourcing platform for labeling;

[0027] Step 3: Calculate the labeling accuracy for the samples drawn by each grouping method;

[0028] Step 4: Calculate the difference degree of labeling accu...
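
The grouping-selection steps can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patented procedure: plain random sampling stands in for the coverage algorithm, accuracy is measured against a hypothetical set of reference labels for the sampled items, and the "difference degree" is assumed to be the population variance of the per-group accuracies (the text here does not define it). All function names are invented for the example, and every group is assumed to be non-empty.

```python
import random
from statistics import pvariance


def sample_per_group(groups, k, rng):
    """Draw up to k item ids from each group.

    Plain random sampling is a stand-in; it is NOT the coverage algorithm.
    """
    return {g: rng.sample(sorted(ids), min(k, len(ids))) for g, ids in groups.items()}


def group_accuracy(sampled, crowd_labels, reference_labels):
    """Share of sampled items per group whose crowd label matches the reference."""
    return {
        g: sum(crowd_labels[i] == reference_labels[i] for i in ids) / len(ids)
        for g, ids in sampled.items()
    }


def difference_degree(accuracies):
    """Assumed measure: population variance of the per-group accuracies."""
    return pvariance(list(accuracies.values()))


def select_grouping(groupings, crowd_labels, reference_labels, k=20, seed=0):
    """Return (name of the grouping with the highest difference degree, all scores)."""
    rng = random.Random(seed)
    scores = {}
    for name, groups in groupings.items():
        sampled = sample_per_group(groups, k, rng)
        accuracies = group_accuracy(sampled, crowd_labels, reference_labels)
        scores[name] = difference_degree(accuracies)
    return max(scores, key=scores.get), scores
```

Here `groupings` maps each candidate grouping method to its groups, and each group is a set of item ids. A grouping that separates easy items from hard ones yields widely spread accuracies and therefore a high score, which is why the highest difference degree is taken as the optimal grouping.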

Abstract

The invention discloses a method for distributing crowdsourcing strategies based on optimal data grouping. The method comprises the steps of: selecting the available grouping modes according to the crowdsourcing data; taking samples from each group using a coverage algorithm and delivering the samples to a crowdsourcing platform for labeling; calculating the labeling accuracy of the samples taken under each grouping mode; calculating the difference degree of labeling accuracy for each grouping mode, and selecting the grouping mode with the highest difference degree as the optimal grouping mode; and distributing crowdsourcing strategies according to that grouping. The beneficial effects are that, by grouping data according to labeling difficulty and applying linear programming optimization, the defects of the traditional strategy of distributing crowdsourcing strategies in a unified manner are overcome, the financial budget of the crowdsourcing process is reduced, and the accuracy of the collected data is also improved to a certain extent.
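
To illustrate why allocating by difficulty group can cost less than a unified allocation, here is a small Python sketch. It is not the linear programming optimization the abstract mentions; it swaps in a simple per-group search. It also assumes binary labels, independent annotators, majority voting, and a known per-annotator accuracy for each group. The function names, the target accuracy, and the example numbers are all hypothetical.

```python
from math import comb


def majority_vote_accuracy(p, n):
    """Probability that a majority of n independent annotators is correct.

    Each annotator is assumed correct with probability p; n must be odd.
    """
    return sum(
        comb(n, k) * p**k * (1 - p) ** (n - k) for k in range(n // 2 + 1, n + 1)
    )


def annotators_needed(p, target, max_n=15):
    """Smallest odd n whose majority-vote accuracy reaches target (capped at max_n)."""
    for n in range(1, max_n + 1, 2):
        if majority_vote_accuracy(p, n) >= target:
            return n
    return max_n


def allocate(groups, target, max_n=15):
    """groups: {name: (item_count, per-annotator accuracy)} -> annotators per item."""
    return {g: annotators_needed(p, target, max_n) for g, (_, p) in groups.items()}


def total_cost(groups, plan):
    """Total number of paid labels implied by a plan."""
    return sum(count * plan[g] for g, (count, _) in groups.items())


# Hypothetical example: 100 easy items and 50 hard items.
groups = {"easy": (100, 0.95), "hard": (50, 0.70)}
plan = allocate(groups, target=0.90)
uniform = {g: max(plan.values()) for g in groups}  # same count for every item
```

With these made-up numbers, the grouped plan buys extra labels only for the hard group, while the unified plan pays the hard-group rate for every item.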

Description

Technical field

[0001] The present invention relates to crowdsourcing data, and in particular to a crowdsourcing strategy allocation method based on optimal data grouping.

Background technique

[0002] Many data labeling tasks, such as entity matching, sentiment analysis, and image annotation, are difficult for traditional machine learning to complete. Such tasks are usually handed over to manual labeling. There are several approaches to manual labeling. One is to select experts for labeling, which demands a large budget and considerable time and is therefore quite limited. Another is crowdsourcing, in which the data is published and handed over to members of the public for labeling; in this process, the publisher only needs to pay contributors a small remuneration.

[0003] However, in the process of crowdsourcing, data items differ in how difficult they are for annotators to label. Using the traditional unified allocation crowdsourcing strategy, assigning...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06Q10/04, G06Q10/06, G06Q10/10, G06K9/62
CPC: G06Q10/04, G06Q10/06311, G06Q10/101, G06F18/23213, G06F18/24
Inventor: 杨静, 江雨, 陈博闻
Owner: EAST CHINA NORMAL UNIV