Sample data processing method and device, server and storage medium

A sample data and processing method technology, applied in the Internet field, can solve problems such as differences in understanding and mastery, affecting the quality of sample data labeling, confusion or errors, etc.

Pending Publication Date: 2019-09-20
ADVANCED NEW TECH CO LTD
View PDF6 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, different labelers often have different understandings and grasps of labeling rules and sample data. As a result, after the same sample data is marked by different labelers, there may be multiple different labeling information, which makes subsequent use There will be confusion or errors in the above-mentioned labeled sample data, which will affect the labeling quality of the sample data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sample data processing method and device, server and storage medium
  • Sample data processing method and device, server and storage medium
  • Sample data processing method and device, server and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0020] In order to enable those skilled in the art to better understand the technical solutions in this specification, the technical solutions in the embodiments of this specification will be clearly and completely described below in conjunction with the drawings in the embodiments of this specification. Obviously, the described The embodiments are only some of the embodiments in this specification, not all of them. Based on the embodiments in this specification, all other embodiments obtained by persons of ordinary skill in the art without creative efforts shall fall within the protection scope of this specification.

[0021] Considering the labeling method based on the existing sample data, the labeler is susceptible to personal subjective influence when marking, resulting in the quality of labeling cannot be guaranteed. At the same time, if multiple labelers are called to mark the same batch of sample data at the same time, because different labelers have different understa...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a sample data processing method and device, a server and a storage medium. The method comprises the steps that multiple pieces of target sample data are acquired, and the target sample data carry annotation information; a marking information entropy of the target sample data is determined according to marking information carried by the target sample data; and according to the labeling information entropy of the target sample data, first target data of which the labeling quality meets a preset quality requirement is determined from the plurality of target sample data. In the embodiment of the specification, the consistency degree of different labeling sources for labeling the same sample data is quantified by firstly determining the labeling information entropy capable of reflecting the uncertainty of the labeling information of the target sample data; therefore, the target sample data with relatively high labeling quality can be screened out according to the labeling information entropy to be used as the first target data, so that the data with relatively high labeling quality can be efficiently and accurately screened out from the plurality of target sample data, and the data error is reduced.

Description

technical field [0001] This specification belongs to the technical field of the Internet, and in particular relates to a sample data processing method, device, server and storage medium. Background technique [0002] When using sample data for model training, it is usually necessary to label the sample data used first. [0003] For example, the labeler in charge of labeling usually analyzes and judges the attributes of each sample data according to the pre-determined labeling rules, and then sets corresponding labeling information for each sample data according to the judgment results to indicate the attributes of the sample data Features (such as the type or level corresponding to the sample data, etc.), to complete the labeling of the sample data. Then, specific model training can be carried out based on the above-mentioned labeled sample data. [0004] When labeling the sample data according to the labeling rules, the labeler may be subject to personal subjective influe...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/24G06F18/214
Inventor 郭亚赵智源周书恒祝慧佳
Owner ADVANCED NEW TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products