Data marking method, device, storage medium and electronic equipment

A data labeling technology, applied in the computer field, that addresses problems such as the large number of sample labels required, the scarcity of black sample labels, and the difficulty of collecting black samples.

Active Publication Date: 2021-05-11
HANGZHOU FRAUDMETRIX TECH CO LTD

AI Technical Summary

Problems solved by technology

[0008] Many sample labels are required, and manual labeling is costly. Moreover, in the Internet field, black samples are difficult to collect, so black sample labels are few and the available sample labels are of a single kind.


Embodiment Construction

[0030] Example embodiments will now be described more fully with reference to the accompanying drawings. Example embodiments may, however, be embodied in many forms and should not be construed as limited to the embodiments set forth herein; rather, these embodiments are provided so that this disclosure will be thorough and complete, and will fully convey the concept of example embodiments to those skilled in the art. The same reference numerals denote the same or similar parts in the drawings, and thus their repeated descriptions will be omitted.

[0031] Furthermore, the described features, structures, or characteristics may be combined in any suitable manner in one or more embodiments. In the following description, numerous specific details are provided in order to give a thorough understanding of embodiments of the invention. However, those skilled in the art will appreciate that the technical solutions of the present invention may be practiced without one or more of the ...


Abstract

Embodiments of the present invention provide a data labeling method, device, storage medium, and electronic equipment. The method includes: acquiring the labels of some of the data in the target data; determining the hyperparameters corresponding to the current algorithm; acquiring a model constructed based on the hyperparameters, and obtaining predicted values of the target data based on the model; sorting the target data based on the predicted values, extracting a first preset ratio of the target data based on the sorting, and performing a binning operation; determining whether, for two adjacent bins, the concentration of black labels in the target data of the earlier bin is greater than the concentration of black labels in the target data of the later bin; and, if so, extracting a second preset ratio of the target data based on the sorting, marking that target data with a black label, and updating the target data based on the black label. Compared with the data labeling methods proposed in the related art, this allows a large amount of data to be labeled based on a small number of labels.
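The sort, bin, compare, and mark steps of the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes the predicted values have already been produced by some model (model construction and hyperparameter selection are left out), that "concentration" means the share of known black labels within a bin, and that the comparison must hold for every pair of adjacent bins. The function names, the encoding of labels (`1` for black, `None` for unlabeled), and the default ratios are all hypothetical.

```python
from typing import List, Optional, Sequence, Tuple


def black_concentrations(labels: Sequence[Optional[int]],
                         order: Sequence[int],
                         top_n: int,
                         n_bins: int) -> List[float]:
    """Share of known black labels (1) in each bin of the top_n ranked items."""
    top = order[:top_n]
    size = top_n // n_bins
    concentrations = []
    for b in range(n_bins):
        # The last bin absorbs any remainder left by integer division.
        end = (b + 1) * size if b < n_bins - 1 else top_n
        chunk = top[b * size:end]
        blacks = sum(1 for i in chunk if labels[i] == 1)
        concentrations.append(blacks / len(chunk))
    return concentrations


def propagate_black_labels(scores: Sequence[float],
                           labels: Sequence[Optional[int]],
                           first_ratio: float = 0.5,
                           second_ratio: float = 0.1,
                           n_bins: int = 5) -> Tuple[List[Optional[int]], bool]:
    """Return (updated labels, whether the concentration check passed)."""
    # Sort item indices by predicted value, highest first.
    order = sorted(range(len(scores)), key=lambda i: scores[i], reverse=True)
    top_n = int(len(scores) * first_ratio)
    updated = list(labels)
    if top_n < n_bins:
        return updated, False  # too little data to form the bins

    conc = black_concentrations(labels, order, top_n, n_bins)
    # Assumption: every earlier bin must be strictly "blacker" than the next.
    passed = all(a > b for a, b in zip(conc, conc[1:]))
    if passed:
        # Mark the top second_ratio of the ranked data as black.
        for i in order[:int(len(scores) * second_ratio)]:
            updated[i] = 1
    return updated, passed
```

As a usage example under the same assumptions, with 20 items ranked by descending score, three known black labels among the top five and one among the next five, calling `propagate_black_labels(scores, labels, first_ratio=0.5, second_ratio=0.2, n_bins=2)` passes the check and marks the four highest-scoring items as black. If the known black labels were concentrated in the second bin instead, the labels would be returned unchanged.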

Description

Technical field

[0001] The present invention relates to the field of computer technology, and in particular to a data labeling method, device, storage medium and electronic equipment.

Background technique

[0002] With the development of information technology, data-based prediction and processing are becoming increasingly common. In some scenarios, data must be labeled so that the labeled data can be used for further processing.

[0003] Some data labeling methods have been proposed in related technologies, for example:

[0004] The first type of method uses a large amount of manpower to label the data. For example, the labeling is outsourced to 100 people, and a supervised algorithm is then used for modeling.

[0005] The second type of method uses a label propagation algorithm to propagate labels from the few labels that already exist.

[0006] The third type of method uses active learning algorithms, manually labeling the samples that the model recognizes poorly.

[0007] In the process of realizing the...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F16/955, G06F16/906, G06K9/62
CPC: G06F16/955, G06F16/906, G06F18/2411
Inventors: 张文会, 廖剑
Owner: HANGZHOU FRAUDMETRIX TECH CO LTD