A face noise dataset CNN training method based on the overall cosine distribution

A data set and training data set technology, applied in the field of image recognition, can solve the problems of difficult acquisition of prior knowledge, large fluctuation of loss value, poor recognition effect, etc., achieving small memory resources, large representation gains, and small computing resources. Effect

Active Publication Date: 2021-05-11
BEIJING YUNSHITU INFORMATION TECH CO LTD
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0013] 2.2) The dependence on hyperparameters is serious, and the convergence of the training process is difficult;
[0014] 2.3) For symmetrical intra-class noise, the recognition effect is relatively poor;
[0017] 3.1) This method needs to pre-determine the noise ratio of the data set. Usually, this prior knowledge is difficult to obtain, thus limiting its usage scenarios;
[0018] 3.2) Since the loss value of a single small batch sample set fluctuates greatly, it is not very accurate to filter noise data based on the small loss of a single small batch sample set;
[0019] 3.3) This method requires an isomorphic dual network structure, and the video memory usage and calculation consumption are basically twice that of the general network, which is not very practical for large networks with limited video memory;

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A face noise dataset CNN training method based on the overall cosine distribution
  • A face noise dataset CNN training method based on the overall cosine distribution
  • A face noise dataset CNN training method based on the overall cosine distribution

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0065] The present invention proposes a face noise data set CNN training method based on the overall cosine distribution, which will be further described in detail below in conjunction with the accompanying drawings and specific embodiments. The embodiments described in the present invention are exemplary, and are only used for explaining the present invention, but not construed as limiting the present invention.

[0066] The present invention proposes a kind of face noise data set CNN training method based on overall cosine distribution, comprises the following steps:

[0067] 1) Select the face training data set containing noise and record it as D all (the human face training data set comprises human face sample picture and the corresponding label of each picture, can adopt off-the-shelf human face training data set, wherein the number of label categories is not less than 1000, the number of human face sample pictures of each category Not less than 10), construct a benchmar...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention proposes a face noise data set CNN training method based on overall cosine distribution, which belongs to the field of image recognition. This method first selects the face training data set containing noise and builds a benchmark convolutional neural network backbone; adds layers to the convolutional neural network backbone to obtain an updated convolutional neural network; obtains small Batch sample set, input each face picture sample in the small batch sample set into the updated network, get the corresponding cosine value and add the cosine distribution; calculate the corresponding training weight for each cosine value by designing three strategies and fusion functions , use the gradient descent algorithm to learn the updated network, and when the number of iterations reaches the upper limit, the training is terminated. The method has good robustness and can quickly and efficiently train noisy face training datasets without prior knowledge.

Description

technical field [0001] The invention relates to the field of image recognition. In particular, it relates to a face noise dataset CNN training method based on the overall cosine distribution. Background technique [0002] In recent years, deep learning technology has made rapid progress in both algorithms and hardware, and the scale of data it relies on has become larger and larger. Taking face recognition as an example, training a convolutional neural network for face recognition CNN requires large-scale training data sets. Although most of these training data sets are cleaned and filtered manually or by machines, there is still a certain proportion of noise. Face training data refers to face sample pictures and their corresponding labels, and face noise data refers to face sample pictures marked with labels that are not real labels. The IMDB-Face method mentioned that Microsoft's MS-Celeb-1M training data set contains nearly 50% noise data, and the depth model trained di...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06K9/00
CPCG06V40/172
Inventor 黄杨昱胡伟袁国栋
Owner BEIJING YUNSHITU INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products