Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Neural network training method and device suitable for long-tail distribution data set

A technology for distributing data and neural networks, applied in the field of neural network training methods and devices

Pending Publication Date: 2020-12-18
TSINGHUA UNIV
View PDF0 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] Most of the above three methods can alleviate the difficulty of neural network identification under the long-tail distribution, but they also have their own limitations, and at the same time, the analysis of the characteristics of the neural network itself is insufficient.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Neural network training method and device suitable for long-tail distribution data set
  • Neural network training method and device suitable for long-tail distribution data set
  • Neural network training method and device suitable for long-tail distribution data set

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] Embodiments of the present invention are described in detail below, and examples of the embodiments are shown in the drawings, wherein the same or similar reference numerals denote the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary and are intended to explain the present invention and should not be construed as limiting the present invention.

[0039] Long-tail distribution is a form of data distribution that is widely applicable to human daily life. Taking the download of songs and software on the Internet as an example, the top few popular songs and software will be downloaded in large numbers. The number of downloads of a large number of songs and software is getting lower and lower, but even songs and software with very low popularity still maintain a certain amount of downloads. This type of data distribution belongs to the long-tail distribution. In add...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a neural network training method and device suitable for a long-tail distribution data set. The neural network comprises a feature extraction network, a classifier and a category gradient reweighting network. The training method comprises the following steps: obtaining a training sample set; extracting, bythe feature extraction network, features from the training sample setto obtain features, classifying the features through a classifier, and establishing a loss function according to a classification result; calculating the gradient of each neuron in the feature extraction network in the training sample according to the loss function; and in the back propagation process of neural network training, calculating, by the class gradient reweighting network, the reweighting gradient weight of the training sample, and adjusting the gradients of the training samples belonging to different classes according to the reweighting gradient weight. Therefore, the method solvesa problem that the recognition accuracy of the neural network is reduced under the training data of the long-tail distribution, alleviates the overfitting phenomenon of the feature extraction network, and improves the recognition accuracy and robustness of the deep neural network under the long-tail distribution.

Description

technical field [0001] The present invention relates to the technical fields of artificial intelligence and deep learning, in particular to a neural network training method and device suitable for long-tail distribution data sets. Background technique [0002] With the rapid development of deep learning and neural networks, deep learning techniques are widely used in computer vision applications, such as object recognition, object detection, semantic segmentation, etc. Training a neural network often requires training data with sufficient data volume and balanced data distribution. However, collecting such training data often requires a lot of manpower and material resources. In the early target recognition algorithms based on neural networks, data sets with balanced data and small data volume were often used, such as MNIST and CIFAR. The former is training data for handwritten digit recognition, and the latter is training data for general object recognition, and its categ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06N3/08G06K9/62
CPCG06N3/084G06F18/214G06F18/24
Inventor 丁贵广项刘宇
Owner TSINGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products