Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

An Image Recognition Method Based on Improved Focal Loss Function

A loss function and image recognition technology, which is applied in the field of multi-label image recognition, can solve the problems of not fully utilizing all samples, lack of partial sample data, and huge calculation amount, so as to reduce the amount of calculation, increase the loss value, and reduce the loss value. Effect

Active Publication Date: 2022-07-26
NANJING UNIV OF POSTS & TELECOMM
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

For some methods to solve the sample imbalance, some do not make full use of all samples, resulting in the lack of part of the sample data, and some use all the samples, which makes the calculation amount very large and increases the cost

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An Image Recognition Method Based on Improved Focal Loss Function
  • An Image Recognition Method Based on Improved Focal Loss Function
  • An Image Recognition Method Based on Improved Focal Loss Function

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0028] In order to illustrate the technical solution of the present invention more clearly, the technical solution of the present invention is further described in detail below in conjunction with the accompanying drawings:

[0029] The present invention proposes an image recognition method based on the improved Focal loss function, such as figure 1 shown, including the following steps:

[0030] Step 1: First divide the samples into training set, test set and remaining sample set, and then use the training set to train the convolutional neural network model based on the improved Focal loss function.

[0031] Train an improved model M based on Focal loss, such as figure 2As shown, the convolutional neural network model has five hierarchical structures: input layer, convolutional layer, activation layer, pooling layer, and fully connected layer. In this convolutional neural network structure, the input layer inputs image information, and the size of each image is 128*128. Th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

This patent discloses an image recognition method based on an improved Focal loss function. On the basis of the existing Focal loss function, the modulation factor of the Focal loss function is improved, so that the function pays more attention to difficult samples, and the The attention of simple samples is relatively reduced; then, based on the convolutional neural network model based on the Focal loss function, the remaining negative sample sets are predicted, all difficult samples are screened, and divided into N equal parts, which are added to the original training set respectively, N new training sets are formed, then multiple models are trained, and the final predicted image label result is determined according to the voting selection of the N models. On the basis of the original Focal loss function, the present invention pays more attention to difficult samples and improves the generalization ability of the model; in the case of too many negative samples, the original model is used to screen out difficult samples and add them to the training set, not only The accuracy of the model is improved, the available samples are fully utilized, and the amount of computation when training the model is reduced.

Description

technical field [0001] The invention belongs to the technical field of multi-label image recognition based on deep learning, and in particular relates to an image recognition method based on an improved Focal loss function. Background technique [0002] Convolutional Neural Network (CNN) is a neural network specially designed for image recognition problems. CNN can effectively obtain the features of the original image, and it rarely goes through the preprocessing process. Compared with traditional machine learning, the convolutional neural network can automatically extract the image features, which originates from two important characteristics of the convolutional neural network: sparse connection and weights are shared. Sparse connection means that the nodes of the convolutional layer are only connected to some nodes of the previous layer, and are only used to learn local features. Weight sharing means that in the process of scanning the image with the convolution kernel,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06V10/764G06K9/62G06V10/774G06V10/82G06N3/04G06N3/08
CPCG06N3/04G06N3/08G06F18/241G06F18/214
Inventor 周世界孙广鹏
Owner NANJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products