Non-zero filling training method for binary convolutional neural network

A binary convolution neural and binary network technology, applied in the field of image processing, can solve the problems of performance degradation and inability to enjoy the compression ratio of binary networks, and achieve the effect of compensating for performance degradation and reducing losses

Inactive Publication Date: 2020-09-08
BEIHANG UNIV
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Since the current binary network research field has not paid attention to this problem, in fact, various results reported in various studies are based on the {-1, +1, 0} mixed activation "pseudo-binary network ", cannot enjoy the extremely high compression ratio of the binary network in practical applications
It has been verified by experiments that when the padding is 1, it is a non-zero-filled binary network whose activation is {-1, +1} after quantization. Compared with the zero-filled binary network with the same structure, its performance is very obvious. Decline
How to make up for the performance degradation of the non-zero padding binary network brought about by correcting the padding value, so that it can run with high precision and high compression ratio in actual deployment in the future, has become a problem that needs to be further studied.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Non-zero filling training method for binary convolutional neural network
  • Non-zero filling training method for binary convolutional neural network
  • Non-zero filling training method for binary convolutional neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] The present invention will be described in further detail below in conjunction with the accompanying drawings.

[0025] The present invention is used for the non-zero padding training method of binary convolutional neural network, such as figure 1 As shown, the specific method includes the following steps:

[0026] Step 1: Prepare training samples that are applied to a specific vision task.

[0027] In the present invention, the image classification task is used as a training task, and the training set in the CIFAR-100 data set is used as a training sample. There are 100 categories of training samples, each category has 500 images, and the total number of samples is 50,000. The i-th training sample can be expressed as (x i ,y i ), where: x i represents the image data, y i Indicates the category label corresponding to the image. The image data of each sample is randomly cropped and flipped during training to enhance data and alleviate network overfitting problems. ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a non-zero filling training method for a binary convolutional neural network, and the method comprises the steps: firstly improving a loss function in the training of a commonneural network, constructing a joint loss function through employing a knowledge distillation theory, and achieving the guide training of a zero filling binary network for the non-zero filling binarynetwork. And then, a progressive training method is adopted to train the non-zero filling binary network, and on the basis of the zero filling binary network, the number of activated non-zero fillingbinary values is gradually increased, so that the training difficulty of the non-zero filling binary network is reduced. According to the method, the pseudo binary activation problem in the zero-filling binary network is corrected, the training difficulty of the non-zero-filling binary network is effectively reduced by combining the joint loss function and the progressive training method, and theproblem that the performance of the non-zero-filling binary network is reduced after the filling value is corrected is greatly reduced.

Description

technical field [0001] The invention belongs to the field of image processing, and specifically refers to a non-zero padding training method for a binary convolutional neural network. Background technique [0002] In the past ten years, due to its great advantages over shallow models in feature extraction and model construction, deep learning has attracted more and more researchers' attention, and has achieved rapid development in fields such as computer vision and text recognition. Deep learning takes deep neural network as its main presentation form, and convolutional neural network (CNN) is a pioneering research result inspired by biological neuroscience. Compared with the traditional method, the convolutional neural network has the characteristics of weight sharing, local connection, pooling operation, etc., so it can effectively reduce the global optimization training parameters, reduce the complexity of the model, and make the network model more effective for input sca...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/04G06N3/08
CPCG06N3/08G06N3/045
Inventor 丁文锐李越刘春蕾
Owner BEIHANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products