Convolutional neural network quantification method and device, computer and storage medium

A technology of convolutional neural network and quantization method, applied in neural learning methods, biological neural network models, neural architectures, etc., can solve the problems of complex implementation and high computational complexity, and can ensure acceleration effect, reduce storage space, and ensure performance. Effect

Inactive Publication Date: 2019-10-22
SHANGHAI JIAO TONG UNIV
View PDF1 Cites 19 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The purpose of the present invention is to provide a convolutional neural network quantization method, device and computer equ

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Convolutional neural network quantification method and device, computer and storage medium
  • Convolutional neural network quantification method and device, computer and storage medium
  • Convolutional neural network quantification method and device, computer and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] The present invention will be described in detail below in conjunction with specific embodiments. The following examples will help those skilled in the art to further understand the present invention, but do not limit the present invention in any form. It should be noted that those skilled in the art can make several modifications and improvements without departing from the concept of the present invention. These all belong to the protection scope of the present invention.

[0046] refer to figure 1 As shown, the flowchart of the compression and acceleration method of the convolutional neural network in the embodiment of the present invention can refer to the following steps for details:

[0047] S1, train the full-precision model of the convolutional neural network to be quantized, and calculate the standard deviation of the weight of each layer of the full-precision model and the standard deviation of the response distribution of each layer of the full-precision mod...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a convolutional neural network quantification method, which comprises the steps of training a full-precision model of a convolutional neural network to be quantified, and calculating standard deviation of weight and response distribution of each layer of the full-precision model; estimating scale factors of parameters and features of the full-precision model according to thestandard deviation and hyper-parameters of the weight and response distribution of each layer of the full-precision model; for the to-be-optimized convolutional neural network, establishing a quantization module containing scaling factor-based forward calculation and backward gradient propagation functions to obtain a corresponding quantization network; carrying out fine tuning training on the quantization network, and determining an optimal scale factor; and retraining the quantization network generated by the optimal scaling factor to obtain a final quantization neural network model. The invention further provides a convolutional neural network quantization device, a computer and a storage medium. According to the invention, the problems of complex realization and high calculation complexity of the existing model quantification method are improved.

Description

technical field [0001] The present invention relates to a deep neural network compression method, in particular to a method, device, computer equipment and storage medium for compressing and accelerating a convolutional neural network through model quantization. Background technique [0002] In today's computer vision and other technical fields, deep learning has proven to be a very useful method, and has achieved good results in tasks such as image classification, object detection, and semantic segmentation. At present, with the continuous improvement of the theory, the deep neural network model has a tendency to develop in the direction of more parameters, deeper network, and greater calculation. At the same time, the industry is gradually applying deep learning technology to specific scenarios, which puts strict requirements on the size of the model, computing performance, power consumption and other indicators. [0003] In recent years, the application of neural network...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06N3/04G06N3/08
CPCG06N3/084G06N3/045
Inventor 宋利周逸伦陈立张文军
Owner SHANGHAI JIAO TONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products