Convolutional neural network quantification method and device, computer and storage medium

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of convolutional neural network and quantization method, applied in neural learning methods, biological neural network models, neural architectures, etc., can solve the problems of complex implementation and high computational complexity, and can ensure acceleration effect, reduce storage space, and ensure performance. Effect

Inactive Publication Date: 2019-10-22

SHANGHAI JIAO TONG UNIV

View PDF1 Cites 19 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] The purpose of the present invention is to provide a convolutional neural network quantization method, device and computer equipment, to improve the existing convolutional neural network quantization method complex implementation, high computational complexity

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0045] The present invention will be described in detail below in conjunction with specific embodiments. The following examples will help those skilled in the art to further understand the present invention, but do not limit the present invention in any form. It should be noted that those skilled in the art can make several modifications and improvements without departing from the concept of the present invention. These all belong to the protection scope of the present invention.

[0046] refer to figure 1 As shown, the flowchart of the compression and acceleration method of the convolutional neural network in the embodiment of the present invention can refer to the following steps for details:

[0047] S1, train the full-precision model of the convolutional neural network to be quantized, and calculate the standard deviation of the weight of each layer of the full-precision model and the standard deviation of the response distribution of each layer of the full-precision mod...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a convolutional neural network quantification method, which comprises the steps of training a full-precision model of a convolutional neural network to be quantified, and calculating standard deviation of weight and response distribution of each layer of the full-precision model; estimating scale factors of parameters and features of the full-precision model according to thestandard deviation and hyper-parameters of the weight and response distribution of each layer of the full-precision model; for the to-be-optimized convolutional neural network, establishing a quantization module containing scaling factor-based forward calculation and backward gradient propagation functions to obtain a corresponding quantization network; carrying out fine tuning training on the quantization network, and determining an optimal scale factor; and retraining the quantization network generated by the optimal scaling factor to obtain a final quantization neural network model. The invention further provides a convolutional neural network quantization device, a computer and a storage medium. According to the invention, the problems of complex realization and high calculation complexity of the existing model quantification method are improved.

Description

technical field [0001] The present invention relates to a deep neural network compression method, in particular to a method, device, computer equipment and storage medium for compressing and accelerating a convolutional neural network through model quantization. Background technique [0002] In today's computer vision and other technical fields, deep learning has proven to be a very useful method, and has achieved good results in tasks such as image classification, object detection, and semantic segmentation. At present, with the continuous improvement of the theory, the deep neural network model has a tendency to develop in the direction of more parameters, deeper network, and greater calculation. At the same time, the industry is gradually applying deep learning technology to specific scenarios, which puts strict requirements on the size of the model, computing performance, power consumption and other indicators. [0003] In recent years, the application of neural network...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06N3/04G06N3/08

CPCG06N3/084G06N3/045

Inventor 宋利周逸伦陈立张文军

Owner SHANGHAI JIAO TONG UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Convolutional neural network quantification method and device, computer and storage medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology