Unlock instant, AI-driven research and patent intelligence for your innovation.

Quantification method, device, electronic equipment and medium of a deep neural network

A technology of deep neural network and quantization method, which is applied in the field of device, electronic equipment and computer readable storage medium, and quantization method of deep neural network, which can solve the problem of not considering redundant information, increasing the complexity and speed of calculation processing, etc. problem, achieve the effect of speeding up network computing and reducing the amount of data processing

Active Publication Date: 2022-03-22
INSPUR BEIJING ELECTRONICS INFORMATION IND
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, in the prior art, when the deep neural network is quantized after training, it does not take into account the redundant information in the network structure itself generated by training. These redundant information have little influence on the final inference results, but increase Increased computational complexity and speed

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Quantification method, device, electronic equipment and medium of a deep neural network
  • Quantification method, device, electronic equipment and medium of a deep neural network
  • Quantification method, device, electronic equipment and medium of a deep neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0056] The core of the present application is to provide a quantization method, device, electronic equipment and computer-readable storage medium of a deep neural network, so as to effectively simplify the processing speed and calculation amount of quantization after training on the basis of ensuring the accuracy of the deep neural network.

[0057] In order to describe the technical solutions in the embodiments of the present application more clearly and completely, the technical solutions in the embodiments of the present application will be introduced below in conjunction with the drawings in the embodiments of the present application. Apparently, the described embodiments are only some of the embodiments of this application, not all of them. Based on the embodiments in this application, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the scope of protection of this application.

[0058] At present, with the c...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present application discloses a quantization method, device, electronic device and computer-readable storage medium of a deep neural network. The method includes: obtaining the floating-point model of the deep neural network generated by pre-training; calculating each quantization unit in the deep neural network The importance evaluation value of ; the type of quantization unit includes the weight parameter channel and / or hidden layer of the deep neural network, the weight parameter channel is used as the quantization unit for weight parameter quantization, and the hidden layer is used as the quantization unit for activation Output quantization; determine the quantization digits corresponding to each quantization unit; the quantization digits and the importance evaluation value of each quantization unit change positively; quantize the floating-point model according to the quantization digits. This application sets the number of quantization digits related to the importance of information for different weight parameter channels and / or hidden layers, and maximizes quantization compression while ensuring accuracy, reducing data processing and speeding up calculations.

Description

technical field [0001] The present application relates to the technical field of artificial intelligence, and in particular to a quantization method, device, electronic equipment and computer-readable storage medium of a deep neural network. Background technique [0002] With the continuous development of artificial intelligence technology, artificial intelligence technology has gradually been applied in our life. In the field of artificial intelligence technology, deep learning is one of the more typical technologies. Although the capabilities of deep neural networks in image classification and detection are close to or surpass those of humans, in actual deployment, there are still problems such as large models and high computational complexity, which require high hardware costs. In practical applications, neural networks are often deployed on some terminal devices or edge devices, but terminal devices or edge devices generally have low computing power, and memory and powe...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06N3/04G06N3/08
CPCG06N3/08G06N3/045
Inventor 梁玲燕董刚赵雅倩
Owner INSPUR BEIJING ELECTRONICS INFORMATION IND