Efficient full integer quantization method for image detection model

An image detection and model technology, applied in the field of neural networks, can solve problems such as cumbersome calculation process and insufficient use of symmetry

Inactive Publication Date: 2021-03-16
JIANGNAN INST OF COMPUTING TECH
View PDF3 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, it does not take full advantage of the symmetry in the case of symmetric quantization, making the calculation process cumbersome

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Efficient full integer quantization method for image detection model
  • Efficient full integer quantization method for image detection model
  • Efficient full integer quantization method for image detection model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0048] Embodiment: The present invention provides a high-efficiency full-integer quantization method for an image detection model. According to the process of convolution calculation of input tensor and convolution kernel weight, the calculation formula of full-integer quantization is derived, and the scale factor is shifted to the left by The bit operation becomes a large integer. After the multiplication of the large integer is completed, it is moved back to the right. The network weights, activation values ​​and convolution bias are all represented by integers. The calculation process only involves integer operations, and further uses the symmetrical quantization Features, simplifies the quantitative calculation formula, and obtains a model quantification method with efficient calculation and good model performance, which can be better applied to computing devices such as FPGA;

[0049] The weights, offsets, input feature maps and output feature maps of each layer of convolu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an efficient full-integer quantization method for a target detection model. The weight, bias, input feature map and output feature map of each convolution layer in an image detection model are all represented through integers, wherein the integer calculation is carried out in a quantization reasoning process. The method specifically comprises the following steps: carrying out normal training, quantitative perception training and quantitative parameter generation on an image detection model of a real number version, and carrying out reasoning based on full integer operation on computing equipment by applying generated parameters of each layer. The method can greatly reduce the reasoning time of the image detection model, reduces the space of the model in the aspectsof disk storage and memory occupation, maintains the high detection precision of the image detection model, and facilitates the implementation of a more efficient image target detection system on FPGAand other computing devices.

Description

technical field [0001] The invention relates to a high-efficiency full-integer quantization method for an image detection model, which belongs to the technical field of neural networks. Background technique [0002] In recent years, the convolutional neural network has made great progress in many visual tasks such as image target recognition, target detection, and pixel segmentation. It has achieved performance beyond human performance in many tasks, and many well-known network structures have emerged. Such as AlexNet, VGGNet, GoogleNet, ResNet, DenseNet, SENet, etc. The development of these convolutional networks is characterized by an increasing number of network layers and increasing capacity, resulting in an increase in network parameters and calculations, which is not conducive to deployment and application on small mobile devices with limited resources. For example, hardware platforms such as smartphones and drones hope that the model should be as small as possible wh...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06N3/04G06N3/08
CPCG06T7/0002G06N3/08G06N3/045G06F18/214
Inventor 曾明勇刘茵张昆钱磊尚江卫王相钧
Owner JIANGNAN INST OF COMPUTING TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products