Deep convolutional neural network-based inter-layer non-uniform K-means clustering fixed-point quantification method

A deep convolution and neural network technology, applied in neural learning methods, biological neural network models, neural architectures, etc., can solve the problems of excessive storage overhead and reduction, and achieve the effect of reducing storage overhead and wide application prospects

Pending Publication Date: 2017-06-27
南京风兴科技有限公司
View PDF8 Cites 23 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Purpose of the invention: the technical problem to be solved by the present invention is to provide a non-uniform K-average clustering fixed-point quantization method based on deep convolutional neural network layers for the problem of excessive feature map storage overhead in deep convolutional neural networks, Thus, the storage overhead can be greatly reduced while maintaining the accuracy of the model

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deep convolutional neural network-based inter-layer non-uniform K-means clustering fixed-point quantification method
  • Deep convolutional neural network-based inter-layer non-uniform K-means clustering fixed-point quantification method
  • Deep convolutional neural network-based inter-layer non-uniform K-means clustering fixed-point quantification method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] The core idea of ​​the present invention is to use the redundancy of the feature map of the deep convolutional network, use the non-uniform K-average clustering fixed-point quantization method in the layer to fix the feature map, reduce the storage requirement by storing index values ​​and use retraining Fine-tune the model to compensate for the error caused by fixed-pointing.

[0033] The invention discloses a non-uniform K-average clustering fixed-point quantization method in a layer based on a deep convolutional neural network, comprising the following steps:

[0034] Step 1: Forward test the existing deep convolutional neural network. Since the feature maps of correct samples are often more representative of the distribution of feature maps of most samples, samples that can be correctly identified are selected. And extract the feature maps generated during the recognition process.

[0035] Step 2: Carry out non-regular fixed-point quantization between layers on the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a deep convolutional neural network-based inter-layer non-uniform K-means clustering fixed-point quantification method. The method includes the following steps that: step 1, a part of images of a deep convolutional neural network which can be correctly identified are selected, and feature mappings (Feature Map) generated in an identification process are extracted; step 2, inter-layer non-regular quantification is performed on the feature mappings in the deep convolutional neural network, and the maximum number of quantification bits of each layer of the convolutional network is determined with the precision of the model maintained; step 3, for each convolutional layer in the model, a K-means clustering algorithm is used to determine fixed-point values satisfying feature mapping distribution, the range of the fixed-point values is made to be located in a range which can be expressed by the maximum number of quantification bits, and the fixed-point values are adopted to represent values in the feature mappings and are stored in the form of indexes; and step 4, a neural network model fine tuning method is adopted perform fine tuning on the model, so that error caused by the quantification can be eliminated. With the inter-layer non-uniform K-means clustering fixed-point quantification method adopted, the storage cost of the feature mappings of the deep convolutional neural network can be greatly reduced with the precision of the model maintained. The method is innovative.

Description

technical field [0001] The invention relates to the field of deep learning model compression, in particular to the fixed-point field of deep convolutional neural networks oriented to embedded systems. Background technique [0002] With the rapid development of artificial intelligence, more and more applications designed with deep learning-led algorithms have appeared in people's lives, work and entertainment. However, a deep neural network is often composed of dozens or even hundreds of convolutional layers, and the feature maps generated during the calculation process need to occupy a large amount of storage space. This means that the product area is greatly increased for embedded applications. Therefore, it is of great significance to study the fixed-point compression problem of deep convolutional neural networks to reduce the storage overhead of feature maps and improve the practical value of deep learning. [0003] The fixed-point quantization of feature maps in the cu...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06N3/04G06N3/063G06N3/08
CPCG06N3/063G06N3/08G06N3/045G06F18/23213
Inventor 王中风孙方轩林军
Owner 南京风兴科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products