A partial binary convolution method suitable for embedded devices

An embedded device and convolution method technology, applied in the direction of neural architecture, biological neural network model, etc., can solve the problems of decreased accuracy, unconsidered importance, etc., and achieve the effect of reducing calculations

Active Publication Date: 2018-12-25
CHONGQING UNIV
View PDF13 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This is mainly because: Binary quantization is performed uniformly for all convolution kernels without considering the different i...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A partial binary convolution method suitable for embedded devices
  • A partial binary convolution method suitable for embedded devices
  • A partial binary convolution method suitable for embedded devices

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] Below in conjunction with accompanying drawing and embodiment the present invention will be further described:

[0023] The information processing process of the present invention is as figure 1 as shown, figure 1 Contains 4 subgraphs: convolution kernel grouping, convolution kernel rearrangement, channel rearrangement and model fine-tuning.

[0024] 1. The convolution kernel grouping subgraph is step 1 of the present invention, that is, for each convolution layer of a given depth convolutional neural network, the importance of each convolution kernel is calculated according to the convolution kernel importance calculation formula; And sort the convolution kernels of each layer according to the importance, set the convolution kernel importance threshold, and divide the convolution kernels larger than the threshold and smaller than the threshold into two groups. N in the figure c is the number of channels of the input feature map, h i / w i is the height and width of...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a partial binary convolution method suitable for embedded equipment, belonging to a model compression method of depth learning. The method comprises the following steps: 1. foreach convolution layer of a given CNN, measuring the importance of each convolution core according to the statistics of each output characteristic map; the convolution kernels of each layer being divided into two groups. 2, rearrangeing that two groups of convolution core on the storage space so that the storage positions of the same group of convolution cores are adjacent to each other, recording the rearrange order to generate a new convolution layer; 3, changing the channel order of the convolution nucleus of the next convolution layer according to the rearrangement order of the step 2; Step 4, according to the CNN processed in the above steps, fine-tuning training being carried out, binary quantization being carried out on the convolution layer divided into non-important layers, and the accuracy of the whole network being gradually recovered through the iterative operation of quantization and training. The invention ensures the accuracy of the given CNN on the large data set, andreduces the calculation and storage overhead of the convolution neural network on the embedded equipment.

Description

technical field [0001] The invention belongs to the field of model compression of deep learning, and in particular relates to a partial binary convolution method suitable for embedded devices. Background technique [0002] With the rapid development of deep convolutional neural networks (CNNs), CNNs have become state-of-the-art in many task domains, such as image classification, object recognition, and semantic segmentation. Traditionally, a data center equipped with a high-end image processor (GPU) is the best choice for deploying CNN applications, but this cloud-centric application framework usually causes some problems, such as user privacy issues, long response time, These apps cannot even be used without the internet. Therefore, people started to deploy CNN directly on embedded devices. [0003] CNN models often require a lot of computing resources and storage resources. For example, AlexNet with 5 convolutional layers requires 722M floating point number operations (F...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06N3/04
CPCG06N3/045
Inventor 刘铎凌英剑梁靓
Owner CHONGQING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products