A partial binary convolution method suitable for embedded devices

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
An embedded device and convolution method technology, applied in the direction of neural architecture, biological neural network model, etc., can solve the problems of decreased accuracy, unconsidered importance, etc., and achieve the effect of reducing calculations

Active Publication Date: 2018-12-25

CHONGQING UNIV

View PDF13 Cites 14 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

This is mainly because: Binary quantization is performed uniformly for all convolution kernels without considering the different importance of different convolution kernels. If important convolution kernels are binary quantized, it may lead to a decrease in accuracy.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0022] Below in conjunction with accompanying drawing and embodiment the present invention will be further described:

[0023] The information processing process of the present invention is as figure 1 as shown, figure 1 Contains 4 subgraphs: convolution kernel grouping, convolution kernel rearrangement, channel rearrangement and model fine-tuning.

[0024] 1. The convolution kernel grouping subgraph is step 1 of the present invention, that is, for each convolution layer of a given depth convolutional neural network, the importance of each convolution kernel is calculated according to the convolution kernel importance calculation formula; And sort the convolution kernels of each layer according to the importance, set the convolution kernel importance threshold, and divide the convolution kernels larger than the threshold and smaller than the threshold into two groups. N in the figure c is the number of channels of the input feature map, h i / w i is the height and width of...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a partial binary convolution method suitable for embedded equipment, belonging to a model compression method of depth learning. The method comprises the following steps: 1. foreach convolution layer of a given CNN, measuring the importance of each convolution core according to the statistics of each output characteristic map; the convolution kernels of each layer being divided into two groups. 2, rearrangeing that two groups of convolution core on the storage space so that the storage positions of the same group of convolution cores are adjacent to each other, recording the rearrange order to generate a new convolution layer; 3, changing the channel order of the convolution nucleus of the next convolution layer according to the rearrangement order of the step 2; Step 4, according to the CNN processed in the above steps, fine-tuning training being carried out, binary quantization being carried out on the convolution layer divided into non-important layers, and the accuracy of the whole network being gradually recovered through the iterative operation of quantization and training. The invention ensures the accuracy of the given CNN on the large data set, andreduces the calculation and storage overhead of the convolution neural network on the embedded equipment.

Description

technical field [0001] The invention belongs to the field of model compression of deep learning, and in particular relates to a partial binary convolution method suitable for embedded devices. Background technique [0002] With the rapid development of deep convolutional neural networks (CNNs), CNNs have become state-of-the-art in many task domains, such as image classification, object recognition, and semantic segmentation. Traditionally, a data center equipped with a high-end image processor (GPU) is the best choice for deploying CNN applications, but this cloud-centric application framework usually causes some problems, such as user privacy issues, long response time, These apps cannot even be used without the internet. Therefore, people started to deploy CNN directly on embedded devices. [0003] CNN models often require a lot of computing resources and storage resources. For example, AlexNet with 5 convolutional layers requires 722M floating point number operations (F...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06N3/04

CPCG06N3/045

Inventor 刘铎凌英剑梁靓

Owner CHONGQING UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

A partial binary convolution method suitable for embedded devices

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology