A Deep Neural Network Compression Method Based on Block Item Tensor Decomposition

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of deep neural network and tensor decomposition, which is applied in neural learning methods, biological neural network models, neural architectures, etc., can solve the problem that deep neural networks are difficult to obtain classification accuracy, and achieve small memory usage, compressed parameter volume, and training The effect of time reduction

Inactive Publication Date: 2020-09-08

UNIV OF ELECTRONICS SCI & TECH OF CHINA

View PDF6 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, due to the characteristics of "asymmetry" and "linear expressiveness" of the tensor train decomposition method itself, it is difficult for the compressed deep neural network to obtain higher classification accuracy.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0023] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0024] Such as figure 1 Shown is a schematic flow chart of the deep neural network compression method based on block item tensor decomposition of the present invention. A deep neural network compression method based on block item tensor decomposition, comprising the following steps:

[0025] A. Obtain the deep neural network framework;

[0026] B. Transform the weight matrix W and input vector x in the fully connected layer of the deep neural network into high-order tensors W and higher order tensors X ;

[0027] C. For high-order tensors in step B W Perform block item tensor decomposition pr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a deep neural network compression method based on block item tensor decomposition. It includes obtaining the deep neural network framework, converting the weight matrix W and the input vector x into high-order tensors W and higher order tensors X , for higher-order tensors W Perform block item tensor decomposition processing, replace the fully connected layer of the deep neural network with the block item tensor layer, and use the back propagation algorithm to train the replaced deep neural network in step D. The present invention adopts the block item tensor decomposition method to construct the block item tensor layer to replace the fully connected layer in the original deep neural network, and utilizes the characteristics of "symmetry" and "exponential expression ability" of the block item tensor layer, not only The parameters of the fully connected layer can be greatly compressed and the classification accuracy of the original network can be maintained.

Description

technical field [0001] The invention belongs to the technical field of deep neural networks, and in particular relates to a deep neural network compression method based on block item tensor decomposition. Background technique [0002] In recent years, deep learning represented by deep neural network has achieved significantly better results than traditional methods in artificial intelligence fields such as image classification, speech recognition, and natural language processing, and has attracted widespread attention from academia, industry, and the government. However, the deep neural network has a complex structure, a large number of layers, and a huge amount of parameters. The trained model often takes up a large storage space and is difficult to port to a small terminal; and the time complexity is also high, even on a high-performance graphics processor. Online training often takes days, which forces us to compress deep neural networks. [0003] Since the weights of de...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G06N3/08G06N3/04

CPCG06N3/04G06N3/082

Inventor 徐增林李广西叶锦棉陈迪

Owner UNIV OF ELECTRONICS SCI & TECH OF CHINA

A Deep Neural Network Compression Method Based on Block Item Tensor Decomposition

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology