Unlock instant, AI-driven research and patent intelligence for your innovation.

A Deep Neural Network Compression Method Based on Block Item Tensor Decomposition

A technology of deep neural network and tensor decomposition, which is applied in neural learning methods, biological neural network models, neural architectures, etc., can solve the problem that deep neural networks are difficult to obtain classification accuracy, and achieve small memory usage, compressed parameter volume, and training The effect of time reduction

Inactive Publication Date: 2020-09-08
UNIV OF ELECTRONICS SCI & TECH OF CHINA
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, due to the characteristics of "asymmetry" and "linear expressiveness" of the tensor train decomposition method itself, it is difficult for the compressed deep neural network to obtain higher classification accuracy.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Deep Neural Network Compression Method Based on Block Item Tensor Decomposition
  • A Deep Neural Network Compression Method Based on Block Item Tensor Decomposition
  • A Deep Neural Network Compression Method Based on Block Item Tensor Decomposition

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0024] Such as figure 1 Shown is a schematic flow chart of the deep neural network compression method based on block item tensor decomposition of the present invention. A deep neural network compression method based on block item tensor decomposition, comprising the following steps:

[0025] A. Obtain the deep neural network framework;

[0026] B. Transform the weight matrix W and input vector x in the fully connected layer of the deep neural network into high-order tensors W and higher order tensors X ;

[0027] C. For high-order tensors in step B W Perform block item tensor decomposition pr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a deep neural network compression method based on block item tensor decomposition. It includes obtaining the deep neural network framework, converting the weight matrix W and the input vector x into high-order tensors W and higher order tensors X , for higher-order tensors W Perform block item tensor decomposition processing, replace the fully connected layer of the deep neural network with the block item tensor layer, and use the back propagation algorithm to train the replaced deep neural network in step D. The present invention adopts the block item tensor decomposition method to construct the block item tensor layer to replace the fully connected layer in the original deep neural network, and utilizes the characteristics of "symmetry" and "exponential expression ability" of the block item tensor layer, not only The parameters of the fully connected layer can be greatly compressed and the classification accuracy of the original network can be maintained.

Description

technical field [0001] The invention belongs to the technical field of deep neural networks, and in particular relates to a deep neural network compression method based on block item tensor decomposition. Background technique [0002] In recent years, deep learning represented by deep neural network has achieved significantly better results than traditional methods in artificial intelligence fields such as image classification, speech recognition, and natural language processing, and has attracted widespread attention from academia, industry, and the government. However, the deep neural network has a complex structure, a large number of layers, and a huge amount of parameters. The trained model often takes up a large storage space and is difficult to port to a small terminal; and the time complexity is also high, even on a high-performance graphics processor. Online training often takes days, which forces us to compress deep neural networks. [0003] Since the weights of de...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06N3/08G06N3/04
CPCG06N3/04G06N3/082
Inventor 徐增林李广西叶锦棉陈迪
Owner UNIV OF ELECTRONICS SCI & TECH OF CHINA