Deep neural network compression method based on block item tensor decomposition

A deep neural network and tensor decomposition technology, applied in neural learning methods, biological neural network models, neural architectures, etc., can solve problems such as difficulty in obtaining classification accuracy for deep neural networks

Inactive Publication Date: 2018-04-20
UNIV OF ELECTRONICS SCI & TECH OF CHINA
View PDF6 Cites 24 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, due to the characteristics of "asymmetry" and "linear expressiveness" of the tensor train decomposition method itself, it is difficult for the compressed deep neural network to obtain higher classification accuracy.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deep neural network compression method based on block item tensor decomposition
  • Deep neural network compression method based on block item tensor decomposition
  • Deep neural network compression method based on block item tensor decomposition

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0024] Such as figure 1 Shown is a schematic flow chart of the deep neural network compression method based on block item tensor decomposition of the present invention. A deep neural network compression method based on block item tensor decomposition, comprising the following steps:

[0025] A. Obtain the deep neural network framework;

[0026] B. Transform the weight matrix W and input vector x in the fully connected layer of the deep neural network into high-order tensors W and higher order tensors X ;

[0027] C. For high-order tensors in step B W Perform block item tensor decomposition pr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention discloses a deep neural network compression method based on block item tensor decomposition. The method comprises the steps of: obtaining a framework of a deep neural network, respectively converting a weight matrix W and an input vector x to a high-order tensor (u)W(/u) and a high-order tensor (u)X(/u), performing block item tensor decomposition processing of the high-ordertensor (u)W(/u), replacing a full-connection layer of the deep neural network with a block item tensor layer, and employing a back-propagating algorithm to perform training of the deep neural networkreplaced in the step D. The deep neural network compression method based on the block item tensor decomposition employs a block item tensor decomposition method to construct the block item tensor layer to replace the full- connection layer of the original deep neural network, the features of symmetry and index expression capability of the block item tensor layer are employed to greatly compress the parameter amount of the full-connection layer and maintain the classification precision of the original network.

Description

technical field [0001] The invention belongs to the technical field of deep neural networks, and in particular relates to a deep neural network compression method based on block item tensor decomposition. Background technique [0002] In recent years, deep learning represented by deep neural network has achieved significantly better results than traditional methods in artificial intelligence fields such as image classification, speech recognition, and natural language processing, and has attracted widespread attention from academia, industry, and the government. However, the deep neural network has a complex structure, a large number of layers, and a huge amount of parameters. The trained model often takes up a large storage space and is difficult to port to a small terminal; and the time complexity is also high, even on a high-performance graphics processor. Online training often takes days, which forces us to compress deep neural networks. [0003] Since the weights of de...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/08G06N3/04
CPCG06N3/04G06N3/082
Inventor 徐增林李广西叶锦棉陈迪
Owner UNIV OF ELECTRONICS SCI & TECH OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products