Deep network compression method based on dimensional adaptation Tucker decomposition

A deep network and compression method technology, applied in neural learning methods, biological neural network models, neural architectures, etc., can solve problems such as network compression, waste of storage space, etc., and achieve high compression ratio, effective utilization, The effect of effective storage space

Active Publication Date: 2017-12-26
PEKING UNIV
View PDF4 Cites 36 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Since the parameters are sparse, the problem with this method is that it needs to record the coordinate guidance of non-zero elements, which will waste a large part of storage space
The problem with this method is that tensor decomposition can only solve fixed-order tensors, and the form of decomposition is fixed. Tensor decomposition is mainly achieved by adding more layers; although this method has achieved good results in network acceleration effect, but due to the limitation of decomposition, the original tensor decomposition method cannot solve the problem of network compression very well

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deep network compression method based on dimensional adaptation Tucker decomposition
  • Deep network compression method based on dimensional adaptation Tucker decomposition
  • Deep network compression method based on dimensional adaptation Tucker decomposition

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0069] Below in conjunction with accompanying drawing, further describe the present invention through embodiment, but do not limit the scope of the present invention in any way.

[0070] The present invention provides a deep network compression method based on dimension adaptive Tucker decomposition, Figure 7 It is a block flow diagram of the deep network compression method of the Tucker decomposition of dimension adaptive adjustment; the flow chart of the deep network compression method of the Tucker decomposition of shared dimension adaptive adjustment is similar to it; the method of the present invention is based on the Tucker decomposition of tensor, aimed at deep network , by adapting and adjusting the size of each dimension of the tensor, a new tensor of any order is generated, and then the tensor decomposition is realized through the learnable kernel tensor and transfer matrix, including the dimension adaptive adjustment process and the weight tensor of dimension adapta...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a deep network compression method based on dimensional adaptation adjusting Tucker decomposition. The method comprises a dimensional adaptation adjusting process and a dimensional adaptation weight tensor decomposition process. Through adaptively adjusting a size of each dimensions of a tensor, a new any order tensor is generated. Through a learnable nuclear tensor and a transfer matrix, tensor decomposition is realized so that a purpose of network optimization compression is reached. Compared to an existing low rank compression method, by using the method of the invention, under the condition that network performance is maintained, a network parameter quantity has a greater compression multiple and a higher compression multiple can be acquired. Simultaneously, a guiding position of a nonzero element does not need to be stored, an index does not need to be recorded and a storage space can be effectively used.

Description

technical field [0001] The invention relates to the technical field of deep learning network optimization compression, in particular to a deep network compression method based on dimension adaptive Tucker decomposition. Background technique [0002] Due to the rapid development of image processor (GPU) computing power in recent years and the increasing amount of data that people can obtain, deep convolutional networks have achieved significant results in the fields of computer vision and natural language processing. According to the development history of the neural network structure (documents [1]~[5]), there are two major trends in the current network structure: the number of layers of the network is getting deeper and deeper, and the convolutional layer is becoming more and more important. While these deeper networks can achieve good results on some problems, their computational cost and storage cost are not a small problem. [0003] Due to the limitations of certain con...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/04G06N3/08
CPCG06N3/084G06N3/045
Inventor 林宙辰钟之声尉方音
Owner PEKING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products