Compression method for deep recurrent neural network based on SVD and pruning

A technology of recurrent neural network and compression method, applied in biological neural network model, neural architecture, character and pattern recognition, etc., can solve problems such as hindering the application of deep recurrent neural network, reduce the number of parameters, increase the compression factor, reduce the The effect of the number of bits

Inactive Publication Date: 2019-12-03
SOUTH CHINA UNIV OF TECH +1
View PDF4 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, while the deep recurrent neural network has good performance, its parameter storage hinders the application of the deep recurrent neural network in embedded devices such as mobile terminals.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Compression method for deep recurrent neural network based on SVD and pruning
  • Compression method for deep recurrent neural network based on SVD and pruning
  • Compression method for deep recurrent neural network based on SVD and pruning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0031] The embodiment of the present invention is mainly used to solve the model compression problem of the deep cycle neural network. By decomposing the parameter matrix based on SVD, removing redundant network connections, clustering weights, fine-tuning clustering results, and saving weight encodings, a set of algorithms for deep recurrent neural network model compression is established. The compression framework can achieve a high compression factor while ensur...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a compression method for a deep recurrent neural network based on SVD and pruning. The compression method comprises the steps of S1, performing SVD decomposition on a recurrent neural network needing to be compressed; s2, training the network after SVD decomposition again, and removing redundant connection step by step; s3, respectively performing K-means clustering on theweights of the remaining connections of each layer of the network; s4, training the clustered network again; and S5, encoding and storing the network weight. According to the invention, SVD and redundant network connection removing methods are combined; the number of parameters of the recurrent neural network is effectively reduced, the storage amount of the parameters is greatly reduced throughfurther K-means clustering and encoding storage of a sparse matrix, training of the network is combined in the series of processes, and it is guaranteed that a large compression multiple is achieved under the condition that the influence on the network performance is not large.

Description

technical field [0001] The invention relates to the technical field of deep learning and artificial intelligence, in particular to a compression method for deep cyclic neural networks based on SVD and pruning. Background technique [0002] In recent years, due to the emergence of large-scale data and the development of hardware platforms, deep learning algorithms have achieved a series of amazing results in the field of artificial intelligence. Among them, the proposal of deep recurrent neural network effectively solves the problem of identification and detection of data with time series information, and has a wide range of applications in the fields of language translation, video detection, and online handwritten character recognition. However, while the deep recurrent neural network has good performance, its parameter storage capacity hinders the application of the deep recurrent neural network in embedded devices such as mobile terminals. Therefore, it is of great signif...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/04G06K9/62
CPCG06N3/045G06F18/23213
Inventor 杨亚锋梁凯焕肖学锋金连文周伟英孙俊
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products