Vectorization realization method for deconvolution matrix of GPDSP

An implementation method and deconvolution technology, which is applied in the field of vector processors and machine learning, can solve the problem of CNN model occupying large computing resources, and achieve the effects of avoiding data movement, low hardware cost, and improving reuse rate

Active Publication Date: 2017-07-18
NAT UNIV OF DEFENSE TECH
View PDF5 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Since the CNN model involves a large number of matrix operations, such as matrix-matrix multiplication, matrix-vector multiplication, vector-vector multiplication, matrix-matrix convolution, matrix expansion, matrix deconvolution, and various transcendental function calculations, CNN The model requires a lot of computing resources

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Vectorization realization method for deconvolution matrix of GPDSP
  • Vectorization realization method for deconvolution matrix of GPDSP
  • Vectorization realization method for deconvolution matrix of GPDSP

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0032] Suppose C=A*B, that is, the convolution of matrix A and matrix B is C, that is to say, the process of finding C from A and B is called convolution, then if you know C and A or C and B to find B or A The process is called deconvolution. Such as figure 2 Shown is a schematic diagram of a simplified structural model of the GPDSP targeted by the present invention.

[0033] Such as figure 1 and image 3 Shown, the vectorization implementation method of the deconvolution matrix facing GPDSP of the present invention, its steps are:

[0034] S1: Calculation of elements in the first n-1 rows of the deconvolution result matrix C;

[0035] S1.1 The CPU core of GPDSP allocates corresponding scalar storage space and vector storage space for the weight matrix generated in the forward propagation stage and the residual matrix in the rev...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a vectorization realization method for a deconvolution matrix of a GPDSP. A CPU core of the GPDSP distributes corresponding scalar storage space and vector storage space for a weight matrix generated at a forward propagation stage and a residual matrix at a reverse calculation stage in a convolutional neural network. The method comprises the steps that S1, a residual matrix A(m,m), a weight matrix B(n,n) and a deconvolution result matrix C(m+n-1,m+n-1) are set, wherein m is greater than n; S2, by controlling the number of cycles, elements in the first (n-1) rows of the deconvolution result matrix C are calculated first; S3, the number of cycles is fixed, and elements from the n(th) row to the m(th) row of the deconvolution result matrix C are calculated; and S4, by controlling the number of cycles, elements from the reciprocal (n-1)(th) row to the reciprocal first row of the deconvolution result matrix C are calculated. The method has the advantages that the principle is simple, operation is convenient, a vector processor can be fully utilized to complete special data calculation, total algorithm running time is shortened, and algorithm execution efficiency is improved.

Description

technical field [0001] The invention mainly relates to the fields of vector processors and machine learning, in particular to a vectorization implementation method of a GPDSP-oriented deconvolution matrix. Background technique [0002] Deep learning (DL) is an important research direction in the field of machine learning. DL simulates the hierarchical perception of the human brain by constructing a multilayer perceptron (MLP). MLP can express attribute categories or high-level abstract features by combining low-level features, thus becoming the focus of current research in the field of object recognition. [0003] The classic DL models mainly include Auto Encoder (AE), Deep Belief Networks (DBNs) and Convolutional Neural Networks (CNN). Generally speaking, the above model mainly extracts features from the input image through the encoder, and transforms the image from the bottom layer to the high-level feature space. Correspondingly, the decoder is used to reconstruct the fe...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/15G06F17/16
CPCG06F17/153G06F17/16
Inventor 郭阳张军阳扈啸王慧丽胡敏慧王子聪
Owner NAT UNIV OF DEFENSE TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products