Unlock instant, AI-driven research and patent intelligence for your innovation.

Deconvolution calculation method, hardware accelerator, equipment and readable storage medium

A technology for deconvolution and calculation results, applied in the field of deep neural networks, can solve problems affecting the calculation efficiency of hardware accelerators, and achieve the effect of improving calculation efficiency

Pending Publication Date: 2022-01-11
NANJING UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Calculation process see figure 1 , a large number of zeros have been inserted into the data of the original input feature map, the blank cells represent the inserted zeros, the black cells represent the data of the original input feature map, and the gray cells represent the window. It can be seen that the window currently only contains two The data in the original input feature map, and the rest are all zero. When multiplying the data in the window with the data in the deconvolution kernel, these zeros will cause many multiplication operations to be invalid operations. In the entire deconvolution calculation In the process, there will be a large number of invalid operations, which will seriously affect the computing efficiency of the hardware accelerator.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deconvolution calculation method, hardware accelerator, equipment and readable storage medium
  • Deconvolution calculation method, hardware accelerator, equipment and readable storage medium
  • Deconvolution calculation method, hardware accelerator, equipment and readable storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0086] In order to improve the reaction rate of the hardware accelerator, a method, a hardware accelerator, an apparatus, and a readable storage medium are disclosed in the following examples.

[0087] The first embodiment of the present application discloses a method of reverse volume calculation, see figure 2 The method shown is schematically showing:

[0088] Step S11, acquire a plurality of input blocks, the plurality of input blocks, for each sliding data block over, in the original input feature, wherein the original input feature is included in the input feature. Correspondingly, each of the input blocks comprise multi-layer data, the size of the sliding window is set according to the size of the reverse core and the flexible step size.

[0089] In step S12, according to the first front matrix, the data in the reverse core is transformed, and the reverse volume core matrix is ​​acquired, and the size of the first front matrix is ​​based on the size and reverse of the revers...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a deconvolution calculation method, a hardware accelerator, equipment and a readable storage medium. The method comprises the following steps: acquiring a plurality of input blocks; transforming data in the deconvolution kernel according to the first preposed matrix to obtain a deconvolution kernel matrix; transforming data in the plurality of input blocks according to the second preposed matrix to obtain a plurality of input matrixes; then multiplying the deconvolution kernel matrix by a plurality of input matrixes to obtain a plurality of intermediate matrixes; accumulating data of all layers of data matrixes in any intermediate matrix according to channels to obtain a plurality of accumulation matrixes; according to the post-matrix, respectively transforming the data in the plurality of accumulation matrixes to obtain a plurality of output blocks; and sequentially arranging the plurality of output blocks into an output feature map, and obtaining a deconvolution calculation result. In the calculation process, a large number of zeros are not inserted into the original input feature map, so that the calculation efficiency is effectively improved.

Description

Technical field [0001] The present application relates to the depth of neural network technology, particularly to a method of deconvolution calculation, a hardware accelerator, and device-readable storage medium. Background technique [0002] FIG using hardware accelerators perform a deconvolution calculation, usually inserted in a large number of zero data of the original input feature map, the extension to give a new input to the input feature map feature, and then use the same size as a window size of new nuclear deconvolution FIG input pan sliding characteristics according to a preset step length, each time the slide, and the data in the window and then multiplying covered summed data deconvolution kernel, the output summation result, the window slides over the other new after entering all the data on the characteristic diagram, the summation of all the results obtained compared with the final result of the deconvolution calculation. [0003] See calculation figure 1 , The in...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/15G06F17/16G06N3/04
CPCG06F17/153G06F17/16G06N3/045
Inventor 王中风杨培祥毛文东林军
Owner NANJING UNIV