Unlock instant, AI-driven research and patent intelligence for your innovation.

A multi-layer data sub-area joint computing method for convolutional neural network acceleration

A technology of convolutional neural network and calculation method, which is applied in the field of joint calculation of multi-layer data and sub-regions, can solve the problems of low data access bandwidth requirements, high data access bandwidth, and a large number of computing resources, so as to improve the overall computing speed , Improve the utilization rate and reduce the consumption of hardware resources

Active Publication Date: 2021-04-20
XI AN JIAOTONG UNIV
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Such characteristics will cause the following problems when hardware acceleration is performed: a large amount of computing resources are required to calculate the convolutional layer, and the data access bandwidth is not high; when the fully connected layer is calculated, high data access bandwidth is required, such an imbalance Will bring difficulties to hardware design

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A multi-layer data sub-area joint computing method for convolutional neural network acceleration
  • A multi-layer data sub-area joint computing method for convolutional neural network acceleration
  • A multi-layer data sub-area joint computing method for convolutional neural network acceleration

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] The present invention will be further described in detail below in conjunction with specific embodiments, which are explanations of the present invention rather than limitations.

[0025] In the present invention, multi-layer data sub-area joint calculation divides the input image data into different areas, and then performs acceleration calculations on these areas one by one, thereby completing the overall acceleration of the convolutional neural network. The main purpose is to overlap the data access time of the fully connected layer of the convolutional neural network and the computing time of the convolutional layer through the joint calculation of the regions, and balance the computationally intensive convolutional layer of the convolutional neural network and the data of the fully connected layer Storage intensive. Calculation mode such as figure 2 As shown, the input image of the convolutional layer is divided into regions to form multiple input image regions. ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention provides a multi-layer data sub-area joint calculation method for convolutional neural network acceleration, which includes the following steps: Step 1, divide the first fully connected layer image into several first fully connected layer areas, reverse derivation For the input image data corresponding to the image data of each first fully connected layer region, the input image is divided into several input image regions corresponding to the first fully connected layer region according to the reverse derivation result; step 2, for each The input image regions are sequentially convolved until the corresponding first fully connected layer regions. In the process of performing convolution operations on the next input image region, the weights of the first fully connected layer region corresponding to the previous input image region Value data is read and manipulated. It reduces the bandwidth and storage required by the hardware system during operation, and balances the demand for hardware resources between the convolutional layer and the fully connected layer of the convolutional neural network.

Description

technical field [0001] The invention relates to a data flow scheduling technology for convolutional neural network operations, in particular to a multi-layer data sub-area joint calculation method for convolutional neural network acceleration. Background technique [0002] Deep learning is a research hotspot in machine learning in recent years, and excellent results have been achieved in many directions. Deep learning is now playing an increasingly important role in many disciplines. However, limited by hardware devices, the operation speed of deep learning is slower than that of traditional artificial neural computing networks and some other machine learning methods. Therefore, in some fields that require high-performance computing and computer computing, the acceleration of deep learning is particularly important. For hardware acceleration of deep learning algorithms, there are currently three types of implementation methods, including multi-core CPU, GPU, and FPGA. Thei...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06N3/04
CPCG06N3/045
Inventor 杨晨张海波王逸洲王小力耿莉
Owner XI AN JIAOTONG UNIV