Unlock instant, AI-driven research and patent intelligence for your innovation.

A convolutional neural network computing device and method

A convolutional neural network and computing device technology, applied in the field of integrated circuits and artificial intelligence, can solve the problems of high cost of integrated circuit implementation, large consumption, and large circuit power consumption

Active Publication Date: 2021-05-14
BEIJING JIAOTONG UNIV
View PDF10 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The core of the convolution operation is the multiply-accumulate operation. Therefore, the existing convolutional neural network hardware acceleration circuits adopt a single multiply-accumulate array structure to accelerate the multiply-accumulate operation. This multiply-accumulate array is composed of thousands of identical multiply-accumulate units. , each multiplying and accumulating unit needs a multiplier circuit, and the hardware acceleration circuit of this structure often needs to consume a lot of hardware resources to realize thousands of multiplier circuits, resulting in the disadvantages of high integrated circuit implementation cost and large circuit power consumption.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A convolutional neural network computing device and method
  • A convolutional neural network computing device and method
  • A convolutional neural network computing device and method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0012] The present invention will be further described below in conjunction with a set of embodiments and accompanying drawings.

[0013] This embodiment discloses a convolutional neural network computing device, which is used to accelerate the core computing method of the convolutional neural network—convolution operation. Such as figure 1 Shown is a schematic diagram of the principle of convolution operation. The input of the convolution operation is called the input neural network feature map 101, and the input neural network feature map pixel value 1011 (ie, an element of the three-dimensional matrix) is represented by F in (r, c, n) means, where (r, c, n) is the logical index 1012 of the feature map of the convolutional neural network, and 0≤rout (r', c', m), where 0≤r'<R', 0≤c'<C'0≤m'<M, is calculated by the convolution operation defined by the following mathematical expression:

[0014]

[0015] Among them, s is the sliding displacement of the convolution operation...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a convolutional neural network computing device and method. The device includes a neural network model buffer for caching and encoding the convolutional neural network model; a neural network model decoder for reading the encoded model and decoding the model to obtain the model The logical index and control information of the weight; the input neural network feature map buffer is used to cache the pixel value of the input feature map; the feature map storage controller is used to read the feature map pixel value data according to the feature map logical index; the accumulator array is used for Add the input feature map pixel values ​​multiplied by the same neural network model weight value, and generate a temporary accumulation result; the pipeline buffer is used to cache the temporary accumulation result; the multiplication accumulator array is used to compare the temporary accumulation result with the corresponding The weight values ​​of the neural network model are multiplied, and the multiplication results belonging to the current convolution operation are added to generate the pixel value of the output feature map; the output neural network feature map buffer is used to cache the pixel value of the output feature map.

Description

technical field [0001] The present invention relates to the fields of integrated circuits and artificial intelligence, and more specifically relates to a device for accelerating convolutional neural network reasoning operations and a corresponding neural network model encoding method. Background technique [0002] With the rise of deep learning technology, convolutional neural networks are widely used in various fields, such as computer vision, image processing, speech recognition, autonomous robots, driverless cars, etc. As the core algorithm of deep learning technology, convolutional neural network has many advantages such as high reasoning accuracy and strong fault tolerance; however, it also has problems such as huge amount of calculation and consumption of system storage resources. The convolution operation in the convolutional neural network usually consumes more than 90% of the running time of the algorithm. Therefore, in order to realize the real-time calculation of ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06N3/04G06N3/063
CPCG06N3/063G06N3/045
Inventor 王东
Owner BEIJING JIAOTONG UNIV