Universal computing circuit of neural network accelerator

A general-purpose computing and neural network technology, which is applied in the field of general-purpose computing module circuits, can solve problems such as poor flexibility, long hardware development cycle of convolutional neural network, and platform limitations, so as to reduce complexity, shorten reasoning time, and simplify The effect of the circuit design scheme

Active Publication Date: 2020-02-18
HEFEI UNIV OF TECH
View PDF5 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Although the current FPGA platform can realize the development of convolutional neural network, the platform also has limitations:
[0004] 1) When the FPGA platform uses the hardware description language to develop the convolutional neural network, it lacks modular hardware circuit design, cumbersome debugging, and the hardware development cycle of the convolutional neural network is long;
[0005] 2) Since the traditional development method of FPGA is to use the hardware description language to describe the circuit behavior, when designing and building the neural network, it is necessary to consider various variables of the built neural network, such as the size of the convolution kernel, the number of feature maps, and the number of convolutional layers , the number of fully connected layers and network output categories, etc., the hardware design of basic components such as convolutional layers, pooling layers, fully connected layers, and activation function layers is relatively rigid and has poor flexibility. If some variables of the neural network change, it is necessary to Reuse the hardware description language to describe the underlying circuit behavior, which has poor versatility;

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Universal computing circuit of neural network accelerator
  • Universal computing circuit of neural network accelerator
  • Universal computing circuit of neural network accelerator

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] In this example, if figure 1 As shown, a general computing circuit of a neural network accelerator is composed of m general computing modules PE, wherein any general computing module PE is composed of RAM, 2 n A multiplier, an adder tree, a cascaded adder, an offset adder, a first-in-first-out queue, and a ReLu activation function module; in this embodiment, n=2, thus obtaining 4 multipliers

[0041] In the current cycle, the 4 multipliers obtain the stored weight data from the RAM, and receive and process the calculated data input from the outside, and then pass the 4 products in the current cycle to the adder tree;

[0042] The adder tree accumulates the 4 products in the current cycle, and after obtaining the accumulated sum in the current cycle, it is stored in the first-in-first-out queue;

[0043] The first-in-first-out queue will accumulate and read the current cycle and pass it to the cascaded adder, and the first-in-first-out queue will play the role of cachin...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a general calculation module circuit of a neural network accelerator. The general calculation module circuit is composed of m universal computing modules PE, any ith universalcomputing module PE is composed of an RAM, 2n multipliers, an adder tree, a cascade adder, a bias adder, a first-in first-out queue and a ReLu activation function module. The single PE convolution configuration, the cascaded PE convolution configuration, the single PE full-connection configuration graph and the cascaded PE full-connection configuration are utilized to respectively construct calculation circuits of different neural networks. According to the invention, the universal computing circuit can be configured according to the variable of the neural network accelerator, so that the neural network can be built or modified more simply, conveniently and quickly, the inference time of the neural network is shortened, and the hardware development time of related deep research is reduced.

Description

technical field [0001] The invention belongs to the field programmable gate array (Field Programmable Gate Array, FPGA) design technical field of an integrated circuit, specifically a general computing module circuit of a neural network accelerator. Background technique [0002] In 2012, AlexNet won the championship of the large-scale visual recognition challenge, and the deep neural network has become a research hotspot again. Among them, the research on the convolutional neural network has received more and more attention, and has been widely used in digital video surveillance and face recognition. , image classification and other fields. The learning process of the convolutional neural network will use a large number of iterative operations and data reading. Because the CPU has a limited number of cores, it cannot make full use of the parallelism existing in the neural network. In order to improve the calculation speed of convolutional neural network, researchers have pr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/063G06N3/04
CPCG06N3/063G06N3/045
Inventor 杜高明任宇翔曹红芳张多利田超宋宇鲲李桢旻
Owner HEFEI UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products