Neural network accelerator for bit width partitioning and implementation method of neural network accelerator

A technology of neural network and implementation method, which is applied in the field of computer vision, can solve the problem of sacrificing hardware area indicators, and achieve the effects of increasing DSP utilization, increasing resource utilization efficiency, and improving computing throughput
CN107451659AActive Publication Date: 2017-12-08TSINGHUA UNIV

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Applications(China)
Current Assignee / Owner
TSINGHUA UNIV
Publication Date
2017-12-08

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The present invention provides a neural network accelerator for bit width partitioning and an implementation method of the neural network accelerator. The neural network accelerator includes a plurality of computing and processing units with different bit widths, input buffers, weight buffers, output buffers, data shifters and an off-chip memory; each of the computing and processing units obtains data from the corresponding input buffering area and weight buffer, and performs parallel processing on data of a neural network layer having a bit width consistent with the bit width of the corresponding computing and processing unit; the data shifters are used for converting the bit width of data outputted by the current computing and processing unit into a bit width consistent with the bit width of a next computing and processing unit corresponding to the current computing and processing unit; and the off-chip memory is used for storing data which have not been processed and have been processed by the computing and processing units. With the neural network accelerator for bit width partitioning and the implementation method of the neural network accelerator of the invention adopted, multiply-accumulate operation can be performed on a plurality of short-bit width data, so that the utilization rate of a DSP can be increased; and the computing and processing units (CP) with different bit widths are adopted to perform parallel computation of each layer of a neural network, and therefore, the computing throughput of the accelerator can be improved.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The invention relates to computer vision technology, in particular to a neural network accelerator for bit width partitioning and its implementation method. Background technique

[0002] In recent years, convolutional neural networks have made great progress in many computer vision applications, such as image recognition, video classification, gesture recognition, etc. Recently, with the introduction of more and more convolutional neural network (CNN) algorithms, the recognition accuracy of CNN has been greatly improved, and the recognition accuracy reached 96.4% at the 2015 ILSVRC conference. However, in order to achieve such a high recognition accuracy, CNN needs to introduce very complex algorithms, which will increase the overhead of hardware computing resources and greatly limit the performance improvement of CNN at the hardware level. Therefore, many acceleration methods for CNN have been proposed to increase the computing performance of CNN, su...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More