Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Convolutional neural network reasoning accelerator and acceleration method

A convolutional neural network and accelerator technology, applied in the field of hardware implementation of artificial intelligence algorithms, can solve problems such as slow calculation speed, poor configurability, and high power consumption, and achieve high parallelism, guaranteed accuracy, and reduced power consumption.

Pending Publication Date: 2020-07-10
南京宁麒智能计算芯片研究院有限公司
View PDF1 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] Aiming at the problems of slow calculation speed, high power consumption, poor configurability and fixed-point accuracy existing in the prior art, the present invention provides a convolutional neural network reasoning accelerator and an acceleration method, which can increase the calculation speed, reduce power consumption, improve The configurability of the device ensures the accuracy of the calculation results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Convolutional neural network reasoning accelerator and acceleration method
  • Convolutional neural network reasoning accelerator and acceleration method
  • Convolutional neural network reasoning accelerator and acceleration method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] According to the analysis of the convolution operation rules, it can be seen that the main operator of convolution has a large number of computing parallelism brought by data sharding. For example, multiple convolution kernels are independent and concurrent, and multiple input images are also Independent and concurrent, which provides ideas for hardware design.

[0043] Under the limitation of hardware resources and cost, to make full use of the parallelism of convolution operation, first of all, it must meet the requirements of high performance and low power consumption; secondly, it must meet the requirements of configurable algorithms and parameters, and improve scalability to meet different application scenarios. ; Finally, it is also necessary to improve the calculation accuracy and reduce the result error.

[0044] The present invention will be described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0045] Such as figu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a convolutional neural network reasoning accelerator and an acceleration method, and belongs to the field of hardware implementation of an artificial intelligence algorithm. Aiming at the problems of high power consumption, poor configurability, low calculation precision and the like in the prior art, the invention provides the convolutional neural network reasoning accelerator and an acceleration method. The accelerator comprises a main control module and an address generation module. The system comprises an SRAM storage module, a data input module, a calculation engine module and a result output module, wherein parallel computing units in the computing engine module are arranged independently; on the basis of limited computing resources and storage resources, theperformance advantage of high parallelism is achieved, the parallel computing units have fixed-point interception and turn-off functions, the parallelism and the computing precision of the acceleratorare improved, and the use power consumption is reduced.

Description

technical field [0001] The invention relates to the field of hardware implementation of artificial intelligence algorithms, and more specifically, relates to a convolutional neural network reasoning accelerator and an acceleration method. Background technique [0002] Convolutional neural network is a deep feed-forward artificial neural network and one of the representative algorithms of deep learning. It has been successfully applied in computer vision, natural language processing and other fields. In a convolutional neural network, the convolutional layer can account for more than 80% of the total computing load and computing time of the entire network. Therefore, the acceleration of the convolutional layer is the key to improving the performance of the entire CNN network. In the calculation of the convolution layer, there are the following parameters: convolution kernel size (K), zero padding method (Pa), convolution step size (St), number of input image channels (Ch), nu...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/04G06N3/063G06N5/04G06K9/00
CPCG06N3/063G06N5/04G06V10/95G06N3/045
Inventor 李丽黄延傅玉祥宋文清何书专
Owner 南京宁麒智能计算芯片研究院有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products