Unlock instant, AI-driven research and patent intelligence for your innovation.

A self-organized deep neural network acceleration chip computing device

A deep neural network and accelerator chip technology, applied in the field of computing devices, can solve problems affecting energy efficiency and achieve the effects of improving energy efficiency, avoiding cross-access, and reducing the number of handling times

Active Publication Date: 2021-08-17
FUDAN UNIV
View PDF8 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, since the eigenvalues ​​of multiple input channels often need to be superimposed across channels before the final output eigenvalues ​​are obtained, this limitation leads to the need to exchange data between multiple on-chip memories, thus affecting the final energy efficiency.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A self-organized deep neural network acceleration chip computing device
  • A self-organized deep neural network acceleration chip computing device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] The technical solutions of the present invention will be further described below in conjunction with specific embodiments and accompanying drawings. It should be understood that the embodiments described below are used to illustrate rather than limit the technical solution of the present invention. The drawings are only embodiments of the present invention, and those skilled in the art can obtain other drawings according to the provided drawings without creative work.

[0023] The embodiment is a computing device of a deep neural network convolution computing accelerator. figure 1 Its top-level block diagram.

[0024] The device includes 16 two-stage on-chip storage modules 11 , a 16×16 multiply-accumulate computing unit array 12 , and a central control unit 13 .

[0025] Each storage module 11 includes a L2 cache 111 and a L1 read-only cache 112, both of which are realized by on-chip SRAM. The data in the first-level read-only cache is a copy of the data in the seco...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the technical field of integrated circuits, in particular to a computing device of a channel self-organized deep neural network acceleration chip. The device of the present invention includes: a plurality of on-chip memory modules for storing eigenvalue matrices by channel; a multi-row multi-column computing unit array for performing operations such as convolution and matrix multiplication in parallel; a central control unit for controlling Computational process and data flow, and data interaction with the outside world. Computations from the same input channel are processed in parallel by the same column of compute units in the compute unit array, and computations from the same output channel are processed in parallel by the same row. In the computing process, the device organizes the data flow according to the corresponding relationship between the channel and the memory, avoids the cross access of the computing unit among multiple memories, reduces the number of data transfers between the memories, and improves the energy efficiency of the chip.

Description

technical field [0001] The invention belongs to the technical field of integrated circuits, and in particular relates to a computing device for a deep neural network acceleration chip. Background technique [0002] Today, deep neural network algorithms are widely used in many fields such as autonomous driving, target recognition and detection, and machine translation. As the application of deep learning algorithms becomes increasingly complex, traditional CPUs and GPUs become more and more stretched in terms of energy efficiency, and a series of dedicated acceleration chips for deep neural network algorithms have emerged as the times require. [0003] The deep neural network algorithm consists of many layers, and the input eigenvalues ​​and output eigenvalues ​​of each layer are matrix groups composed of multiple channels. After the input eigenvalues ​​undergo a series of linear operations (such as convolution, matrix multiplication), and then pass through a non-linear acti...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06N3/08G06N3/04
CPCG06N3/082G06N3/045
Inventor 朱浩哲王彧张怡云史传进
Owner FUDAN UNIV