Systolic Array Based Sparse Neural Network Processor

A neural network and processor technology, applied in the field of computer architecture for sparse neural network computing, can solve the problems of irregularity of sparse patterns and difficulty in effectively utilizing the sparsity of neural networks, etc., to reduce memory capacity, reduce space overhead, The effect of improving efficiency

Active Publication Date: 2022-05-27
BEIHANG UNIV
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, it is worth noting that due to the irregularity of the sparse patterns of neural network weights, gradients, and features, this sparsity of neural networks is often difficult to be effectively utilized, especially on highly regular computing platforms such as systolic arrays.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Systolic Array Based Sparse Neural Network Processor
  • Systolic Array Based Sparse Neural Network Processor
  • Systolic Array Based Sparse Neural Network Processor

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] In order to make the objectives, technical solutions and advantages of the present invention clearer, the present invention will be further described in detail below with reference to specific embodiments and accompanying drawings.

[0037] figure 1 It is a schematic diagram of the architecture of a sparse neural network processor according to an embodiment of the present invention. It includes storage unit, control unit, bus array, sparse matrix operation array and calculation unit. The storage unit is used to store the neural network weights, gradients, features, and instruction sequences for data flow scheduling. The control unit is connected with the storage unit, and according to the scheduling of the instruction sequence, obtains the required data from the storage, reshapes the data into the form of matrix operation, and then is bridged by the bus array and sent to the sparse matrix operation array. to complete the corresponding calculation. The calculation uni...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a sparse neural network processor based on a systolic array, which includes a storage unit, a control unit, a sparse matrix operation array, a calculation unit, and a bus array. The storage unit is used to store weights, gradients, features, and instruction sequences for scheduling data streams. According to the control of the instruction sequence, the control unit takes out the data required for the training and reasoning process from the storage unit, converts it into a sparse matrix operation format, and sends it to the sparse matrix operation array. The sparse matrix operation array includes multiple processing units connected in a systolic array to complete the sparse matrix operation. The computing unit is used to complete element-wise operations such as nonlinear activation functions. The bus array uses internal data transfer to send the same data segment to different rows of the systolic array to reduce storage overhead. This processor makes full use of the sparsity of weights and features to improve the speed and power consumption ratio of neural network training and reasoning, and has the advantages of high concurrency and low bandwidth requirements.

Description

technical field [0001] The present invention relates to neural network technology and computer architecture, in particular to a computer architecture used for sparse neural network computing. Background technique [0002] In recent years, deep learning has gradually made more and more remarkable achievements in fields such as image recognition and speech processing. However, with the continuous increase of network depth, the computing power and memory access bandwidth required in the process of deep neural network training and inference are gradually difficult to be satisfied by traditional computing platforms. Therefore, the industry and academia have proposed various domain-specific architectures (domain specified architectures) applied to neural networks to meet this demand. Among them, the systolic array architecture has attracted great attention from the industry and academia due to its characteristics of high concurrency and low bandwidth requirements. [0003] On th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06N3/063G06F15/80G06N3/04
CPCG06N3/063G06F15/8046G06N3/048
Inventor 杨建磊赵巍胜付文智
Owner BEIHANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products