Supercharge Your Innovation With Domain-Expert AI Agents!

Sparse matrix vector multiplication calculation unit for arranged block diagonal weight matrix

A technology of weight matrix and sparse matrix, applied in the field of sparse matrix-vector multiplication calculation unit, can solve the problems of not considering the sparseness of intermediate products and the increase of system power consumption, and achieve the effect of eliminating storage overhead and avoiding accumulation operations.

Active Publication Date: 2020-03-17
扬州伊达实业有限公司
View PDF13 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

On the contrary, because of the addition of additional comparison operations, it will also increase the power consumption of the entire system
(3) The calculation unit does not consider the sparsity of the intermediate product, because the product generated by zero-value weight or zero-value excitation is also zero, so the intermediate product has the same or greater sparsity than the weight matrix

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sparse matrix vector multiplication calculation unit for arranged block diagonal weight matrix
  • Sparse matrix vector multiplication calculation unit for arranged block diagonal weight matrix
  • Sparse matrix vector multiplication calculation unit for arranged block diagonal weight matrix

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0025] The sparse matrix-vector multiplication calculation unit for the arranged block diagonal weight matrix of this embodiment, such as figure 1 , including: several processing units and an accumulator; the output of the processing unit is connected to the accumulator, and the accumulator is used to accumulate the outputs of all processing units; in order to reduce consumption, the processing unit is in the first stage, and the accumulator is in the second to Nth stages, The outputs of the two processing units are connected to an accumulator at the second stage, and the outputs of the two accumulators at the n+1th stage are connected to an accumulator at the n+2th stage, where n∈(1,N-2 ), the Nth stage contains 1 accumulator. If the processing unit or the accumulator in a certain level is singular, it only needs to be connected to the accumulator in the next level alone, which is equivalent to adding a 0 bit.

[0026] Assuming that the length of the input excitation vector ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a sparse matrix vector multiplication calculation unit for an arranged block diagonal weight matrix. The sparse matrix vector multiplication calculation unit comprises a plurality of processing units and an accumulator, and the output of the processing unit is connected with the accumulator. According to the sparse matrix vector multiplication calculation unit for the arranged block diagonal weight matrix provided by the invention, the sparse characteristic of the weight matrix after pruning is fully utilized, and multiplication operation between a zero-value weight and a corresponding input excitation element is avoided. Zero hopping operations can be dynamically enabled in conjunction with sparsity of input excitations. The sparsity of the intermediate product obtained by multiplying the weight by the corresponding input excitation is fully utilized, and the accumulation operation between the zero product and the corresponding product is avoided. The designedpointer generator eliminates the storage overhead of pointers recording non-zero value position information.

Description

technical field [0001] The invention relates to a sparse matrix-vector multiplication calculation unit for arranged block diagonal weight matrices, which belongs to the technical field of integrated circuit design. Background technique [0002] In recent years, deep learning algorithms have begun to replace traditional algorithms and become mainstream algorithms in many fields due to their excellent performance. However, the current mainstream processors (CPU, GPU, DSP, etc.) are still unable to better adapt to their data-intensive computing characteristics. Therefore, there has been a wave of research on deep learning processors in the academic and industrial circles. [0003] The fully connected operation is a very important type of operation in the neural network algorithm, and it is also a type of operation with a very large amount of calculation. Therefore, efficiently performing such operations becomes the key to improving the performance of deep learning processors. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F30/39
Inventor 郑勇陈柱佳舒毅
Owner 扬州伊达实业有限公司
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More