Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Mixed bit width accelerator based on DSP and fusion calculation method

A technology of accelerators and adders, applied in computing, instruments, biological neural network models, etc., can solve problems such as no higher parallelism implementation plan, poor versatility, and limitations, and achieve wide application and versatility Good, the effect of maximizing computing performance

Active Publication Date: 2022-03-25
XI AN JIAOTONG UNIV
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the existing problems are: 1) the neural network quantization bit width only considers 8 bits, and for networks with mixed bit width quantization, quantization parameters with lower bit width (such as 4 bit, 2 bit, 1 bit, etc.) are common; 2) only Consider the case of weight sharing, that is, the multipliers in the parallel two sets of multiplications are fixed, so the versatility is not strong; 3) limited by its 8-bit quantization bit width considerations, no higher degree of parallelism is proposed (i.e., realizing more sets at the same time) multiplication) implementation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Mixed bit width accelerator based on DSP and fusion calculation method
  • Mixed bit width accelerator based on DSP and fusion calculation method
  • Mixed bit width accelerator based on DSP and fusion calculation method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047] In order to further understand the present invention, the present invention will be described below in conjunction with the examples. These descriptions are only to further explain the features and advantages of the present invention, and are not intended to limit the claims of the present invention.

[0048] Firstly, the high-bit-width DSP realizes two low-bit-width multiplication and accumulation schemes at the same time, such as figure 1 shown. For two sets of low-bit-width multiplications to be performed, the bit-widths of the multiplicand and the multiplier are n1 and n3, n2 and n4, respectively. The two multiplicands and the multiplier form the multiplication of two high-bit-width numbers by shifting and splicing, and the shift numbers during splicing are x+n4 and y+n2 respectively. Among them, x, y, n1, n2, n3, and n4 need to satisfy the following constraints:

[0049] x+y+2(n2+n4)≥max(n1+x+n4+n2+n4,n3+y+n2+n4+n2)+1 (1)

[0050] Under this constraint, by calcu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

According to the mixed bit width accelerator based on the high bit width DSP and the fusion calculation method, the DSP serves as a main calculation unit, a multiplier and a multiplicand are connected and shifted, namely different isolation bit widths are inserted, and multiple groups of arbitrary low bit width multiply-accumulate operation can be achieved. The accelerator supports any multiplication parallelism degree, and the DSP calculation performance is maximized; the method supports a multiplier and a multiplicand with any bit width, supports fixed and unfixed conditions of the multiplier, and is better in universality and wide in application range.

Description

technical field [0001] The invention relates to the design of a mixed-bit-width neural network accelerator, in particular to a mixed-bit-width accelerator based on a high-bit-width DSP and a fusion calculation method. Background technique [0002] The DSP mentioned in the present invention refers to the DSP IP in the FPGA, that is, the IP for realizing digital signal processing technology. The advantage of DSP lies in the high-speed and real-time nature of processing signals. There are special hardware multipliers inside the DSP, which can be used to quickly realize various digital signal processing algorithms. Taking the DSP48E1 IP of Xilinx 7 series FPGA as an example, it mainly includes a 25×18-bit wide multiplier, a 48-bit accumulator, etc., which can realize operations such as multiplication, multiplication accumulation, and bitwise logic operations without using FPGA resources. It is widely used in communication fields such as graphics and image processing, voice pro...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/063G06F7/523
CPCG06N3/063G06F7/523
Inventor 杨晨王佳兴席嘉蔚
Owner XI AN JIAOTONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products