Supercharge Your Innovation With Domain-Expert AI Agents!

Neural network-oriented SIMT micro-architecture with variable calculation precision

A technology of calculation accuracy and neural network, applied in the field of SIMT micro-architecture, can solve problems such as neural network cannot be supported, neural network reasoning acceleration, floating-point operation cannot be operated, etc., to achieve low calculation bandwidth requirements, accelerate neural network calculation, hardware The effect of high utilization

Active Publication Date: 2021-06-18
SHANGHAI JIAO TONG UNIV
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0013] Although Bit Fusion can use low-precision hardware in exchange for high throughput, Bit Fusion only supports fixed-point 2, 4, and 8 bits, thereby supporting the inference acceleration of neural networks, but the training of neural networks cannot support
In addition, the floating-point operations involved in scientific computing cannot be performed in Bit Fusion

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Neural network-oriented SIMT micro-architecture with variable calculation precision
  • Neural network-oriented SIMT micro-architecture with variable calculation precision
  • Neural network-oriented SIMT micro-architecture with variable calculation precision

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0044] In this embodiment, the neural network has the characteristics of large amount of calculation, many calculation parameters, high calculation bandwidth requirements, and variable calculation accuracy. On the basis of the SIMT architecture of Nvidia's GPGPU, the SM (Streaming Multiprocessor, SMP (SM Processing Block, SM processing block) micro-architecture, and focused on designing the circuit structure of SP (Streaming Processor, stream processor). The redesigned SMP has two application scenarios: the convolution operation mode and the general calculation mode. The convolution operation mode can efficiently complete the calculation of the neural network, and the general calculation mode can ensure the realization of the general algorithm.

[0045] 1. Overall structure

[0046] The SIMT (Single Instruction Multiple Thread, Single Instruction Multiple Thread) microarchitecture of the variable computing precision facing neural network designed in this embodiment is SMP (SM ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a neural network-oriented SIMT micro-architecture with variable calculation precision, which comprises an L1 instruction cache, an L0 instruction cache, a thread group scheduler, an instruction scheduling unit, a U-Core array, a special function unit, a register file and a read-write unit, and is characterized in that the L1 instruction cache, the L0 instruction cache and the thread group scheduler are sequentially connected with the instruction scheduling unit; the U-Core array and the special function unit are both connected with the instruction scheduling unit and the register file, the read-write unit is connected with the thread group scheduler, and the U-Core array is a computing unit array composed of a plurality of computing units U-Core which can be configured into different precisions. Compared with the prior art, the invention has the characteristics of high calculation flexibility, high calculation performance, high calculation efficiency, variable calculation precision, high universality and the like.

Description

technical field [0001] The invention relates to a SIMT microframework, in particular to a neural network-oriented SIMT microframework with variable calculation precision. Background technique [0002] Neural network (also known as artificial neural network) is a very widely used algorithm in the field of machine learning in recent years, and has achieved excellent results in computer vision, natural language processing and other fields, including convolutional neural network, loop neural network, etc. However, neural network algorithms often have a large amount of calculation, among which the largest amount of calculation is the convolutional neural network (Convolutional Neural Network, CNN). Typical convolutional neural networks include VGG, GoogLeNet, ResNet, etc. The amount of parameters Usually there are hundreds of MB, and the calculation amount can reach hundreds of MFLOPs (floating-point operations) or even GFLOPs, which puts higher demands on computing hardware. T...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/063G06N3/04
CPCG06N3/063G06N3/045Y02D10/00
Inventor 江子山梁晓峣景乃锋宋卓然王建飞
Owner SHANGHAI JIAO TONG UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More