Configurable matrix register unit for supporting multi-width SIMD and multi-granularity SIMT

A technology for configuring matrices and registers, applied in machine execution devices, concurrent instruction execution, etc., can solve problems such as inability to develop parallelism, inability to support multi-width SIMD processing, inability to flexibly process matrix data, etc., to achieve block size and number of threads Flexible configuration, efficient and flexible processing, and easy operation

Active Publication Date: 2014-09-10
NAT UNIV OF DEFENSE TECH
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Existing research provides access to block data of a fixed size in the above-mentioned matrix registers. These technologies read and write a row vector or a column vector of the matrix at a time. The length of the vector is fixed. When the length of the vector is greater than or less than the fixed length, Usually, multiple short vectors are combined into one long vector for parallel processing, or a long vector is split into several short vectors for step-by-step processing, which cannot flexibly handle matrix data of different sizes, and does not support multi-width SIMD processing. It also does not support simultaneous access to multiple matrix data in the form of multi-granularity SIMT, neither can obtain enough flexibility, nor can it develop enough parallelism, especially thread-level parallelism

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Configurable matrix register unit for supporting multi-width SIMD and multi-granularity SIMT
  • Configurable matrix register unit for supporting multi-width SIMD and multi-granularity SIMT
  • Configurable matrix register unit for supporting multi-width SIMD and multi-granularity SIMT

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0027] like figure 1 Shown is a schematic diagram of the overall structure of the matrix register of the present invention. The configurable matrix register unit supporting multi-width SIMD and multi-granularity SIMT of the present invention includes a matrix register and a control register SR.

[0028] When the read and write enable signal is valid, the address decoding logic unit performs decoding under the control of the row and column selection signal and the control register SR according to the content of the read and write address, and selects a row vector or column vector of the matrix register to read and write. Or select one or more sub-row vectors or sub-column vectors to read and write. The matrix register is composed of N*N storage unit arrays, each unit has a bit width of W and a storage capacity of (N*N*W) bits. By co...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a configurable matrix register unit for supporting multi-width single instruction multiple data stream (SIMD) and multi-granularity single instruction multiple threads (SIMT). The configurable matrix register unit comprises a matrix register and a control register SR; the matrix register of which the size is N*N is divided into M*M blocks, wherein N is a positive integer and is the power of 2, and M is an integer which is more than or equal to 0 and is the power of 2; the block modes of the matrix register and the multi-thread numbers simultaneously processed by a vector processing unit are recorded in the control register; and the width of the control register is log2C+log2T, wherein C is the number of the number of the block modes of the matrix register, and T is the number of multi-thread modes which can be processed by a vector processor. The configurable matrix register unit has the advantages that: the principle is simple; the configurable matrix register unit is simple and convenient to operate; the block size and the thread number can be configured flexibly; the access to vector data in the mode of multi-width SIMD and multi-granularity SIMT is supported at the same time and the like.

Description

technical field [0001] The present invention mainly relates to the design field of vector registers in vector processors, in particular to a matrix register with configurable block size and thread number in vector processors to support single instruction stream multiple data streams (SIMD) and single The vector operation unit operating in the instruction stream multithreading (SIMT) mode performs multi-width and multi-granularity access to data. Background technique [0002] With the in-depth research of 4G wireless communication technology and video image processing technology, vector processors have been widely used. Rapidly evolving wireless communication protocols and video image processing algorithms require a large number of matrix operations, such as channel estimation, MIMO equalization, and DCT transformation. The parallel granularity of matrix operations in different algorithms is different, and the size of the matrix blocks processed by the algorithms is also dif...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F9/30G06F9/38
Inventor 陈书明张凯陈海燕万江华彭元喜刘仲阳柳杨惠刘蓬侠胡春媚唐涛
Owner NAT UNIV OF DEFENSE TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products