Supercharge Your Innovation With Domain-Expert AI Agents!

Method for small matrix inversion by using GPU (graphic processing unit)

A technology of small matrices and square matrices, applied in the field of inverting small matrices using GPU, can solve the problems of small parallel threads, reduce parallel efficiency, and share more memory, and achieve the effect of improving speed and good scalability

Inactive Publication Date: 2012-07-11
TSINGHUA UNIV
View PDF4 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, if one thread is used to calculate the inverse of a matrix, then each thread consumes more shared memory, thereby limiting the number of concurrent threads on the SM, thereby reducing its parallel efficiency
On the other hand, if one thread block is used to calculate the inverse of a matrix, that is, each thread in the thread block processes one element of the matrix, since the matrix size we have to process is often small (for example, 2×2, 4×4, 8×8), so the parallel threads on a thread block are too small, which will also affect its efficiency

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for small matrix inversion by using GPU (graphic processing unit)
  • Method for small matrix inversion by using GPU (graphic processing unit)
  • Method for small matrix inversion by using GPU (graphic processing unit)

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] The specific implementation manners of the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. The following examples are used to illustrate the present invention, but are not intended to limit the scope of the present invention.

[0031] figure 1 It is a flow chart of a method for inverting a small matrix using a GPU according to an embodiment of the present invention. Such as figure 1 As shown, the method includes:

[0032] Step A: Select a thread block composed of two-dimensional computing threads, the value of the first dimension of the thread block corresponds to the order of the square matrix to be processed, and is set to N, and the value of the second dimension corresponds to the order of the square matrix to be processed number, set to K. N and K are natural numbers greater than 0, where N is preferably 2, 4 or 8.

[0033] Step B: Create a two-dimensional array sm_a with a dimension of ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a method for small matrix inversion by using a GPU (graphic processing unit), which relates to the field of wireless communication. The method includes the steps: setting up a two-dimensional array sm_a with KX(NXN) serving as dimensionality on a shared memory of the GPU, and setting up a two-dimensional array sm_is and a two-dimensional array sm_js, wherein the dimensionality of each of the two-dimensional array sm_is and the two-dimensional array sm_js is KXN, and each of K and N is a natural number larger than 0; parallelly storing K N-order square matrixes in a global memory of the GPU into the two-dimensional array sm_a of the shared memory; and completing inversion processing for the K N-order square matrixes in the shared memory by the aid of the two-dimensional array sm_is and the two-dimensional array sm_js. By the aid of the method, parallel threads are increased, the shared memory is not excessively occupied, good expandability is achieved, and calculating speed for small matrix inversion is remarkably increased.

Description

technical field [0001] The invention relates to the technical field of wireless communication, in particular to a method for inverting a small matrix by using a GPU. Background technique [0002] Matrix inversion is an important matrix operation that is often encountered, and it is widely used in signal processing, neural network, automatic control and other fields. Especially in 4G wireless communication standards, multiple key functional modules, such as OFDM (Orthogonal Frequency Division Multiplexing, Orthogonal Frequency Division Multiplexing) system channel estimation, MIMO (Multiple-Input Multiple-Out-put, multiple input and output antenna system ) signal detection, etc., when using the zero-forcing algorithm or the minimum mean square error algorithm, it can be attributed to the inverse operation of a certain transformation of the channel matrix. In addition, for the LDPC (Low Density Parity Check Code, low Density parity check code) code encoding also requires larg...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/16
Inventor 隋丹李云洲周春晖赵熠飞赵明王京
Owner TSINGHUA UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More