Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Multi-element LDPC code high-speed parallel decoder based on GPU, and decoding method thereof

An LDPC code and decoding technology, which is applied in the field of multivariate LDPC code high-speed decoders, can solve the problems of increasing global memory access, inability to fully utilize GPU performance, and decrease in resource utilization on-chip to reduce access , Improve resource utilization, improve the effect of resource utilization

Inactive Publication Date: 2018-08-28
BEIHANG UNIV
View PDF6 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Such a design scheme will bring two major problems: 1) greatly increase the synchronization overhead between thread blocks and the startup overhead of kernel functions, and increase the access to global memory (the access bandwidth of global memory is limited); 2 ) When the number is small, the utilization rate of on-chip resources will drop seriously, and the performance of the GPU cannot be fully utilized

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-element LDPC code high-speed parallel decoder based on GPU, and decoding method thereof
  • Multi-element LDPC code high-speed parallel decoder based on GPU, and decoding method thereof
  • Multi-element LDPC code high-speed parallel decoder based on GPU, and decoding method thereof

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] The present invention will be described in detail below in conjunction with the accompanying drawings and embodiments.

[0027] The present invention firstly provides a high-speed parallel decoder for multiple LDPC codes based on GPU, such as figure 1 As shown, the decoder architecture includes a host end and a GPU device end (abbreviated as the GPU end), the host end is provided with a host memory and a CPU chip, the GPU end is provided with a GPU chip, global memory and constant memory, and between the host memory and the global memory Through the asynchronous transmission bus connection, the layered information vector, the row information vector and the code word of the decision are transmitted in the form of asynchronous transmission between the host memory and the global memory on the GPU side; the constant memory is used to store the check matrix and fast Fourier transform Transform (FFT) index table. The CPU chip is used to control the entire decoding process, a...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a multi-element LDPC code high-speed parallel decoder based on a GPU, and a decoding method thereof, and belongs to the technical field of the communication. The decoding method is divided into three stages: an initialization stage, a decoding stage and a judgment stage; once iterative decoding of the decoder is accomplished by only using a kernel function, the synchronization expenditure between the thread blocks and the starting expenditure of the kernel function are greatly reduced, and the access mount of the whole memory is lowered; the use rate of a hardware resource is maximized by designing different parallelism degrees under different systems, thereby improving the decoding throughput to 100M level. Through a memory-access way provided by the invention, theshared memory on a GPU chip is sufficiently utilized, the access to the whole memory is greatly reduced, and the memory-access efficiency and the memory-access bandwidth are improved.

Description

technical field [0001] The invention belongs to the technical field of communication, and relates to a high-speed decoder of multiple LDPC codes based on a Graphics Processing Unit (GPU, graphics processor). Background technique [0002] Davey and Mackay conducted research on the Sum Product Algorithm (SPA) of multivariate LDPC codes in 1998 (reference [1]: M.C.Davey and D.J.C.MacKay, "Low-density parity check codes over GF(q)," IEEE Communications Lett., vol.2, no.6, pp.165-167, 1998.). When the code length is medium and short codes, compared with binary LDPC, multivariate LDPC codes can obtain about 1dB coding gain, but this is at the cost of higher decoding complexity. On the premise of not losing decoding performance, in order to reduce the decoding complexity of multivariate LDPC codes, Mackay and Davey proposed an FFT-based SPA decoding algorithm in 2000, see reference [2]: D.J.C.MacKay and M.C.Davey, "Evaluation of Gallager codes for shortblock length and high rate ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): H03M13/11G06F9/50
CPCG06F9/505H03M13/1134
Inventor 刘荣科刘占献赵岭
Owner BEIHANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products