Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Fast fourier transformation (FFT) parallel method based on multi-core digital signal processor (DSP) platform

A platform and multi-core technology, applied in the field of fast Fourier transform FFT processing, can solve problems such as algorithm complexity, waste of resources, and discussion of algorithm scalability solutions, and achieve the effect of improving computing efficiency and low communication distance

Inactive Publication Date: 2012-11-28
UNIV OF ELECTRONICS SCI & TECH OF CHINA
View PDF2 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

2011.01.05 Chinese patent application CN 101937422A proposed a FFT parallel method based on a GPU many-core platform. The parallel FFT implementation on the GPU platform only emphasized the parallel operation of many cores, and did not disclose the communication rate and header between cores. Adequate attention and research to department expenditure
Liu Boda and others disclosed a configurable multi-core parallel FFT algorithm in an LTE system on China Integrated Circuit P24-28. They only split and realized the two-core and four-core environments, and did not carry out the scalability scheme of the algorithm. Discussion, and the implementation platform of this scheme is in FPGA, when the calculation amount is relatively large, more dedicated hardware chips are required, the power consumption of the system will be relatively large, and it is not easy to implement software radio
Zhong Cui-xiang and Han Guo-qiang proposed a parallel FFT algorithm (Some New Parallel Fast Fourier Transform) at the Sixth International Conference on Parallel and Distributed Computing in 2005. Alogrithms), but the algorithm is more complicated when dealing with the interaction between processors
In System Engineering and Electronic Technology 2003 (10): 1192-1196 published by Liu Li et al. in 2003, the multi-DSPs parallel processing algorithm and implementation of the large-point FFT adopt the former part of the butterfly to be divided into several processors. Parallel processing, the final butterfly is all concentrated on one processor, the degree of parallelism is not high enough, no doubt a waste of resources

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Fast fourier transformation (FFT) parallel method based on multi-core digital signal processor (DSP) platform
  • Fast fourier transformation (FFT) parallel method based on multi-core digital signal processor (DSP) platform
  • Fast fourier transformation (FFT) parallel method based on multi-core digital signal processor (DSP) platform

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0064] Taking the number of processors P=4 as an example, set processor 0 as the main processor, and processors P0, P1, P2, and P3 are connected together to form a mesh network structure. The parallel FFT algorithm is implemented on the TMS320C6474EVM board, Figure 4 It is a data interaction diagram between 4 cores. SRIO (Serial Rapid I / O, fast serial input and output) interface is used for data communication between processor P0 and processor P1, between processor P2 and processor P3; between processor P0 and processor P2 1. The processor P1 and the processor P3 use an EDMA3 (Enhanced DMAversion3 Enhanced Direct Memory Access) interface for data communication.

[0065] The specific implementation steps of 4 cores are as follows:

[0066] The first step: preprocessing. Processor P0 first performs bit inversion on the data, and then evenly distributes the inverted data to 4 processor units through the SRIO interface or EDMA3, and each processor unit obtains N / 4 data. See ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a fast fourier transformation (FFT) parallel method based on a multi-core digital signal processor (DSP) platform. The method comprises the following steps that: a main processor uniformly distributes data of an original point N to be subjected to FFT to all P processors; sub FFT is executed on a point N / P on each processor, and thus FFT results of the previous log2(N / P) levels of the original point N are obtained; and due to data interaction between the processors, so that FFT from the log2(N / P)th level to the log2(N-1)th level of the original point N can be finished. During data interaction, the standard whether the number of the processors meets the equation i%t=i%(2t) is used for selecting an interaction object of the processors, so that the communication distance between the processors is shortest, and the operation efficiency is improved.

Description

technical field [0001] The invention relates to a fast Fourier transform FFT processing technology, in particular to an FFT processing technology based on a multi-core digital signal processor DSP. Background technique [0002] Discrete Fourier transform (DFT) is an important transformation in digital signal analysis and processing, but its application is greatly limited due to its computational complexity. Since T.W.Coody and J.W.Tukey proposed the method of quickly calculating DFT, people have been exploring and improving the FFT algorithm on this basis. Over the years, they have successively proposed radix 2FFT, radix 4FFT, hybrid Fast algorithms such as radix FFT and split radix FFT. Each kind of FFT transformation is divided into two kinds of decimation by frequency and decimation by time. [0003] At present, in scientific and engineering practice, the most widely used is base-2FFT, that is, the number of FFT points is N=2 M , where M is a positive integer. [0004...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/14
Inventor 王坚李玉柏薛姗姗
Owner UNIV OF ELECTRONICS SCI & TECH OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products