Unlock instant, AI-driven research and patent intelligence for your innovation.

Support vector memory access method for data reorganization by module

A data and vector technology, applied in the field of vector memory access components, can solve the problems of reducing the execution efficiency of algorithms such as FFT, limited bandwidth of shuffling components, reducing program execution efficiency, etc., so as to reduce code size, mapping difficulty and code size. , the effect of improving execution efficiency

Active Publication Date: 2016-04-06
NAT UNIV OF DEFENSE TECH
View PDF2 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This limitation of instruction issue slots will reduce the execution efficiency of algorithms such as FFT
2) Shuffle components have limited bandwidth
However, due to the limitation of hardware overhead, Crossbar adopts a vector input and a vector output method during implementation, and the actual effective bandwidth is small.
3) There is an overhead in setting the shuffling mode
Although the multiple shuffling modes that Crossbar's shuffling component can provide, it is necessary to set and call different shuffling modes when using it, which introduces additional overhead
4) Irregular vector access problem of complex numbers
This will introduce additional shuffling operations after the vector Load / Store operation, thereby reducing the execution efficiency of the program

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Support vector memory access method for data reorganization by module
  • Support vector memory access method for data reorganization by module
  • Support vector memory access method for data reorganization by module

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] The present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0030] like figure 1 Shown is the structure of a typical SIMD processor. The SIMD processor generally includes a scalar unit 1 and a vector unit 2, wherein the scalar unit 1 is mainly responsible for processing tasks such as the serial execution part, execution branch, interrupt, and system configuration in the application; the vector unit 2 is mainly responsible for processing the parallel tasks in the application. accelerate. The scalar unit 1 and the vector unit 2 use a unified (or separate) instruction fetch unit 4 and an instruction dispatch unit 3 to fetch instruction packets from the instruction Cache 5 and then dispatch instructions. The following focuses on the discussion of the structure of the vector unit. The vector unit 2 includes multiple parallel processing elements 6 (ProcessingElement, PE), and the processing elemen...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Disclosed is a vector memory method capable of supporting modular reconstitution of data. The vector memory method includes increasing the modular reconstitution type MT domain in the basic vector Load / store instruction; to the vector Load instruction, executing a corresponding reconstitution mode when the MT is of the different value, performing modular reconstitution to data read from a VM (vector memory) and then writing the data into the VR(vector register); to the vector Store instruction, executing the corresponding reconstitution model when the MT is of the different value, and performing modular reconstitution to the data read from the VR and then writing the data into the VM. The width of the MT is 2 bits. Part of functions of a data shuffle and vector memory components of an SIMD (single instruction stream multiple data stream) processor can be combined, so that execution efficiency of algorithms represented by FFT (fast Fourier transform) on the SIDM processor can be improved remarkably while difficulty in mapping for programmers and code size of application programs can be reduced. Besides, by the vector memory method capable of supporting the modular reconstitution of data, the problem of vector access of complex data in the SIMD processor can be solved effectively.

Description

technical field [0001] The present invention mainly relates to the field of vector memory access components of processors adopting Single Instruction Stream Multiple Data Stream (SIMD) technology, and in particular refers to a vector memory access method that supports data recombination according to modules. Background technique [0002] SIMD technology can fully develop the parallelism of applications with low hardware overhead by using multiple processing units to share the same set of control components such as instruction fetching, decoding, address calculation, and memory access. Therefore, in current processors, such as General-purpose processors, graphics processors, and digital signal processors have been widely used. SIMD technology is divided into subword SIMD and vector SIMD. The former focuses on parallelism between multiple bytes or halfwords in a word, and the latter focuses on parallelism among multiple words. Currently, processors using SIMD technology (here...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F13/16G06F17/14
Inventor 刘胜陈海燕万江华陈书明刘宗林彭元喜刘仲陈胜刚陈小文雷元武燕世林
Owner NAT UNIV OF DEFENSE TECH