Instruction execution method for vector complex multiplication operation and corresponding device

A technology of complex number multiplication and instruction execution, applied in the field of microprocessor architecture, can solve the problems of long pipeline series, instruction window control queue blockage, increase of processor area, etc., to save area and cost, avoid expansion, and avoid complexity degree of effect

Active Publication Date: 2010-09-08
LOONGSON TECH CORP
View PDF1 Cites 19 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The disadvantages of this implementation method are: 1. The number of pipeline stages is long, and the pipeline start-up time is long, which easily leads to insufficient instruction windows and blockage of each control queue; 2. Multiplication and addition are separate arithmetic units, which increases the area of ​​the processor. , which

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Instruction execution method for vector complex multiplication operation and corresponding device
  • Instruction execution method for vector complex multiplication operation and corresponding device
  • Instruction execution method for vector complex multiplication operation and corresponding device

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0023] As mentioned above, the present invention aims to reduce the pipeline length and hardware implementation cost of vector complex number multiplication, and the main idea of ​​its invention is to use two instruction pairings to complete vector complex number multiplication operations. Among them, the first instruction is a vector complex number. For the multiplication operation, the second instruction performs vector complex multiplication and addition operations. In addition, the two instructions multiplex the vector multiply-add functional unit, and two floating-point single-precision and one floating-point double-precision multiplex a vector multiply-add functional unit.

[0024] reference Picture 9 The block diagram of the vector multiply-add function unit of the present invention is shown. Taking the operand as 256 bits as an example, the operation of the vector multiply-add function unit is explained in detail as follows: a vector multiply or vector multiply-add instr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to an instruction execution method for vector complex multiplication operation (a+bj)*(c+dj) in a processor and a corresponding device. The method is characterized by comprising the step of designing two instruction matching pairs to finish the vector complex multiplication operation (a+bj)*(c+dj), wherein the first instruction executes vector multiplication, and the operation numbers of the first instruction comprise (a+bj) and (c+dj) for calculating partial results of the vector complex multiplication operation (a+bj)*(c+dj); and the second instruction executes vector multiplication and addition, and the operation numbers of the second instruction comprise the (a+bj), the (c+dj) and execution results of the first instruction for calculating rest results of the vector complex multiplication operation (a+bj)*(c+dj) and adding the rest results and the partial results to obtain a final result of the vector complex multiplication operation (a+bj)*(c+dj). By designing the two instruction matching pairs to finish the operation, the flow length of the vector complex multiplication operation is a flow grade of the multiplication and addition operation. Moreover, the method can greatly save the area and cost of a chip by complexing vector multiplication and addition functional components of the two instructions.

Description

technical field [0001] The invention relates to the technical field of microprocessor architecture, in particular to an instruction execution method and a corresponding device for multiplication operation of vector complex numbers. Background technique [0002] With the continuous development of processor technology, the field of its application is also expanding. In order to meet the needs of high-end applications such as high-performance computing and digital signal processing, combined with the technical trend of the integration of general-purpose CPU (central processing unit) and DSP (digital signal processor), more and more general-purpose processors are used in fast Fourier Fields such as leaf transform (FFT), filtering (FIR) and other data-intensive calculations for dedicated digital signal processing. The field of data-intensive computing puts forward higher and higher requirements on the data processing capability of the chip, and all kinds of general-purpose proce...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F7/57G06F9/38
Inventor 郇丹丹刘宏伟张晓春
Owner LOONGSON TECH CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products