Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Instruction execution method for vector complex multiplication operation and corresponding device

A technology of complex number multiplication and instruction execution, applied in the field of microprocessor architecture, can solve the problems of long pipeline series, instruction window control queue blockage, increase of processor area, etc., to save area and cost, avoid expansion, and avoid complexity degree of effect

Active Publication Date: 2010-09-08
LOONGSON TECH CORP
View PDF1 Cites 19 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The disadvantages of this implementation method are: 1. The number of pipeline stages is long, and the pipeline start-up time is long, which easily leads to insufficient instruction windows and blockage of each control queue; 2. Multiplication and addition are separate arithmetic units, which increases the area of ​​the processor. , which increases the hardware implementation cost of the processor; and 3. In order to increase the peak value of the vector complex multiplication operation, it is necessary to increase the corresponding operation function components in parallel, so that the increase in the implementation cost after the instruction vectorization is more obvious

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Instruction execution method for vector complex multiplication operation and corresponding device
  • Instruction execution method for vector complex multiplication operation and corresponding device
  • Instruction execution method for vector complex multiplication operation and corresponding device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] As mentioned above, the present invention aims to reduce the pipeline length and hardware implementation cost of complex vector multiplication operations, and its main idea is to use two instructions to pair to complete vector complex multiplication operations, wherein the first instruction performs vector complex multiplication The multiplication operation, the second instruction does the multiplication and addition operation of vector complex numbers. Also, the two instructions multiplex the vector multiply-accumulate functional unit, and two floating-point single precision and one floating-point double-precision multiplex one vector multiply-accumulate functional unit.

[0024] refer to Figure 9 Shown the block diagram of the vector multiplication and addition functional unit of the present invention, the operation of explaining the vector multiplication and addition functional unit in detail is as follows with the operand being 256 bits: every cycle transmits a vec...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to an instruction execution method for vector complex multiplication operation (a+bj)*(c+dj) in a processor and a corresponding device. The method is characterized by comprising the step of designing two instruction matching pairs to finish the vector complex multiplication operation (a+bj)*(c+dj), wherein the first instruction executes vector multiplication, and the operation numbers of the first instruction comprise (a+bj) and (c+dj) for calculating partial results of the vector complex multiplication operation (a+bj)*(c+dj); and the second instruction executes vector multiplication and addition, and the operation numbers of the second instruction comprise the (a+bj), the (c+dj) and execution results of the first instruction for calculating rest results of the vector complex multiplication operation (a+bj)*(c+dj) and adding the rest results and the partial results to obtain a final result of the vector complex multiplication operation (a+bj)*(c+dj). By designing the two instruction matching pairs to finish the operation, the flow length of the vector complex multiplication operation is a flow grade of the multiplication and addition operation. Moreover, the method can greatly save the area and cost of a chip by complexing vector multiplication and addition functional components of the two instructions.

Description

technical field [0001] The invention relates to the technical field of microprocessor architecture, in particular to an instruction execution method and a corresponding device for multiplication operation of vector complex numbers. Background technique [0002] With the continuous development of processor technology, the field of its application is also expanding. In order to meet the needs of high-end applications such as high-performance computing and digital signal processing, combined with the technical trend of the integration of general-purpose CPU (central processing unit) and DSP (digital signal processor), more and more general-purpose processors are used in fast Fourier Fields such as leaf transform (FFT), filtering (FIR) and other data-intensive calculations for dedicated digital signal processing. The field of data-intensive computing puts forward higher and higher requirements on the data processing capability of the chip, and all kinds of general-purpose proce...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F7/57G06F9/38
Inventor 郇丹丹刘宏伟张晓春
Owner LOONGSON TECH CORP
Features
  • Generate Ideas
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More