Mode-based multiply-add processor for denormal operands

An anti-standard, processor technology, applied in the direction of electrical digital data processing, calculation using digital representation, digital data processing components, etc., can solve the problem of reducing the maximum operating frequency, increasing the minimum cycle time, increasing the silicon area, etc. question

Inactive Publication Date: 2009-05-20
QUALCOMM INC
View PDF0 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, such additional circuitry increases silicon area, increases delay and creates feedthrough lag, thus potentially increasing minimum cycle time and thus reducing maximum operating frequency
Also, denormal numbers are rarely encountered, and optimizing performance for said rare cases at the expense of common cases reduces total processor performance

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Mode-based multiply-add processor for denormal operands
  • Mode-based multiply-add processor for denormal operands
  • Mode-based multiply-add processor for denormal operands

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] figure 1 A functional block diagram of processor 10 is depicted. Processor 10 executes instructions in instruction execution pipeline 12 according to control logic 14 . Control logic 14 includes one or more registers, such as status register 15 , that define various modes of operation. Pipeline 12 may be a superscalar design with multiple parallel pipelines (eg, 12a and 12b). Each pipeline 12a, 12b includes various registers or latches 16 organized into pipeline stages, and one or more arithmetic logic units (ALUs) 18 . Pipeline stage registers or latches 16 and ALU 18 may read operands from and / or write results to registers in general register file 18 .

[0019] The pipelines 12a, 12b fetch instructions from an instruction cache (I-cache or I$) 20 where memory addressing and permission are managed by an instruction-side translation lookaside buffer (ITLB) 22 . Data is accessed from a data cache (D-cache or D$) 24 where memory addressing and permissions are managed ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

In a denormal support mode, the normalization circuit of a floating-point adder is used to normalize or denormalized the output of a floating-point multiplier. Each floating-point multiply instruction is speculatively converted to a multiply-add instruction, with the addend forced to zero. This preserves the value of the product, while normalizing or denormalizing the product using the floating-point adder's normalization circuit. When the operands to the multiply operation are available, they are inspected. If the operands will not generate an unnormal intermediate product or a denormal final product, the add operation is suppressed, such as by operand-forwarding. Additionally, each non-fused floating-point multiply-add instruction is replaced with a multiply-add instruction having a zero addend, and a floating-point add instruction having the addend of the original multiply-add instruction is inserted into the instruction stream. Upon inspection of the operands, if an unnormal intermediate result or a denormal final result will not occur, the addend may be restored to the multiply-add instruction and the add instruction converted to a NOP.

Description

technical field [0001] The present disclosure relates generally to the field of processors and more particularly to a pattern-based method of recoding floating point multiply instructions as floating point multiply-add instructions to compute denormalized operands. Background technique [0002] Microprocessors perform computational operations on numerical values ​​in a wide variety of applications. High execution speed, low power consumption, and small size are important goals for processor designers, especially in embedded applications such as portable electronic devices. Modern processors employ a pipelined architecture in which sequential instructions, each with multiple execution steps, overlap when executed. In a pipelined architecture, each instruction is executed in a series of execution stages, such as fetch, decode, execute, and writeback, each of which may include multiple pipeline stages. The pipeline stage consists of storage elements and logic that executes al...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F7/499G06F7/487G06F7/544
CPCG06F7/49915G06F7/5443G06F7/4876G06F2207/3884G06F7/38G06F7/487G06F7/544
Inventor 肯尼思·艾伦·多克瑟尔帕蒂克·苏尼尔·拉尔
Owner QUALCOMM INC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products