High-performance RISC-DSP

a high-performance, risc-based technology, applied in the direction of instruments, runtime instruction translation, digital computers, etc., can solve the problems of imposing a substantial burden on the cpu in managing dsp functions, deterministic real-time allocation in sophisticated cpus, and reducing the performance of dsps up to an order of magnitude compared to specialized dsps, etc., to achieve efficient implementation of dsp algorithms

Inactive Publication Date: 2007-10-11
ARM FINANCE OVERSEAS LTD
View PDF28 Cites 26 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

[0006] Systems and methods consistent with the present invention provide for an alternate DSP architecture configuration that allows for more efficient implementation of DSP algorithms and use of CPU resources.

Problems solved by technology

Implementing certain DSP algorithms, such as the FIR Filter or Discrete Cosine Transform (DC7), in software, however, may degrade performance up to an order of magnitude as compared to specialized DSPs.
Another difficulty is the problem of deterministic real-time allocation in sophisticated CPUs.
These approaches, however, impose a substantial burden on the CPU in managing DSP functions.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • High-performance RISC-DSP
  • High-performance RISC-DSP
  • High-performance RISC-DSP

Examples

Experimental program
Comparison scheme
Effect test

case 1

[0135] 16-Bit Inner Product SUM=SUM+Ai*Bi

[0136] Assuming packed operands, two multiply-adds per cycle:

MADDA2m0,r1,r2MADDA2m0,r3,r4MADDA2m0,r5,r6MADDA2m0,r7,r8. . .

case 2

[0137] 16-Bit Vector Product Loop. Ci=Ai*Bi

[0138] Assuming packed fractional operands, two multiplies per two cycles using two accumulator pairs.

MULTA2m0,r1,r2MFA2m1,r8MULTA2m1,r3,r4MFA2m0,r7. . .

case 3

[0139] 16-bit complex vector product. Ci=Ai*Complex Bi

[0140] Assuming fractional operands packed as 16-bit real, 16-bit imaginary. One complex multiply every two cycles using two accumulator pairs.

CMULTAm0,r1,r2MFA2m1,r8CMULTAm1,r3,r4MFA2m0,r7. . .

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

A DSP superscalar architecture employing dual multiply accumulate pipelines. Dual MAC pipelines allow for a seamless transition between established RISC instruction sets and extended DSP instructions sets. Relocatable opcodes are provide to allow further extensions of RISC instruction sets. The DSP superscalar architecture also provides memory pointers with hardware circular buffer support, an interruptible and nested zero-overhead loop counter, and prioritized low overhead interrupts.

Description

CROSS REFERENCE TO RELATED APPLICATIONS [0001] This application is a continuation of U.S. patent application Ser. No. 09 / 637,500, filed Aug. 11, 2000, which is incorporated herein by reference in its entirety, and which claims the benefit of U.S. Provisional Application No. 60 / 148,652, filed Aug. 13, 1999.FIELD OF THE INVENTION [0002] The present invention relates to digital signal processor (DSP) architectures. BACKGROUND OF THE INVENTION [0003] With the increasing commercial importance of DSP-intensive applications, such as wireless communication, modems, and computer telephony, has come an increasing recognition of the benefit of implementing DSP functions on a CPU. Not only are CPUs usually needed for memory management, user interface and Internet Protocol software, CPUs also have excellent third-party software tool support. [0004] Implementing certain DSP algorithms, such as the FIR Filter or Discrete Cosine Transform (DC7), in software, however, may degrade performance up to a...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(United States)
IPC IPC(8): G06F15/76
CPCG06F7/5443G06F9/30014G06F9/30036G06F9/3017G06F9/3857G06F9/3836G06F9/3885G06F9/3893G06F2207/3884G06F9/30174G06F9/3858
Inventor DALLY, WILLIAM J.HAYS, W. PATRICKGELINAS, ROBERTKATZMAN, SOLROSEN, SAMERICSSON, STAFFAN
Owner ARM FINANCE OVERSEAS LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products