Multiply-accumulate instruction processing method and multiply-accumulate instruction processing device

A technology for instruction processing and multiply-accumulate operations, applicable to instruction analysis, register arrangements, machine execution arrangements, and similar areas, that addresses the serious overflow problem in multiply-accumulate operations.

Pending Publication Date: 2020-10-23
C SKY MICROSYST CO LTD

AI Technical Summary

Problems solved by technology

[0006] In addition, SIMD instructions, which perform the same operation on multiple sets of data in parallel, are now widely used in vector digital signal processing (VDSP) instruction sets, and the overflow problem of multiply-accumulate operations has become increasingly serious as a result.


Embodiment Construction

[0037] Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present disclosure, and to fully convey the scope of the present disclosure to those skilled in the art.

[0038] Figure 1 is a schematic diagram of an instruction processing device 100 according to an embodiment of the present invention. The instruction processing device 100 has an execution unit 140 comprising circuitry operable to execute instructions, including multiply-accumulate instructions according to the present invention. In some embodiments, the instruction processing device 100 may be a processor, a processor core of a...

Abstract

The invention discloses an instruction processing device. The instruction processing device comprises: a first register adapted to store first source data; a second register adapted to store second source data; a third register adapted to store accumulated data; a decoder adapted to receive and decode a multiply-accumulate instruction; and an execution unit. The multiply-accumulate instruction indicates the first register as a first operand, the second register as a second operand, the third register as a third operand, and a shift identifier. The execution unit is coupled to the first register, the second register, the third register, and the decoder, and executes the decoded multiply-accumulate instruction to: obtain the first source data from the first register and the second source data from the second register; multiply the first source data by the second source data to obtain a multiplication result; shift the multiplication result according to the shift identifier; and add the shifted multiplication result to the accumulated data in the third register to obtain a multiply-accumulate result. The invention further discloses a corresponding instruction processing method, a computing system, and a system on chip.
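The sequence described in the abstract (multiply, shift the product, then accumulate) can be sketched as a scalar software model. This is only an illustration under stated assumptions, not the patented circuit: the function name `mac_shift`, the 16-bit source width, the 32-bit accumulator, and the interpretation of the shift identifier as an arithmetic right-shift amount are all hypothetical, since the abstract does not fix operand widths or shift direction.

```c
#include <assert.h>
#include <stdint.h>

/* Portable arithmetic right shift (rounds toward negative infinity).
 * Right-shifting a negative signed value is implementation-defined in C,
 * so negative inputs are handled through their bitwise complement. */
static int32_t asr32(int32_t x, unsigned shift)
{
    return (x >= 0) ? (x >> shift) : ~((~x) >> shift);
}

/* Hypothetical model of a multiply-accumulate with shift:
 *   result = acc + ((a * b) >> shift)
 * Assumptions (not stated in the source): 16-bit sources, 32-bit
 * accumulator, shift identifier treated as a right-shift count. */
int32_t mac_shift(int32_t acc, int16_t a, int16_t b, unsigned shift)
{
    assert(shift < 32);

    /* A 16x16 product always fits in 32 bits. */
    int32_t product = (int32_t)a * (int32_t)b;

    /* Scale the product down before it reaches the accumulator. */
    int32_t scaled = asr32(product, shift);

    /* Add in 64 bits so the range check itself cannot overflow. */
    int64_t sum = (int64_t)acc + (int64_t)scaled;
    assert(sum >= INT32_MIN && sum <= INT32_MAX);
    return (int32_t)sum;
}
```

Scaling each product before the addition slows the growth of the accumulated value, which is the motivation the text gives for carrying a shift identifier in the instruction. For example, `mac_shift(10, 100, 200, 4)` adds 20000 >> 4 = 1250 to the accumulator instead of the full 20000.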

Description

Technical field

[0001] The present invention relates to the field of processors, in particular to a processor core and a processor whose instruction set includes multiply-accumulate instructions.

Background technique

[0002] In digital signal processing, the multiply-accumulate operation is a common operation, widely used in the Fast Fourier Transform (FFT), filter operations such as FIR/IIR, convolution operations, and matrix operations. Continuously accumulating a large number of multiplication results makes the accumulated value grow steadily until it can exceed the maximum range representable by the number of bits that store the accumulation result; that is, it overflows.

[0003] For the problem of multiply-accumulate result overflow, existing solutions include:

[0004] 1. Perform a saturation operation on the result, so that when the result overflows, the result can be represented by the value closest to the overflow value as much as po...
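The saturation approach named in item 1 can be illustrated with a small sketch. It assumes a 32-bit signed accumulator and uses a hypothetical helper name, `sat_add32`; the source does not specify widths or any particular implementation.

```c
#include <stdint.h>

/* Saturating 32-bit addition: when the exact sum does not fit, return the
 * representable value closest to it instead of wrapping around.
 * The name and the 32-bit width are assumptions made for illustration. */
int32_t sat_add32(int32_t a, int32_t b)
{
    /* Compute the exact sum in a wider type so it cannot overflow. */
    int64_t sum = (int64_t)a + (int64_t)b;

    if (sum > INT32_MAX) {
        return INT32_MAX;   /* clamp positive overflow */
    }
    if (sum < INT32_MIN) {
        return INT32_MIN;   /* clamp negative overflow */
    }
    return (int32_t)sum;
}
```

Saturation keeps the result at the nearest representable value, but every accumulation step after the clamp still loses information, which is why the text treats overflow as a problem that saturation alone only partly addresses.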


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/10
CPC: G06F17/10, G06F7/5443, G06F9/3001, G06F9/30014, G06F9/30098, G06F9/30145, G06F9/30094, G06F7/523
Inventor: 罗嘉蕙, 陈志坚, 郭宇波, 张文蒙
Owner: C SKY MICROSYST CO LTD