Low-energy-consumption high-precision approximate parallel fixed-width multiplication and accumulation device

A multiplication-accumulation and high-precision technology, which is applied in the field of low-power consumption and high-precision approximate fixed-width multiplication-accumulation devices, can solve the problems of failing to achieve an effective balance between precision and hardware, rough approximation methods, etc., and shorten the length of the critical path and reduce the number of circuits. Complexity, the effect of increasing parallelism

Active Publication Date: 2022-06-21
NANJING UNIV OF AERONAUTICS & ASTRONAUTICS
View PDF6 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The paper "A High-Performance and Energy-Efficient FIRAdaptive Filter Using Approximate Distributed Arithmetic Circuits" published in IEEE TRANSACTIONS ONCIRCUITS AND SYSTEMS discloses an adaptive filter design method based on a distributed algorithm, in which the error calculation module and parallel multiplication accumulation The design idea of ​​the unit is consistent, but the approximation method is relatively rough, failing to achieve an effective balance between precision and hardware overhead

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Low-energy-consumption high-precision approximate parallel fixed-width multiplication and accumulation device
  • Low-energy-consumption high-precision approximate parallel fixed-width multiplication and accumulation device
  • Low-energy-consumption high-precision approximate parallel fixed-width multiplication and accumulation device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0049] The approximate 4_2 compressor formula used in a one-stage partial product compression circuit is: , ; Precise adders include precise full adders and precise half adders. The sign compensation bit of the partial product of each row is not included in the compression tree, and the error is reduced by the constant compensation method.

[0050] As a further optimization scheme of this embodiment, the two-level partial product compression circuit performs unified processing on the sign bit when the input symbol is determined: on the premise that only the value bits are reserved, the lowest bit compression tree of the first-level partial product compressor Add "111" to the high bit of any output, and add "110" to the high bit of any output of the second-lowest and second-highest compression trees, such as image 3 shown.

[0051] As a further optimization scheme of this embodiment, the second-level partial product compressor includes a constant compensation part for tru...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a low-energy-consumption high-precision approximate parallel fixed-width multiplication and accumulation device which comprises an input truncation compensation circuit, a base-8 Booth encoder and decoder circuit, a first-stage partial product compression circuit, a second-stage partial product compression circuit and a carry lookahead adder circuit. Wherein in the first-stage partial product compression circuit, a Wallace tree with the weight is cut off at a low position, an approximate 42 compressor is used at a second low position, and an accurate compressor is used at a high position; the second-stage partial product compression circuit uses a precise compressor and comprises a probability constant compensation part which is used for compensating for first-stage partial product truncation, compensating for errors generated by using an approximate 42 compressor and compensating for second-stage partial product truncation. According to the method, power consumption is reduced by using truncation and approximation methods, hardware overhead is reduced, a probability constant compensation strategy is adopted for errors, and high precision is maintained.

Description

technical field [0001] The invention relates to the technical field of approximate arithmetic operation circuit design, in particular to a low-energy-consumption high-precision approximate parallel fixed-width multiplication and accumulation device. Background technique [0002] Since 2007, a series of semiconductor laws such as Moore's Law and Dennard's scaling law have gradually become invalid, and it has become very difficult to continuously improve the performance of chips while maintaining the same energy consumption. And nowadays, the importance of big data processing and artificial intelligence continues to increase. These applications require massive data and complex calculations, which put forward higher requirements for energy-efficient and high-performance general-purpose computing engines and application-specific integrated circuits. Many existing applications such as pattern recognition, video processing, and data mining have their own fault-tolerant capabilitie...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F7/544
CPCG06F7/5443Y02D10/00
Inventor 崔子英陈珂刘伟强崔益军王成华吴比
Owner NANJING UNIV OF AERONAUTICS & ASTRONAUTICS
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products