Block floating point computations using reduced bit-width vectors

A block floating-point vector technique, applied in fields such as calculations using number system representations and calculations using non-contact manufacturing equipment, that addresses problems such as the adverse effect of reduced precision on accuracy.
CN112074806A · Pending · Publication Date: 2020-12-11 · MICROSOFT TECH LICENSING LLC

Patent Information

Authority / Receiving Office
CN · China
Current Assignee / Owner
MICROSOFT TECH LICENSING LLC
Publication Date
2020-12-11

Abstract

A system for block floating point computation in a neural network receives a block floating point number comprising a mantissa portion. A bit-width of the block floating point number is reduced by decomposing the block floating point number into a plurality of numbers each having a mantissa portion with a bit-width that is smaller than a bit-width of the mantissa portion of the block floating point number. One or more dot product operations are performed separately on each of the plurality of numbers to obtain individual results, which are summed to generate a final dot product value. The final dot product value is used to implement the neural network. The reduced bit width computations allow higher precision mathematical operations to be performed on lower-precision processors with improved accuracy.
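
The decomposition described in the abstract can be illustrated with a minimal Python sketch. It is not taken from the patent text: the function names (`decompose`, `bfp_dot_reduced`), the two-way high/low split, and the example bit-widths are all assumptions chosen for illustration. Each vector is modeled as a list of signed integer mantissas that share one exponent. Every mantissa is split into a high part and a low part, the four narrower dot products are computed separately, and the shifted partial results are summed.

```python
from typing import List, Tuple


def decompose(mantissas: List[int], low_bits: int) -> Tuple[List[int], List[int]]:
    """Split each mantissa m into (high, low) so that m == (high << low_bits) + low.

    Assumption for this sketch: the low part is an unsigned low_bits-wide value
    and the high part carries the sign (Python's >> is an arithmetic shift).
    """
    mask = (1 << low_bits) - 1
    high = [m >> low_bits for m in mantissas]
    low = [m & mask for m in mantissas]
    return high, low


def dot(xs: List[int], ys: List[int]) -> int:
    """Integer dot product of two equal-length mantissa vectors."""
    return sum(x * y for x, y in zip(xs, ys))


def bfp_dot_reduced(a_mant: List[int], a_exp: int,
                    b_mant: List[int], b_exp: int,
                    low_bits: int) -> float:
    """Dot product of two block floating-point vectors via narrower mantissas."""
    a_hi, a_lo = decompose(a_mant, low_bits)
    b_hi, b_lo = decompose(b_mant, low_bits)

    # Each partial dot product only multiplies reduced bit-width operands.
    # The second tuple element is the left shift that restores its weight.
    partials = [
        (dot(a_hi, b_hi), 2 * low_bits),
        (dot(a_hi, b_lo), low_bits),
        (dot(a_lo, b_hi), low_bits),
        (dot(a_lo, b_lo), 0),
    ]

    # Sum the individual results to obtain the final mantissa-level value,
    # then apply the two shared exponents once.
    mantissa_sum = sum(p << shift for p, shift in partials)
    return mantissa_sum * 2.0 ** (a_exp + b_exp)


# Hypothetical example data: 8-bit signed mantissas, one shared exponent per vector.
a_mant, a_exp = [93, -57, 120, -8], -7
b_mant, b_exp = [-101, 44, 17, 66], -6

reduced = bfp_dot_reduced(a_mant, a_exp, b_mant, b_exp, low_bits=4)
full_width = dot(a_mant, b_mant) * 2.0 ** (a_exp + b_exp)
print(reduced, full_width)
```

Under these assumptions the split is lossless, so the reduced bit-width result matches the full-width dot product exactly. Any rounding or truncation of the partial products in a real accelerator would be an additional design choice not shown here.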

Description

Background art

[0001] The block floating-point number format allows dynamic range and precision to be scaled independently. Reducing the precision can increase the performance of a processor (such as a hardware accelerator), but reduced precision may affect system accuracy. For example, the block floating-point number format can be used for neural networks, which can be implemented in many application areas for tasks such as computer vision, robotics, speech recognition, medical image processing, computer games, augmented reality, and virtual reality. While reduced precision can improve the performance of different neural network functions (including the speed at which classification and regression tasks are performed for object recognition, lip reading, speech recognition, detecting unusual transactions, text prediction, and many others), accuracy may be adversely affected.

Summary of the invention

[0002] This Summary is provided t...
