Zero Coefficient Skipping Convolution Neural Network Engine

a neural network engine and zero coefficient technology, applied in the field of system and method for performing matrix convolution, can solve the problems of high computation and memory bandwidth of many machine learning applications using convolutional neural networks (cnn)
US20180046898A1Inactive Publication Date: 2018-02-15VIVANTE CORPORATION

Patent Information

Authority / Receiving Office
US ยท United States
Current Assignee / Owner
VIVANTE CORPORATION
Publication Date
2018-02-15
Estimated Expiration
Not applicable ยท inactive patent

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

A convolution engine, such as a convolution neural network, operates efficiently with respect to sparse kernels by implementing zero skipping. An input tile is loaded and accumulated sums are calculated for the input tile for non-zero coefficients by shifting the tile according to a row and column index of the coefficient in the kernel. Each coefficient is applied individually to tile and the result written to an accumulation buffer before moving to the next non-zero coefficient. A 3D or 4D convolution may be implemented in this manner with separate regions of the accumulation buffer storing accumulated sums for different indexes along one dimension. Images are completely processed and results for each image are stored in the accumulation buffer before moving to the next image.
Need to check novelty before this filing date? Find Prior Art

Description

RELATED APPLICATION

[0001] This application claims the benefit of U.S. Provisional Application Ser. No. 62 / 373,518 entitled ZERO COEFFICIENT SKIPPING CONVOLUTION NEURAL NETWORK ENGINE and filed Aug. 11, 2016, which is hereby incorporated herein by reference in its entirety.BACKGROUNDField of the Invention

[0002] This invention relates to systems and methods for performing matrix convolution, such as for use in implementing a convolution neural network.Background of the Invention

[0003] Many machine learning applications using Convolutional Neural Networks (CNN) require very high computation and memory bandwidth. One way to reduce the requirement is to zero prune the coefficients and skip the computation when a coefficient is zero. These existing software and hardware optimization techniques are based on matrix multiplications. One example is the Sparse Matrix Multiplication technique described in Sparse Convolutional Neural Networks (Baoyuan Liu, Min Wangl, Hassan Forooshl, Marshall Tappe...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More