Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Zero Coefficient Skipping Convolution Neural Network Engine

a neural network engine and zero coefficient technology, applied in the field of system and method for performing matrix convolution, can solve the problems of high computation and memory bandwidth of many machine learning applications using convolutional neural networks (cnn)

Inactive Publication Date: 2018-02-15
VIVANTE CORPORATION
View PDF0 Cites 22 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Benefits of technology

The present invention is related to systems and methods for performing matrix convolution in a more efficient way, particularly for use in Convolutional Neural Networks (CNN). The invention provides an improved approach for implementing CNN by incorporating a zero coefficiency skimming technique and optimizing the use of matrix multiplication. The invention also includes methods for computing convolution of a kernel with input data, as well as computer program products and computer systems for carrying out the invention. The technical effects of the invention include reducing the requirement for high computation and memory bandwidth, improving the efficiency of implementing CNN, and reducing the latency of the system.

Problems solved by technology

Many machine learning applications using Convolutional Neural Networks (CNN) require very high computation and memory bandwidth.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Zero Coefficient Skipping Convolution Neural Network Engine
  • Zero Coefficient Skipping Convolution Neural Network Engine
  • Zero Coefficient Skipping Convolution Neural Network Engine

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0010]It will be readily understood that the components of the present invention, as generally described and illustrated in the Figures herein, could be arranged and designed in a wide variety of different configurations. Thus, the following more detailed description of the embodiments of the invention, as represented in the Figures, is not intended to limit the scope of the invention, as claimed, but is merely representative of certain examples of presently contemplated embodiments in accordance with the invention. The presently described embodiments will be best understood by reference to the drawings, wherein like parts are designated by like numerals throughout.

[0011]Embodiments in accordance with the present invention may be embodied as an apparatus, method, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, resident software, micro-code, etc.), or an embodime...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A convolution engine, such as a convolution neural network, operates efficiently with respect to sparse kernels by implementing zero skipping. An input tile is loaded and accumulated sums are calculated for the input tile for non-zero coefficients by shifting the tile according to a row and column index of the coefficient in the kernel. Each coefficient is applied individually to tile and the result written to an accumulation buffer before moving to the next non-zero coefficient. A 3D or 4D convolution may be implemented in this manner with separate regions of the accumulation buffer storing accumulated sums for different indexes along one dimension. Images are completely processed and results for each image are stored in the accumulation buffer before moving to the next image.

Description

RELATED APPLICATION[0001]This application claims the benefit of U.S. Provisional Application Ser. No. 62 / 373,518 entitled ZERO COEFFICIENT SKIPPING CONVOLUTION NEURAL NETWORK ENGINE and filed Aug. 11, 2016, which is hereby incorporated herein by reference in its entirety.BACKGROUNDField of the Invention[0002]This invention relates to systems and methods for performing matrix convolution, such as for use in implementing a convolution neural network.Background of the Invention[0003]Many machine learning applications using Convolutional Neural Networks (CNN) require very high computation and memory bandwidth. One way to reduce the requirement is to zero prune the coefficients and skip the computation when a coefficient is zero. These existing software and hardware optimization techniques are based on matrix multiplications. One example is the Sparse Matrix Multiplication technique described in Sparse Convolutional Neural Networks (Baoyuan Liu, Min Wangl, Hassan Forooshl, Marshall Tappe...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06N3/04G06F17/16
CPCG06F17/16G06N3/04G06N3/063G06T1/20G06F7/5443G06F2207/4824H04N19/42G06N3/082G06F17/153G06N3/045H04N19/132G06F7/764
Inventor LO, MANKIT
Owner VIVANTE CORPORATION
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products