Zero Coefficient Skipping Convolution Neural Network Engine

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
a neural network engine and zero coefficient technology, applied in the field of system and method for performing matrix convolution, can solve the problems of high computation and memory bandwidth of many machine learning applications using convolutional neural networks (cnn)

Inactive Publication Date: 2018-02-15

VIVANTE CORPORATION

View PDF0 Cites 22 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Benefits of technology

The present invention is related to systems and methods for performing matrix convolution in a more efficient way, particularly for use in Convolutional Neural Networks (CNN). The invention provides an improved approach for implementing CNN by incorporating a zero coefficiency skimming technique and optimizing the use of matrix multiplication. The invention also includes methods for computing convolution of a kernel with input data, as well as computer program products and computer systems for carrying out the invention. The technical effects of the invention include reducing the requirement for high computation and memory bandwidth, improving the efficiency of implementing CNN, and reducing the latency of the system.

Problems solved by technology

Many machine learning applications using Convolutional Neural Networks (CNN) require very high computation and memory bandwidth.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0010]It will be readily understood that the components of the present invention, as generally described and illustrated in the Figures herein, could be arranged and designed in a wide variety of different configurations. Thus, the following more detailed description of the embodiments of the invention, as represented in the Figures, is not intended to limit the scope of the invention, as claimed, but is merely representative of certain examples of presently contemplated embodiments in accordance with the invention. The presently described embodiments will be best understood by reference to the drawings, wherein like parts are designated by like numerals throughout.

[0011]Embodiments in accordance with the present invention may be embodied as an apparatus, method, or computer program product. Accordingly, the present invention may take the form of an entirely hardware embodiment, an entirely software embodiment (including firmware, resident software, micro-code, etc.), or an embodime...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

A convolution engine, such as a convolution neural network, operates efficiently with respect to sparse kernels by implementing zero skipping. An input tile is loaded and accumulated sums are calculated for the input tile for non-zero coefficients by shifting the tile according to a row and column index of the coefficient in the kernel. Each coefficient is applied individually to tile and the result written to an accumulation buffer before moving to the next non-zero coefficient. A 3D or 4D convolution may be implemented in this manner with separate regions of the accumulation buffer storing accumulated sums for different indexes along one dimension. Images are completely processed and results for each image are stored in the accumulation buffer before moving to the next image.

Description

RELATED APPLICATION[0001]This application claims the benefit of U.S. Provisional Application Ser. No. 62 / 373,518 entitled ZERO COEFFICIENT SKIPPING CONVOLUTION NEURAL NETWORK ENGINE and filed Aug. 11, 2016, which is hereby incorporated herein by reference in its entirety.BACKGROUNDField of the Invention[0002]This invention relates to systems and methods for performing matrix convolution, such as for use in implementing a convolution neural network.Background of the Invention[0003]Many machine learning applications using Convolutional Neural Networks (CNN) require very high computation and memory bandwidth. One way to reduce the requirement is to zero prune the coefficients and skip the computation when a coefficient is zero. These existing software and hardware optimization techniques are based on matrix multiplications. One example is the Sparse Matrix Multiplication technique described in Sparse Convolutional Neural Networks (Baoyuan Liu, Min Wangl, Hassan Forooshl, Marshall Tappe...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06N3/04G06F17/16

CPCG06F17/16G06N3/04G06N3/063G06T1/20G06F7/5443G06F2207/4824H04N19/42G06N3/082G06F17/153G06N3/045H04N19/132G06F7/764

Inventor LO, MANKIT

Owner VIVANTE CORPORATION

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Zero Coefficient Skipping Convolution Neural Network Engine

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Benefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology