Deep learning accelerator and method for accelerating deep learning operation

A deep learning and accelerator technology, applied in neural learning methods, instruments, biological neural network models, etc., can solve the problems of heavy computational workload of neural networks and achieve the effect of improving system performance

Pending Publication Date: 2019-10-11
MEDIATEK INC
View PDF6 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] However, the workload of neural network computation is heavy

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deep learning accelerator and method for accelerating deep learning operation
  • Deep learning accelerator and method for accelerating deep learning operation
  • Deep learning accelerator and method for accelerating deep learning operation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0016] The following description is a preferred embodiment of the present invention, which is only used to illustrate the technical features of the present invention, but not to limit the scope of the present invention. While certain terms are used throughout the specification and claims to refer to specific elements, those skilled in the art should understand that manufacturers may use different names for the same element. Therefore, the specification and claims do not use the difference in name as the way to distinguish components, but use the difference in function of the components as the basis for the difference. The terms "element", "system" and "apparatus" used in the present invention may refer to a computer-related entity, where the computer may be hardware, software, or a combination of hardware and software. The terms "comprising" and "comprising" mentioned in the following description and claims are open terms, so they should be interpreted as "including, but not l...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention provides a deep learning accelerator comprising: a plurality of processing components (PE) divided into a plurality of PE groups for performing a computation of a convolution layer by applying multidimensional weights to input activations to generate output activations; the scheduler is used for scheduling the input data in the input activation and the non-zero weight in themulti-dimensional weight to the processing component according to a control mask; wherein the control mask specifies a position of a zero weight of the multi-dimensional weights, and the plurality ofPE groups generate output data of each output channel in the output activation and share the same control mask specifying the same position of the zero weight. Correspondingly, the invention further provides a method for accelerating the deep learning operation. By adopting the invention, the system performance can be improved.

Description

technical field [0001] The present application generally relates to a deep learning computing architecture, and more particularly, to a deep learning accelerator and a method for accelerating deep learning operations. Background technique [0002] Due to its superior performance, deep learning has been widely used in computer vision, speech recognition, natural language processing, bioinformatics and other fields. Deep learning is a branch of machine learning that uses artificial neural networks that contain multiple hidden layers. A type of artificial neural network called a convolutional neural network (CNN) has been used for deep learning on large datasets such as image data. [0003] However, neural networks are computationally intensive. Most neural network calculations involve multiplication and addition calculations. For example, the core computation of a convolutional neural network (CNN) is convolution, which involves high-order nested loops. For feature extract...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/04
CPCG06N3/045G06N3/063G06N3/08G06F9/3877G06F9/5027
Inventor 汪威定李翰林郑志崇王绍宇
Owner MEDIATEK INC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products