Arithmetic device for neural network, chip, equipment and related method

A computing device and neural network technology, applied in the field of neural networks, can solve the problems of low data processing energy efficiency, multiple data moving operations, and difficult development, and achieve the goal of reducing data moving operations, high processing parallelism, and reducing design complexity. Effect

Inactive Publication Date: 2018-10-23
SZ DJI TECH CO LTD
View PDF8 Cites 11 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Among them, the GPU-based calculation process requires more data movement operations in the entire calculation process, resulting in relatively low energy efficiency in data processing.
However, based on the operation process of the neural network special processor, the instruction set architecture of the neural network special processor requires complex control logic to complete tasks such as instruction fetching and decoding, resulting in a large chip area required for the control logic. In addition, the neural network Dedicated processors require tool chain support such as compilers, making development difficult

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Arithmetic device for neural network, chip, equipment and related method
  • Arithmetic device for neural network, chip, equipment and related method
  • Arithmetic device for neural network, chip, equipment and related method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] In deep convolutional neural networks, the hidden layers can be convolutional layers. A set of weight values ​​corresponding to a convolutional layer is called a filter, also known as a convolution kernel. Both the filter and the input eigenvalues ​​are represented as a multidimensional matrix. Correspondingly, a filter represented as a multidimensional matrix is ​​also called a filter matrix, and an input eigenvalue represented as a multidimensional matrix is ​​also called an input characteristic matrix. The operation of the convolution layer is called a convolution operation, which refers to the inner product operation of a part of the eigenvalues ​​of the input feature matrix and the weight value of the filter matrix.

[0038] The operation process of each convolutional layer in the deep convolutional neural network can be compiled into software, and then by running the software in the computing device, the output result of each layer of the network is obtained, that...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

Provided are an arithmetic device for a neural network, a chip and equipment, wherein the arithmetic device includes a control unit and a multiplication-accumulation unit, the multiplication-accumulation unit includes a filter register and a plurality of calculation units, and the filter register is connected with the plurality of calculation units; the control unit is used for generating controlinformation and sending the control information to the calculation units; the filter register is used for caching a filter weight value to be subjected to multiplication-accumulation operation; and the calculation units are used for caching input characteristic values to be subjected to multiplication-accumulation operation, and performing multiplication-accumulation operation on the filter weightvalue and the input feature values according to the received control information. All the calculation units are controlled through one control unit, and thus the design complexity of the control unitcan be lowered; and one filter register is shared by the plurality of calculation units, and thus a required cache size can be reduced.

Description

technical field [0001] The present application relates to the field of neural networks, and more specifically, to a computing device, chip, equipment and related methods for neural networks. Background technique [0002] A deep neural network is a machine learning algorithm that is widely used in computer vision tasks such as object recognition, object detection, and semantic segmentation of images. A deep neural network consists of an input layer, several hidden layers, and an output layer. The output of each layer in a deep neural network is the sum of the products of a set of weight values ​​and their corresponding input feature values ​​(i.e. multiply-accumulate). The output of each hidden layer is also called the output feature value, which serves as the input feature value of the next hidden layer or output layer. [0003] A deep convolutional neural network is a deep neural network in which at least one hidden layer operation is a convolution operation. In the curr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F7/52G06N3/04
CPCG06F7/52G06N3/045G06F7/5443G06N3/063G06N3/04
Inventor 韩峰李鹏谷骞
Owner SZ DJI TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products