An apparatus and operating method for performing LSTM neural network operation

A neural network and bias technology, applied in the field of neural network computing, can solve a large number of problems, performance bottlenecks, etc., and achieve the effects of strong scalability, reduced decoding overhead, and reduced number of instructions

Active Publication Date: 2020-06-09
SHANGHAI CAMBRICON INFORMATION TECH CO LTD
View PDF6 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Since the GPU is a device specially used to perform graphics and image operations and scientific computing, it does not provide special support for LSTM networks, and still requires a lot of front-end decoding work to perform LSTM network operations, which will bring a lot of additional overhead
In addition, the GPU has only a small on-chip cache, and the relevant parameters used in the LSTM network need to be repeatedly moved from off-chip, and the off-chip bandwidth has also become a performance bottleneck.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An apparatus and operating method for performing LSTM neural network operation
  • An apparatus and operating method for performing LSTM neural network operation
  • An apparatus and operating method for performing LSTM neural network operation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with specific embodiments and with reference to the accompanying drawings.

[0037] In this specification, the various embodiments described below to describe the principles of the present invention are illustrative only and should not be construed as limiting the scope of the invention in any way. The following description with reference to the accompanying drawings is provided to assist in a comprehensive understanding of exemplary embodiments of the present invention as defined by the claims and their equivalents. The following description includes numerous specific details to aid in understanding, but these should be considered as examples only. Accordingly, those of ordinary skill in the art will recognize that various changes and modifications of the embodiments described herein can be made ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

An apparatus for performing LSTM neural network operations and an operating method are provided. The device comprises a direct memory access unit, an instruction cache unit, a controller unit, a plurality of data cache units arranged in parallel and a plurality of data processing modules arranged in parallel, wherein the plurality of data processing modules are in one-to-one correspondence with the data cache units, and are used for obtaining input data from the corresponding data cache units and weights and offsets required during operation, and performing LSTM neural network operation; and parallel operation is executed among the plurality of data processing modules. According to the method, special instructions are adopted for operation, the number of instructions required by operationis greatly reduced, and decoding expenditure is reduced; caching the weight and the bias, so that the data transmission overhead is reduced; the method is not limited to the specific application field, can be used in the fields of voice recognition, text translation, music synthesis and the like, and is high in expandability; and the plurality of data processing modules operate in parallel, so that the operation speed of the LSTM network is remarkably improved.

Description

technical field [0001] The present invention relates to the technical field of neural network operations, and more particularly relates to a device and an operation method for performing LSTM neural network operations. Background technique [0002] Long short-term memory network (LSTM) is a time recurrent neural network (RNN). Due to the unique structural design of the network itself, LSTM is suitable for processing and predicting important events with very long intervals and delays in time series. Compared to traditional recurrent neural networks, LSTM networks show better performance and are well suited for learning from experience to classify, process, and predict time series when there is an unknown amount of time between important events. At present, LSTM networks are widely used in many fields such as speech recognition, video description, machine translation and automatic music synthesis. At the same time, with the continuous deepening of research on LSTM networks, t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/04G06N3/063
CPCG06N3/063G06F13/28G06F9/3885G06F9/3802G06F9/30036G06N3/048G06N3/044
Inventor 不公告发明人
Owner SHANGHAI CAMBRICON INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products