Hardware acceleration implementation system and method for RNN forward propagation model based on transverse pulsation array
A systolic array, forward propagation technology, applied in the field of hardware acceleration implementation system of RNN forward propagation model, can solve the problems of non-configurability, poor flexibility, and inability to meet the computing network.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0048] The hardware acceleration implementation system of the RNN forward propagation model based on the transverse systolic array, such as figure 1 As shown, it includes a data control unit, a forward propagation calculation unit and a data cache unit, the data control unit is used to receive and generate control signals, and at the same time control the transmission and calculation of data between modules; the forward propagation calculation The unit is used to transmit the data into the transverse pulsation array, sequentially calculate the hidden layer neurons and the output layer neurons, and complete the RNN forward model operation; the data cache unit is used to provide storage space for participating in the calculation and obtaining the calculation results, The data are all 16-bit fixed-point numbers.
[0049] The data transmitted by the forward propagation calculation unit to the lateral pulsation array at least includes an input vector x, weight matrices U, W, V and ...
Embodiment 2
[0061] A method for implementing hardware acceleration of the RNN forward propagation model based on a transverse systolic array includes the following steps:
[0062] S1. Initialization step: configure network parameters, the parameters at least include the number of nodes in the input layer, hidden layer, and output layer, time series length and batches to be processed.
[0063] S2, the calculation step of the hidden layer neurons: the data is passed into the horizontal pulsation array, and the hidden layer neurons are calculated based on the horizontal pulsation array; the weights in the calculation are designed in blocks, and the weight matrix calculated by the hidden layer is divided into blocks by rows , calculate h t =Φ(Ux t +Wh t-1 +b), where x t Input vector for the current moment, h t-1 input vector x for the hidden layer for the previous time instant t-1 The excitation value of the RNN network is generated through the matrix multiplication vector and vector sum...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com