Deep Transformer cascade neural network model compression algorithm
A neural network model and compression algorithm technology, applied in the field of deep Transformer cascaded neural network model compression algorithm, to achieve the effect of reducing model size and low computational cost
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0020] The technical solutions of the present invention will be further described below using preferred embodiments of the present invention in conjunction with the accompanying drawings, but the present invention is not limited to these embodiments.
[0021] See attached figure 1 , a deep Transformer cascaded neural network model compression algorithm provided by an embodiment of the present invention, comprising:
[0022] Step A: Pre-train the deep Transformer cascaded neural network on the text data set. The pre-training is specifically to perform self-supervised pre-training on the deep Transformer cascaded neural network model on the unlabeled text data set. The training task is to mask words Prediction and text prediction before and after, update the parameters of the model through the backpropagation algorithm and the gradient descent algorithm, and obtain the pre-training model.
[0023] Step B: Divide the Transformer cascaded model into several modules in sequence. ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


