A method and apparatus for processing a sequence model
Patent Information
- Authority / Receiving Office
- CN ยท China
- Patent Type
- Applications(China)
- Current Assignee / Owner
- TENCENT TECH (SHENZHEN) CO LTD
- Publication Date
- 2019-03-29
Smart Images

Figure 1 
Figure 2 
Figure 3
Abstract
Description
technical field
[0001] The invention relates to the field of computer technology, in particular to a sequence model processing method and device. Background technique
[0002] The sequence model can be used to realize various tasks according to the input sequence data, and the sequence model can be realized based on the self-attention neural network (Self-Attention Network, SAN). The sequence model may be a neural machine translation model. For example, there is a Transformer model in the prior art. The Transformer model is based on the aforementioned self-attention neural network and is formed by stacking multiple layers of self-attention neural networks.
[0003] Compared with the neural machine translation model based on Recurrent Neural Network (RNN) in the prior art, the Transformer model uses a self-attention neural network instead of a recurrent neural network to model sequence dependencies. While RNN operates sequentially using loops (that is, the output of each ste...