Cyclic neural network-based training method for Mongolian language models
A technology of recurrent neural network and language model, applied in the training field of Mongolian language model, can solve the problems of lack of semantic information injection, lack of long-distance information description ability, etc.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0030] 1. Model construction
[0031] The MLMRNN model structure includes an input layer, a hidden layer and an output layer. The input layer is x(t), the hidden layer is s(t), and the output layer is y(t). At time t, the input vector x(t) includes three parts, namely w(t), s(t-1) and f(t), where w(t) represents the one-hot word vector of the word input at time t Indicates; s(t-1) is the output of the hidden layer at time t-1; f(t) is the context word vector trained by Skip-Gram at time t, and its dimension is much smaller than |V|; the hidden layer is s(t ); the output vector is represented by y(t), which includes two parts, one part is a category layer neuron, and the other part is a Mongolian word neuron, c(t) is the category layer of the clustering of word vectors carried out on the vocabulary, output The vector y(t) represents the probability of the next word w(t+1).
[0032] In the network, U, W, and F are the weight matrix between the input layer and the hidden layer,...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


