Kalman filter word vector learning method based on Diesel process

A Kalman filter, Kalman filter technology, applied in complex mathematical operations, character and pattern recognition, instruments, etc., can solve the curse of dimensionality, can not well describe the similarity of words and words and other problems
CN108446273BActive Publication Date: 2021-07-20HRG INT INST FOR RES & INNOVATION

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Patents(China)
Current Assignee / Owner
HRG INT INST FOR RES & INNOVATION
Publication Date
2021-07-20

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

A Kalman filter word vector learning method based on Diesel process, the method includes: training and preprocessing the corpus, generating an LDS language model system, initializing the system parameters, assuming that the process noise satisfies a normal distribution, defining the aggregation class theta t =(μ t ,∑ t ), μ t For the frequency of word t in the corpus, calculate θ t The prior distribution of Dirichlet, the posterior distribution is calculated by Kalman filter derivation and Gibbs sampling estimation, the candidate clusters are extracted by MCMC sampling algorithm, the selection probability of the candidate clusters is calculated, and the candidate with the highest probability value is selected Choose the cluster as θ t , calculate the estimated value of the minimum mean square error of the clustering, substitute the calculation result into the LDS language model, train the model through the EM algorithm, make the model parameters stable, input the preprocessed corpus into the trained LDS language model, and pass Carl The Mann filter updates the formula in one step to compute the implicit vector representation.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The invention relates to the field of natural language processing, in particular, to a method for learning word vectors of Kalman filter based on Diesel process. Background technique

[0002] In natural language processing (NLP) related tasks, in order to hand over natural language to algorithms in machine learning, it is usually necessary to mathematicize the language first, because machines only recognize mathematical symbols. Vectors are things that people abstract from the natural world and hand them over to machines for processing. Word vectors are a way to mathematicize words in language.

[0003] One of the simplest word vector representations is One-hot Representation, which is to use a very long vector to represent a word. The length of the vector is the size of the dictionary. The vector component has only one 1, and the others are all 0 and 1 positions. corresponds to the position of the word in the dictionary. However, this kind of word v...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More