Word vector generation method and related equipment
Patent Information
- Authority / Receiving Office: CN · China
- Patent Type: Application (China)
- Current Assignee / Owner: BEIJING GRIDSUM TECH CO LTD
- Publication Date: 2020-05-26
Description
Technical field
[0001] The invention relates to the technical field of natural language processing, in particular to a method for generating word vectors and related equipment.
Background art
[0002] Text is a carrier of information and plays an important role in the development of society. For computers to handle natural language problems, discrete text must first be converted into a mathematical form. The simplest way is to use One-hot Representation, which converts each word into a |V|-dimensional vector, where |V| is the size of the vocabulary: the position corresponding to the word's index is 1, and all other positions are 0. In 2003, Yoshua Bengio et al. first applied neural networks to language models and proposed using Distributed Representation instead of the traditional One-hot Representation to represent word vectors, making word vectors not only computable but also meaningful. In 2013, Mikolov et al. proposed the Continuous Bag of Wo...
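The One-hot Representation described in paragraph [0002] can be illustrated with a minimal Python sketch. The vocabulary, the `one_hot` helper, and the `dot` helper below are hypothetical names chosen for illustration only; they do not appear in the patent text.

```python
def one_hot(word, vocabulary):
    """Return a |V|-dimensional vector with 1 at the word's index and 0 elsewhere."""
    index = vocabulary.index(word)   # position of the word in the vocabulary
    vector = [0] * len(vocabulary)   # |V| zeros
    vector[index] = 1                # mark the word's own position
    return vector


def dot(u, v):
    """Plain dot product of two equal-length vectors."""
    return sum(a * b for a, b in zip(u, v))


# Hypothetical toy vocabulary, so |V| = 6 here.
vocabulary = ["text", "is", "a", "carrier", "of", "information"]

carrier_vec = one_hot("carrier", vocabulary)
text_vec = one_hot("text", vocabulary)

print(carrier_vec)
print(dot(carrier_vec, text_vec))
```

In this sketch the vectors of any two different words have a dot product of 0, so the encoding carries no information about how similar two words are. This is one way to read the paragraph's remark that a distributed representation makes word vectors meaningful as well as computable.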