Word vector configuration method and device, storage medium and electronic device
Patent Information
- Authority / Receiving Office
- CN · China
- Current Assignee / Owner
- PING AN TECH (SHENZHEN) CO LTD
- Publication Date
- 2019-11-05
Abstract
Description
Technical Field
[0001] The present invention relates to the field of neural networks, and in particular to a word vector configuration method and device, a storage medium, and an electronic device.
Background
[0002] When processing text data, the most basic steps are usually word segmentation and word vector training (for example, with the word2vec method), after which subsequent tasks such as text comparison and classification are performed on the basis of the word vectors. In practice, the text to be processed often contains new words (unregistered words) that fall outside the word vector dictionary. The usual approach is to assign word vectors to unregistered words at random. However, a randomly assigned word vector does not use the semantic information of the new word, which reduces the accuracy of subsequent tasks.
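The conventional fallback described in [0002] can be sketched as follows. This is a minimal illustration, not the method of the invention: the toy dictionary, its values, the three-dimensional vectors, and the helper name `lookup_vector` are all assumptions made for the example.

```python
import random

# Toy word-vector dictionary; the words and values are hypothetical.
word_vectors = {
    "loan": [0.9, 0.1, 0.0],
    "credit": [0.8, 0.2, 0.1],
    "weather": [0.0, 0.1, 0.9],
}


def lookup_vector(word, vectors, dim=3, rng=None):
    """Return the stored vector, or assign a random one to an unregistered word."""
    if word in vectors:
        return vectors[word]
    rng = rng or random.Random()
    # Conventional fallback: random assignment. The new vector carries
    # no semantic information about the word it represents.
    vector = [rng.uniform(-1.0, 1.0) for _ in range(dim)]
    vectors[word] = vector  # remember it so later lookups are consistent
    return vector


rng = random.Random(0)  # fixed seed only to make the sketch repeatable
known = lookup_vector("loan", word_vectors, rng=rng)
unregistered = lookup_vector("microloan", word_vectors, rng=rng)
```

Here "microloan" is semantically close to "loan", but its randomly assigned vector has no guaranteed relationship to the vector of "loan", which is the weakness the paragraph above points out.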
[0003] Aiming at the above-mentioned problems existing in relate...