Word vector generation method based on Gaussian distribution
A Gaussian distribution and word vector technology, applied in the field of natural language processing, can solve problems such as inability to represent probability distribution and fixed number of word meanings, and achieve the effects of accelerating the text clustering process, good classification effect, and reducing the amount of calculation and communication
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0047] The invention provides a method for generating word vectors based on Gaussian distribution, which first preprocesses the corpus; secondly divides the corpus into contexts by using punctuation marks; then infers word meanings in combination with local and global information, and determines the mapping relationship between words and word meanings; Finally, the word vector is obtained by optimizing the objective function. The specific process of the present invention will be described in detail below with specific implementation methods in conjunction with the accompanying drawings.
[0048] Please refer to figure 1 , a method for generating word vectors based on Gaussian distribution, including the following steps:
[0049] S1. Obtain the training corpus, and preprocess the corpus; the specific method of preprocessing the corpus is: remove stop words and low-frequency words, restore part of speech, convert case, and form an effective corpus; in addition, use python for c...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com