New word discovery method and system based on word vector representation in massive texts
A new word discovery method using word vector technology, applied in fields such as special data processing applications, instruments, and biological neural network models. It addresses the high cost, poor portability, and complex statistical-indicator calculation of existing approaches, and achieves a simple, efficient implementation with a high accuracy rate.
Detailed Description of Embodiments
[0032] To make the purpose, technical solution, and advantages of the present invention clearer, specific embodiments of the invention are described clearly and completely below.
[0033] Figure 1 is a flowchart of a method for discovering new words based on word vector representation in massive texts, according to an embodiment of the present invention.
[0034] The method comprises: step S1, preprocessing the corpus of the new word discovery task; step S2, performing n-gram word string mining on the preprocessed corpus to obtain the n-gram candidate word strings in the corpus; and step S3, setting word vectors and pruning the n-gram candidate word strings according to the similarity between the word vectors of the words within each candidate string, to obtain new words.
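Steps S2 and S3 can be illustrated with a minimal Python sketch. It is not the patented implementation. The text here does not specify the similarity measure or the pruning rule, so the sketch assumes cosine similarity between adjacent words and a fixed threshold. The function names (`mine_ngrams`, `prune_by_similarity`), the parameters `max_n`, `min_count`, and `threshold`, and the toy two-dimensional vectors are all hypothetical. In practice the word vectors would be trained on the preprocessed corpus from step S1.

```python
from collections import Counter
from math import sqrt


def mine_ngrams(sentences, max_n=3, min_count=2):
    """Step S2 (sketch): count contiguous n-grams, 2 <= n <= max_n,
    over tokenized sentences and keep those seen at least min_count times."""
    counts = Counter()
    for tokens in sentences:
        for n in range(2, max_n + 1):
            for i in range(len(tokens) - n + 1):
                counts[tuple(tokens[i:i + n])] += 1
    return {gram: c for gram, c in counts.items() if c >= min_count}


def cosine(u, v):
    """Cosine similarity of two equal-length vectors (0.0 if either is zero)."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = sqrt(sum(a * a for a in u))
    norm_v = sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v) if norm_u and norm_v else 0.0


def prune_by_similarity(candidates, vectors, threshold=0.8):
    """Step S3 (sketch, assumed rule): keep a candidate only if every pair
    of adjacent words has vector similarity >= threshold."""
    kept = []
    for gram in candidates:
        if any(tok not in vectors for tok in gram):
            continue  # no vector available for some word; drop the candidate
        sims = [cosine(vectors[a], vectors[b]) for a, b in zip(gram, gram[1:])]
        if min(sims) >= threshold:
            kept.append(gram)
    return sorted(kept)


# Toy, already-tokenized corpus standing in for the output of step S1.
sentences = [
    ["deep", "learning", "is", "popular"],
    ["deep", "learning", "is", "useful"],
    ["we", "study", "deep", "learning"],
]

# Hypothetical hand-made vectors; real ones would be learned from the corpus.
vectors = {
    "deep": [1.0, 0.1],
    "learning": [0.9, 0.2],
    "is": [0.0, 1.0],
}

candidates = mine_ngrams(sentences)
new_words = prune_by_similarity(candidates, vectors)
print(new_words)
```

In this toy run, three candidate strings pass the frequency filter, and the similarity pruning discards the two that contain the function word "is", leaving only the string "deep learning". Whether a high or a low similarity should mark a word boundary is a design choice that the text available here does not settle; the threshold direction above is an assumption.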
[0035] First, in step S1, the corpus of the new word discovery task is preprocessed. The purpose of this embodiment is to find new words in the corpus of the new word d...


