A Short Text Clustering Method Based on Deep Semantic Path Search
A path search and clustering method technology, applied in text database clustering/classification, unstructured text data retrieval, special data processing applications, etc., can solve problems such as semantic interference, and achieve the effect of high clustering accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0049] All the features disclosed in this specification, except mutually exclusive features and / or steps, can be combined in any way.
[0050] The present invention will be described in detail below in conjunction with the accompanying drawings.
[0051] A short text clustering method based on deep semantic path search, comprising the following steps:
[0052] Step 1: Preprocessing the general corpus to obtain the vocabulary corresponding to the corpus;
[0053] The preprocessing method is: the sentence in the corpus is subjected to case conversion and word segmentation processing; the words that appear more than N times in the corpus are selected; the words are used as the vocabulary corresponding to the corpus; where N represents words Frequency threshold.
[0054] Step 2: The method of using the hyperparameters of word2vec to establish the real number vector (Embedding) of words is:
[0055] Step S301: mapping the word into a K-dimensional real number vector, and using t...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 

