Document similarity recognition method and device based on latent semantic analysis
A document similarity recognition technique based on latent semantic analysis, applied in semantic analysis, text database querying, and natural language data processing. It addresses the problem of natural language attributes being lost and achieves good recognition results.
Detailed Description of the Embodiments
[0026] To clearly illustrate the technical features of the present solution, the present application is described in detail below through specific embodiments and with reference to the accompanying drawings.
[0027] The first embodiment, as shown in Figure 1, includes the following steps:
[0028] S101. Build an original document library, where the original document library includes several original texts;
[0029] S102. Preprocess each original text to obtain the bag-of-words vector corresponding to that original text;
[0030] The preprocessing proceeds as follows: first, obtain the bag-of-words model;
[0031] Build a word-text matrix, and assign a weight to each word in the matrix using the TF-IDF method;
[0032] Determine a threshold, and reduce the dimensionality using singular value decomposition (SVD);
[0033] Obtain the final word-text matrix, from which the bag-of-words vector of each text is taken;
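The preprocessing of S102 can be sketched roughly as follows. This is only an illustrative sketch, not the implementation of the application: the function names `tfidf_matrix` and `reduce_with_svd`, the whitespace tokenization, the specific TF-IDF formula (relative term frequency times the logarithm of inverse document frequency), and the reading of the "threshold" as a fraction of cumulative squared singular values to retain are all assumptions introduced for illustration.

```python
import math
from collections import Counter

import numpy as np


def tfidf_matrix(docs):
    """Build a word-text matrix (rows: words, columns: texts) with TF-IDF weights.

    Assumption: texts are tokenized by lowercasing and splitting on whitespace.
    """
    tokenized = [doc.lower().split() for doc in docs]
    vocab = sorted({w for toks in tokenized for w in toks})
    index = {w: i for i, w in enumerate(vocab)}
    n_docs = len(docs)
    # Document frequency: number of texts in which each word occurs.
    df = Counter(w for toks in tokenized for w in set(toks))
    matrix = np.zeros((len(vocab), n_docs))
    for j, toks in enumerate(tokenized):
        for w, count in Counter(toks).items():
            tf = count / len(toks)
            idf = math.log(n_docs / df[w])
            matrix[index[w], j] = tf * idf
    return vocab, matrix


def reduce_with_svd(matrix, energy_threshold=0.9):
    """Reduce the word-text matrix with a truncated SVD.

    Assumption: the threshold is the fraction of the total squared
    singular values that the kept components must reach.
    Returns one reduced vector per text (as rows) and the rank k kept.
    """
    u, s, vt = np.linalg.svd(matrix, full_matrices=False)
    total = float(np.sum(s ** 2))
    if total == 0.0:
        # Degenerate case: every weight is zero, nothing to decompose.
        return np.zeros((matrix.shape[1], 1)), 1
    energy = np.cumsum(s ** 2) / total
    # Smallest k whose cumulative energy reaches the threshold.
    k = min(int(np.searchsorted(energy, energy_threshold)) + 1, len(s))
    # Columns of diag(s_k) @ vt_k are the texts in the reduced space.
    doc_vectors = (np.diag(s[:k]) @ vt[:k, :]).T
    return doc_vectors, k
```

Calling `tfidf_matrix` on a list of texts and passing the result to `reduce_with_svd` yields one low-dimensional vector per text. Note that under this TF-IDF formula a word that occurs in every text receives weight zero.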
[0034] S103. O...