Method for solving text similarity based on Gini index
A technology of text similarity and Gini index, applied in the field of semantic network, can solve the problems of low efficiency and accuracy of the text similarity algorithm of synonyms and polysemous words, and does not consider the importance and contribution of characteristic vocabulary sets, etc., and achieves great use value , high accuracy, high accuracy effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0025] In order to solve the problem of high-dimensional sparse feature vectors, the importance and contribution of the feature vocabulary set to the text, the problem of synonyms and polysemous words, and the low efficiency and accuracy of the text similarity algorithm, combined with Figure 1-Figure 3 The present invention has been described in detail, and its specific implementation steps are as follows:
[0026] Step 1: Use Chinese word segmentation technology to separate the two texts (w 1 ,w 2 ) for word segmentation processing, its specific word segmentation technology process is as follows:
[0027] Step 1.1: According to the "word segmentation dictionary", find the word in the sentence to be segmented that matches the dictionary, scan the Chinese character string to be segmented completely, search and match in the dictionary of the system, and mark the words in the dictionary when encountering them ; If there is no relevant match in the dictionary, simply split the ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com