Iterative extraction of Chinese synonyms based on pattern learning
A pattern-learning technology for synonym extraction, applied in semantic analysis, unstructured text data retrieval, text database clustering/classification, etc.
Examples
Embodiment
[0080] The concrete steps implemented in this example are described in detail below, in conjunction with the method of this technology:
[0081] (1) As shown in Figure 1, a Lucene index is built over the encyclopedia text, and 5000 synonym pairs are randomly selected from the seed thesaurus as seeds. The seed word pairs are used to search the corpus, and the text between the two words of each pair is extracted as a candidate pattern. The candidate patterns are clustered, each candidate pattern group is represented by its pattern prototype, the frequency of each candidate pattern group is counted, and the groups whose frequency is greater than 5 are kept;
[0082] (2) As shown in Figure 1, the candidate patterns are matched, and the entity pairs before and after the pattern in each candidate sentence are extracted as candidate synonym pairs;
[0083] (3) As shown in Figure 1, word2vec is used to calculate the semantic similarity between word pairs as positive and negative example...
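Steps (1) and (2) can be illustrated with a minimal Python sketch. It is not the patented implementation and rests on several assumptions: the corpus is a small in-memory list of pre-segmented sentences rather than a Lucene index, the clustering of candidate patterns is reduced to grouping identical token sequences, and the function names `extract_patterns` and `extract_candidates` as well as the toy data are hypothetical. The frequency threshold defaults to 5 as in step (1) and is lowered only for the toy corpus.

```python
from collections import Counter


def extract_patterns(sentences, seed_pairs, min_freq=5):
    """Step (1), simplified: collect the tokens found between seed synonym pairs.

    Assumption: identical token sequences stand in for a "pattern group";
    the clustering described in the source is not reproduced here.
    """
    counts = Counter()
    for tokens in sentences:
        for a, b in seed_pairs:
            if a in tokens and b in tokens:
                i, j = tokens.index(a), tokens.index(b)
                lo, hi = min(i, j), max(i, j)
                between = tuple(tokens[lo + 1:hi])
                if between:  # skip pairs that are directly adjacent
                    counts[between] += 1
    # Keep only patterns whose frequency is strictly greater than the threshold.
    return {pattern for pattern, n in counts.items() if n > min_freq}


def extract_candidates(sentences, patterns):
    """Step (2), simplified: take the tokens right before and after each pattern match."""
    pairs = set()
    for tokens in sentences:
        for pattern in patterns:
            k = len(pattern)
            # start >= 1 leaves room for a token before the pattern,
            # start + k <= len(tokens) - 1 leaves room for a token after it.
            for start in range(1, len(tokens) - k):
                if tuple(tokens[start:start + k]) == pattern:
                    pairs.add((tokens[start - 1], tokens[start + k]))
    return pairs


# Toy corpus (hypothetical), already segmented into words.
# "又称" means "also called".
sentences = [
    ["番茄", "又称", "西红柿"],
    ["土豆", "又称", "马铃薯"],
    ["玉米", "又称", "苞谷"],
]
seed_pairs = [("番茄", "西红柿"), ("土豆", "马铃薯")]

# The threshold is lowered to 1 only because the toy corpus is tiny.
patterns = extract_patterns(sentences, seed_pairs, min_freq=1)
candidates = extract_candidates(sentences, patterns)
```

In this toy run the two seed pairs yield the single pattern `("又称",)`, which in turn surfaces the unseen pair `("玉米", "苞谷")` as a new candidate. In the described method such candidates would still need to be scored, for example with the word2vec similarity mentioned in step (3).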