Semantic item representation and disambiguation method based on word statistics and WordNet
A technology for disambiguation and words, applied in computing, special data processing applications, instruments, etc., can solve problems such as reducing the accuracy of semantic calculations
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0041] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with specific examples.
[0042] A semantic item representation and disambiguation method based on word statistics and WordNet, such as figure 1 As shown, it specifically includes the following steps:
[0043] Firstly, the offline page file of Wikipedia is obtained, and then the illegal characters in it are converted into spaces, the image table is deleted and only the title is retained, the link is retained in the text, and finally the plain text containing a-z (A-Z range is converted to lowercase) and numbers is left. After the cleaning is completed, the co-occurrence matrix is generated by the word statistical model and the corresponding word vector is obtained therefrom, and the initial word vector is finally formed as the input of the semantic item generation model. It is to use the synset...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com