A word-net-based method for identifying similarity in large volumes of Web text information
A text information similarity technology, applied in character and pattern recognition, digital data information retrieval, network data retrieval, etc. It addresses the problems that the ambiguity of end-user query methods is not taken into account, that the target results of query requests are unclear, and that the content of query requests lacks pertinence. Its effects are to improve retrieval efficiency and the quality of retrieval results, optimize storage and index structures, and eliminate content plagiarism.
Embodiment Construction
[0057] The present invention is further described below with reference to the accompanying drawings:
[0058] The word-net-based method of the present invention for identifying similarity in large volumes of Web text information comprises the following steps:
[0059] (1) Build the word net, comprising the following steps:
[0060] 1.1. Extract text information from Web pages to form a document set D composed of multiple documents d, extract feature words from a document d in the document set D, and, for any two feature words f_i and f_j among all the feature words, calculate the normalized mutual information values norm_I_ij and norm_I_ji between them. According to the calculated norm_I_ij and norm_I_ji values, build the mutual information relation word pairs <f_i, f_j> and <f_j, f_i> between the feature words f_i and f_j, with norm_I_ij as the weight of the mutual information relation word pair <f_i, f_j> and norm_I_ji as the weight of the mutual information relation word pair <f_j, f_i>. When norm_I_ij = nor...
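As an illustration only, the word-net construction of step 1.1 might be sketched as follows. The excerpt does not give the formula for the normalized mutual information, so the sketch assumes one: document-level pointwise mutual information, divided by the largest value in the row of f_i, which makes norm_I_ij and norm_I_ji differ in general. The function name build_word_net, the positive-value filter, and the sample documents are hypothetical and are not taken from the patent.

```python
import math
from collections import Counter
from itertools import combinations


def build_word_net(docs):
    """Return {(f_i, f_j): norm_I_ij} for directed word pairs <f_i, f_j>.

    docs is a list of documents, each given as a list of feature words.
    ASSUMPTION: the weighting below is illustrative, not the patent's formula.
    """
    n_docs = len(docs)
    doc_sets = [set(d) for d in docs]

    # Document frequency of each feature word.
    df = Counter(w for s in doc_sets for w in s)

    # Co-occurrence document frequency of each unordered word pair.
    co_df = Counter()
    for s in doc_sets:
        for a, b in combinations(sorted(s), 2):
            co_df[(a, b)] += 1

    # Document-level pointwise mutual information (symmetric).
    # ASSUMPTION: only positively associated pairs are kept.
    pmi = {}
    for (a, b), n_ab in co_df.items():
        value = math.log(n_docs * n_ab / (df[a] * df[b]))
        if value > 0:
            pmi[(a, b)] = value
            pmi[(b, a)] = value

    # ASSUMPTION: normalize each value by the maximum in the row of f_i,
    # so the weight of <f_i, f_j> can differ from that of <f_j, f_i>.
    row_max = {}
    for (a, _), value in pmi.items():
        row_max[a] = max(row_max.get(a, 0.0), value)

    return {(a, b): value / row_max[a] for (a, b), value in pmi.items()}


if __name__ == "__main__":
    sample_docs = [
        ["web", "text"],
        ["web", "text"],
        ["text", "similarity"],
        ["web", "index"],
        ["index"],
    ]
    for pair, weight in sorted(build_word_net(sample_docs).items()):
        print(pair, round(weight, 3))
```

Under these assumptions the word net is a weighted directed graph stored as a dictionary keyed by ordered word pairs, so the weights of <f_i, f_j> and <f_j, f_i> can be looked up independently.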