Multi-document topic discovery method based on two-level clustering
A multi-document topic discovery method, applied in fields such as special data processing applications, instruments, and electrical digital data processing. It addresses problems such as uneven distribution, inconvenient discovery of multi-document topics, and adverse effects on sentence clustering. Its effects are to reduce spatial dimensionality, highlight the similarity of subject content, and improve computing speed.
Embodiment
[0041] As shown in Figure 1, the multi-document topic discovery method based on two-level clustering in this embodiment includes the following steps:
[0042] S1. Multiple documents are taken as input, and each document is preprocessed. Preprocessing includes splitting documents into sentences, segmenting sentences into words, obtaining the noun set and verb set of the multi-document collection, and performing word sense disambiguation on the polysemous words in those sets. The word sense disambiguation is performed as follows:
[0043] After word segmentation, each word is first tagged with its part of speech, and only the noun set and verb set are considered. For each polysemous word w in these sets, a semantic dictionary is first used to obtain its senses. Then, for each sense, the sum of its word similarity to the k words with the same part of speech before and after w is calculated.
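The scoring step described in this paragraph can be illustrated with a minimal Python sketch. It is not the patent's implementation. It assumes that "k words before and after" means up to k same-part-of-speech words on each side of w, and that the sense with the largest similarity sum is the one selected; the excerpt states only that the sums are calculated. The names `context_words`, `score_senses`, `disambiguate`, the sense labels, and the toy similarity table are all hypothetical; the similarity function is a placeholder for the word similarity calculation referred to in the text.

```python
from typing import Callable, Dict, List, Sequence, Tuple

Token = Tuple[str, str]  # (word, part-of-speech tag), e.g. ("bank", "n")


def context_words(tokens: Sequence[Token], index: int, k: int) -> List[str]:
    """Collect up to k words before and up to k words after tokens[index]
    that share its part of speech (assumed reading of the text)."""
    _, pos = tokens[index]
    before = [w for w, p in tokens[:index] if p == pos][-k:]
    after = [w for w, p in tokens[index + 1:] if p == pos][:k]
    return before + after


def score_senses(
    tokens: Sequence[Token],
    index: int,
    k: int,
    senses: Dict[str, List[str]],
    similarity: Callable[[str, str], float],
) -> Dict[str, float]:
    """For each sense of the word at `index`, sum its similarity
    to every context word."""
    word, _ = tokens[index]
    context = context_words(tokens, index, k)
    return {
        sense: sum(similarity(sense, c) for c in context)
        for sense in senses[word]
    }


def disambiguate(tokens, index, k, senses, similarity) -> str:
    """Assumption: keep the sense with the largest similarity sum."""
    scores = score_senses(tokens, index, k, senses, similarity)
    return max(scores, key=scores.get)


# Hypothetical stand-in for a semantic dictionary.
SENSES = {"bank": ["bank#finance", "bank#river"]}

# Hypothetical similarity values; a real system would compute these.
TOY_SIM = {
    ("bank#finance", "money"): 0.8,
    ("bank#finance", "loan"): 0.9,
    ("bank#river", "money"): 0.1,
    ("bank#river", "loan"): 0.1,
}


def toy_similarity(sense: str, word: str) -> float:
    return TOY_SIM.get((sense, word), 0.0)


tokens = [("money", "n"), ("deposit", "v"), ("bank", "n"),
          ("loan", "n"), ("approve", "v")]
chosen = disambiguate(tokens, 2, 1, SENSES, toy_similarity)
print(chosen)
```

With k = 1, the noun context of "bank" in this toy input is "money" and "loan", so the finance sense receives the larger sum. Passing the similarity function as a parameter keeps the scoring loop independent of whichever similarity measure is used.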
[0044] The word similarity above is calculated as follows:
[0045] S11. For the similarity of...