Unsupervised keyword extraction method
An unsupervised keyword extraction method in the field of text processing algorithms. It addresses two problems of existing methods: they do not directly consider the relevance of phrases, and their extraction quality is difficult to improve further. The method achieves richer semantic information and better extraction results.
Examples
Embodiment 1
[0042] As shown in Figure 1, a faster and more efficient unsupervised keyword extraction method based on a word-phrase graph and the LDA topic model proceeds as follows:
[0043] S1: Preprocess the document data, including removing stop words, part-of-speech tagging, and removing punctuation marks and illegal symbols, to obtain a word set W.
[0044] S2: Use pattern matching combined with syntactic rules to perform noun phrase chunking (NP-chunking); specifically, use the part-of-speech tags and an "adjective + noun" pattern to obtain a series of candidate key phrases.
[0045] S3: Use the LDA topic model to calculate the word salience score of each word in the word set W obtained in S1, sort the words in descending order of score, and take the top k as the topic word set of the document.
[0046] S4: Use the candidate key phrases obtained in S2 and the topic words obtained in S3 to construct a phrase-word graph.
[0047] S5: According to the graph structure constr...
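Steps S1, S2 and S4 can be illustrated with a minimal Python sketch. It is an illustration under stated assumptions, not the patented implementation. The input is assumed to be already tokenized and tagged with Penn Treebank style part-of-speech tags (JJ for adjectives, NN for nouns). The stop-word list, the function names, and the sample sentence are hypothetical. The topic words that S3 would produce with LDA are hand-picked here rather than computed. The edge rule (a phrase is linked to a topic word it contains) is also an assumption, since the text above does not state how the graph edges are defined.

```python
# Illustrative stop-word list (assumption); a real system would use a fuller list.
STOP_WORDS = {"a", "an", "the", "of", "and", "is", "for", "on", "in"}


def preprocess(tagged_tokens):
    """S1 sketch: drop stop words and non-alphanumeric tokens, return word set W."""
    return {
        word.lower()
        for word, _tag in tagged_tokens
        if word.isalnum() and word.lower() not in STOP_WORDS
    }


def chunk_noun_phrases(tagged_tokens):
    """S2 sketch: collect runs of zero or more adjectives followed by nouns."""
    phrases, adjectives, nouns = [], [], []

    def flush():
        # A run only counts as a candidate phrase if it contains a noun.
        if nouns:
            phrases.append(" ".join(adjectives + nouns).lower())
        adjectives.clear()
        nouns.clear()

    for word, tag in tagged_tokens:
        if tag.startswith("JJ"):
            if nouns:  # an adjective after nouns starts a new phrase
                flush()
            adjectives.append(word)
        elif tag.startswith("NN"):
            nouns.append(word)
        else:
            flush()
    flush()
    # Remove duplicates while keeping first-seen order.
    return list(dict.fromkeys(phrases))


def build_phrase_word_graph(phrases, topic_words):
    """S4 sketch: link each phrase to the topic words it contains (assumed edge rule)."""
    topics = set(topic_words)
    return {phrase: set(phrase.split()) & topics for phrase in phrases}


# Hypothetical pre-tagged input sentence.
tagged = [
    ("Unsupervised", "JJ"), ("keyword", "NN"), ("extraction", "NN"),
    ("uses", "VBZ"), ("a", "DT"), ("probabilistic", "JJ"), ("topic", "NN"),
    ("model", "NN"), ("and", "CC"), ("a", "DT"), ("graph", "NN"), (".", "."),
]

word_set = preprocess(tagged)
candidates = chunk_noun_phrases(tagged)
topic_words = ["keyword", "topic", "graph"]  # stand-in for the top-k output of S3
graph = build_phrase_word_graph(candidates, topic_words)

for phrase, linked in graph.items():
    print(phrase, "->", sorted(linked))
```

Keeping the graph as a plain dictionary from phrase to linked topic words makes the bipartite structure explicit; a ranking step such as S5 could then operate on it.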