Unsupervised multi-model fusion extraction type text abstracting method
A multi-model, extractive technology, applied in the field of information extraction, can solve problems such as inability to take into account the semantic information of sentences, inability to accurately and comprehensively describe the content of the article, and information redundancy.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0078] The present invention will be described in further detail below in conjunction with the accompanying drawings.
[0079] The present invention provides an unsupervised multi-model fusion extractive text summarization method, such as figure 1 shown, including the following steps:
[0080] S1. When extracting the abstract, the text preprocessing of the document to be processed must first be performed. The specific method is: divide the document to be processed into sentences first, and number each sentence in sequence; then perform word segmentation processing on each sentence, English can be used NLTK tool, Chinese can use jieba tool; remove stop words and invalid symbols, stop words include some modal particles, punctuation marks, articles, function words and other words that have no practical meaning or basically have no effect on sentence meaning.
[0081] S2. Train and optimize the centrality text summary model in advance, and calculate the first batch of summaries s...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com