Text subtopic discovery method based on improved lda
A discovery method and text sub-technology, which is applied in the directions of text database query, unstructured text data retrieval, text database browsing/visualization, etc., can solve problems such as lack of universal applicability, poor comprehensibility of keywords, labor-intensive, etc. question
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0052] In this example, if figure 1 As shown, a text subtopic discovery method based on improved LDA is carried out as follows:
[0053] Step 1. In this embodiment, the selected document collection is webpage news data, and the content of two weeks is grabbed from the webpage news around three event keywords, a total of more than 12,000 articles, one event is a document collection, and Treat each news data as a document. According to the domain of the event, a domain dictionary is constructed. The event in this example belongs to the financial field , Therefore, a financial news dictionary word segmentation and a financial news stop word list are constructed for pre-text preprocessing. The preprocessing steps include: removing stop words and word segmentation. Record the preprocessed document collection as D={D 1 ,...,D d ,...,D |D|}, where D d Indicates the preprocessed document of the dth article, 1≤d≤|D|, |D| indicates the total number of document collections; and t...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com