A text subtopic discovery method based on improved LDA
A discovery method and text sub-technology, applied in the direction of text database query, unstructured text data retrieval, text database browsing/visualization, etc. question
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0052] In this example, if figure 1 As shown, a text subtopic discovery method based on improved LDA is carried out as follows:
[0053] Step 1. In this embodiment, the selected document collection is webpage news data, and the content of two weeks is grabbed from the webpage news around three event keywords, a total of more than 12,000 articles, one event is a document collection, and Treat each news data as a document. According to the domain of the event, a domain dictionary is constructed. In this embodiment, the event belongs to the financial field, so the word segmentation of the financial news dictionary and the financial news stop word list are constructed for pre-text preprocessing. The preprocessing steps include: removing stop words and word segmentation. Record the preprocessed document collection as D={D 1 ,...,D d ,...,D |D|}, where D d Indicates the preprocessed document of the dth article, 1≤d≤|D|, |D| indicates the total number of document collections; ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com