Academic resource acquisition method based on LDA (latent Dirichlet allocation)
An acquisition method and academic technology, applied in the field of LDA-based academic resource acquisition, can solve the problem that traditional search technology is difficult to cover the different needs of mass users, achieve good topic matching effect, make up for time loss, and improve accuracy and quality Effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0060] Specific embodiments of the present invention will be described in detail below.
[0061] A method for obtaining academic resources based on LDA. The academic resources are various electronic documents published on the Internet, including but not limited to various papers, periodicals, news, patent documents, using a subject crawler that can be run by a computer, and using LDA topic model that can be run by computer, LDA topic model such as image 3 As shown; configure a corpus for the LDA topic model, the corpus of the corpus is used for the training of the LDA topic model, and the topic document crawled by the topic crawler is obtained through the calculation of the LDA topic model, and the topic document is a collection of topic related words, such as Figure 4 Shown; The topic crawler further includes a topic determination module, a similarity calculation module, and a URL priority sorting module on the basis of a common web crawler, such as figure 2 As shown; in ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com