LDA-based text classification method
A text classification technique applicable to text database clustering and classification, unstructured text data retrieval, special-purpose data processing applications, and similar tasks. Its stated advantages are ease of update and maintenance, high availability of results, and broad adaptability.
Embodiment Construction
[0040] Specific embodiments of the present invention will be described in detail below.
[0041] A text classification method based on LDA, as shown in Figure 1, uses a Bayesian probability calculation model as the text classification model. A set of feature words that best reflects the characteristics of the text to be classified is extracted and used as the feature word set input to the text classification model. The original feature word set is the front part of the original word set after sorting by feature weight. The text classification model calculates the probability that the combination of feature words belongs to each of A predetermined categories, and the category with the largest probability value is taken as the text's category. Following the usual subject classification conventions, all subjects can be divided into 75 subject categories, that is, the number of categories A is 75. Use the LDA topic model to assist the text clas...
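The selection and classification steps described in [0041] can be illustrated with a minimal sketch. It is not the patented implementation. The passage does not say how the feature weight is computed or which Bayesian model is used, so the sketch assumes raw term frequency as the weight and a naive Bayes model with add-one smoothing. The function names `top_feature_words`, `train`, and `classify` and the two toy categories are hypothetical. The LDA-assisted step is left out because the source text is truncated before describing it.

```python
import math
from collections import Counter


def top_feature_words(words, k):
    """Keep the front k words after sorting by feature weight.

    Assumption: the weight is raw term frequency; the source does not
    specify the weighting scheme.
    """
    counts = Counter(words)
    # Sort by descending weight, then alphabetically so ties are deterministic.
    ranked = sorted(counts.items(), key=lambda kv: (-kv[1], kv[0]))
    return [word for word, _ in ranked[:k]]


def train(labeled_docs):
    """Collect per-category word counts from (category, words) pairs."""
    word_counts = {}
    doc_counts = Counter()
    vocab = set()
    for category, words in labeled_docs:
        doc_counts[category] += 1
        word_counts.setdefault(category, Counter()).update(words)
        vocab.update(words)
    return word_counts, doc_counts, vocab


def classify(feature_words, model):
    """Return the category with the largest posterior and all log scores.

    Assumption: naive Bayes with add-one (Laplace) smoothing.
    """
    word_counts, doc_counts, vocab = model
    total_docs = sum(doc_counts.values())
    vocab_size = len(vocab)
    scores = {}
    for category in doc_counts:
        total_words = sum(word_counts[category].values())
        log_p = math.log(doc_counts[category] / total_docs)  # log prior
        for word in feature_words:
            smoothed = (word_counts[category][word] + 1) / (total_words + vocab_size)
            log_p += math.log(smoothed)  # log likelihood of each feature word
        scores[category] = log_p
    best = max(scores, key=scores.get)
    return best, scores


# Toy data with two categories; the method in the text uses A = 75.
model = train([
    ("physics", ["quantum", "energy", "particle", "energy"]),
    ("biology", ["cell", "protein", "gene", "cell"]),
])
features = top_feature_words(["energy", "quantum", "energy", "the"], k=2)
best, scores = classify(features, model)
print(features, best)
```

Working in log space avoids numerical underflow when many feature words are multiplied together, which matters more as the feature word set grows.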