Hybrid theme model construction method for deep learning
A topic model and construction method technology, which is applied in neural learning methods, biological neural network models, unstructured text data retrieval, etc., can solve problems such as insufficient feature extraction, low sample efficiency, long training time, etc., and achieve transferability Strong, low classification error rate, good overall classification effect of the model
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0030] The present invention realizes by following technical method scheme, as figure 1 As shown, a deep learning mixed topic model construction method is applied to semantic analysis and text mining in the field of natural language processing. It has been extended to the field of bioinformatics, and topic models are often applied to text representation, Dimensionality reduction processing, clustering text by topic, and forming a text recommendation system based on user preferences, etc.
[0031] Currently, there are five topic models: LSA, pLSA, LDA, HDP, and lda2vec, among which:
[0032] LSA is latent semantic analysis (Latent Semantic Analysis), which is one of the foundations of topic modeling. It mainly uses linear algebra theory for semantic analysis. Its core idea is to decompose the owned "document-term" matrix into mutually independent "document-term" Topic" matrix and "topic-term" matrix, the more frequently a term appears in a document, the greater its weight.
...
Embodiment 2
[0055] Using the HTM hybrid topic model not only has good transfer learning ability, but also has strong feature extraction and resource representation capabilities, which can greatly improve the efficiency of sample usage, so that less sample data can achieve optimal performance. Assume that the number of hidden layers in the convolutional network CNN is set to 1, num_filtes is set to 100, the convolution kernel filter_size is set to 3, max_len is set to 50, and the value range of the dropout method is used to solve the overfitting problem In [0.4,0.6], the experiment chooses 0.5 by default, the purpose is to reduce the complex co-adaptability between neurons and improve the generalization ability of the model. Let each neuron not work with a probability of 50%, that is, it is in a sleep state, and does not perform forward score propagation or reverse error transmission. The two groups of original data used in the present invention are respectively from the Huawei cloud commu...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com