Short text topic recognition method based on Dirichlet variational auto-encoder
A technology of autoencoder and recognition method, which is applied in the field of short text, can solve the problems of accelerated model training, short text topic model feature sparseness, etc., achieve the effect of simple training, alleviate the problem of topic redundancy, and improve the efficiency of topic recognition
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0045] Such as Figure 1 to Figure 2 Shown is the first embodiment of a short text topic recognition method based on Dirichlet variational autoencoder of the present invention. A short text topic recognition method based on Dirichlet variational autoencoder, which includes the following specific steps:
[0046] S1. Preprocess the short text data set, segment words, remove stop words, punctuation marks and numbers, and obtain the text feature vector of the data set;
[0047] S2. Perform clustering based on the text feature vector training obtained by the preprocessing of step S1, and determine the category to which each short text in the short text collection belongs, and this category is used as supplementary feature information of the short text;
[0048]S3. Construct a conditional variational neural topic model based on the text feature vector obtained in step S1 and the supplementary feature information of the short text obtained in step S2, and obtain the document-topic d...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com