Short text recommendation method for user-based biterm topic model
A topic model and recommendation method technology, applied in special data processing applications, instruments, electronic digital data processing, etc., can solve the problem of not considering the author information of short texts, the quality of topic analysis is difficult to meet the requirements of short text recommendation, loss and so on
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
example 1
[0089] Example 1, Quantitative evaluation of the theme analysis ability of UBTM of the present invention
[0090] 1. Input and output data description
[0091] We apply the method of the present invention to the anonymized data of the actual microblog. The input is a set of microblog data, and the statistics are shown in Table 1: the data set has 101212 short texts, which are divided into 738 according to different users group, each group has an average of 137.14 documents, and the average word length of each document is 29. Several samples of the data are listed below.
[0092] A few samples of short text data
[0093]
[0094]
[0095] The output is the topic analysis quality evaluation index of the UBTM topic model of the present invention.
[0096] 2. Model learning and parameter inference
[0097] First read all microblogs and users corresponding to the microblog, and read a list of stop words in Chinese at the same time. For each microblog, use the stop word l...
example 2
[0111] Example 2, application evaluation in the Weibo recommendation scenario
[0112] 1. Input and output data description
[0113] In this example, we apply the topic analysis of the present invention to the practical application scenario of microblog recommendation. From the 6-month Weibo data, we selected more than 7,000 Weibo with relatively high popularity, and observed 380,000 records that more than 20,000 users reposted or did not repost these 7,000 Weibo. Retweeting can be used as a factual basis for users to like this Weibo. Predicting the behavior of retweeting is the purpose of this experiment: we recommend Weibo to users based on UBTM, and measure the recommendation accuracy and recall according to whether users retweet or not. Rate.
[0114] The selection rules of 380,000 records are as follows: First, we divide the data into training set and test set according to time. For each user, arrange the microblogs forwarded by the user according to time, and select th...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com