Hot news prediction method and system based on microblog data analysis
A technology of data analysis and microblogging, applied in the direction of network data indexing, network data retrieval, and other database retrieval, etc., can solve problems such as the inability to find hot topics and the inability to comprehensively analyze the characteristics of hot topics, and achieve the effect of enhancing practicality
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0041] Such as figure 1 with figure 2 As shown, the hot news prediction method based on microblog data analysis of the present embodiment includes the following steps:
[0042] S1. Collect news reports from mainstream news websites and the microblog user reaction information caused by them on microblogs. The news reports include titles and texts, and the microblog user reaction information uses news titles as keywords to search on microblogs. The microblog result set includes microblog user information, microblog text, posting time, but does not include news reports in microblog by news media;
[0043] S2. Perform word segmentation and word frequency statistics on the microblog text, calculate the TF-IDF (term frequency-inversedocument frequency) value of the word, and convert it to describe a microblog topic using a vector space;
[0044] S3. Classify the microblog topics, describe the three quantitative indicators of the microblog topics, and calculate the three popularit...
Embodiment 2
[0078] Such as image 3 As shown, the hot news prediction system based on microblog data analysis of the present embodiment, the system includes:
[0079] The data collection module is used to collect news reports from mainstream websites and the reaction information of Weibo users on Weibo;
[0080] The text analysis processing module is used to perform word segmentation and word frequency statistics on the microblog text, calculate the TF-IDF value of the word, and convert it into a vector space to describe a microblog topic;
[0081] The data statistical analysis module is used to classify Weibo topics, count and describe various quantitative indicators of Weibo topics, and calculate various popularity indicators of news;
[0082] The hot news prediction module is used to use the multiple linear regression algorithm to learn the sample data, establish a hot news prediction model, and judge whether the following news will become a hot news according to the hot news predicti...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com