Multi-information fusion microblog hot topic detection method

A multi-information fusion technique for detecting hot topics on Weibo. It addresses the problems of data sparsity, the lack of in-depth mining and analysis of social relations in online social networks, and the inability to make use of social network information, with the effect of improving performance.

Inactive Publication Date: 2013-09-11
BEIHANG UNIV
2 Cites, 24 Cited by

AI Technical Summary

Problems solved by technology

[0003] Traditional research on topic detection focuses mainly on the Internet content itself. Its methods and technologies are oriented mainly to traditional news documents, and the data are treated as isolated pieces of information. Without in-depth mining and analysis of the social relationships contained i

Examples

Embodiment Construction

[0016] The method of the present invention will be further described in detail below in conjunction with the drawings and the embodiments of the present invention.

[0017] As shown in Figure 1, the specific implementation of the present invention is as follows:

[0018] (1) Calculation of the weight of feature words combined with the influence of bloggers

[0019] The data used for Weibo hot topic detection include web page data crawled from news sites and microblog data crawled from Weibo. The raw crawled data contain a lot of noise and must be preprocessed first: the HTML pages are parsed to obtain the Web body text and the social information, the extracted body text is segmented into words, and stop words and common words are removed. The words that remain in a microblog are called feature words. The extracted social information refers to the blogger's fans (followers) and the accounts the blogger follows.
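The preprocessing described in [0019] can be sketched roughly as follows. This is only an illustration under stated assumptions, not the patent's implementation: the names `extract_body_text`, `feature_words` and `STOP_WORDS` are hypothetical, the stop-word list is a toy example, and the regular-expression tokenizer stands in for a real Chinese word segmenter, which actual Weibo text would require.

```python
import re
from html.parser import HTMLParser


class _TextExtractor(HTMLParser):
    """Collects visible text and skips the contents of <script> and <style>."""

    def __init__(self):
        super().__init__()
        self._skip_depth = 0
        self.chunks = []

    def handle_starttag(self, tag, attrs):
        if tag in ("script", "style"):
            self._skip_depth += 1

    def handle_endtag(self, tag):
        if tag in ("script", "style") and self._skip_depth > 0:
            self._skip_depth -= 1

    def handle_data(self, data):
        if self._skip_depth == 0 and data.strip():
            self.chunks.append(data.strip())


def extract_body_text(html):
    """Parse an HTML page and return its visible text as one string."""
    parser = _TextExtractor()
    parser.feed(html)
    parser.close()
    return " ".join(parser.chunks)


# Hypothetical stop-word list; a real system would load a much larger one.
STOP_WORDS = frozenset({"the", "a", "an", "is", "in", "of", "and", "to"})


def feature_words(html, stop_words=STOP_WORDS):
    """Return the words left after tokenizing and removing stop words.

    Assumption: tokens are runs of word characters. Chinese text would
    need a dedicated word segmenter in place of this regular expression.
    """
    text = extract_body_text(html).lower()
    tokens = re.findall(r"\w+", text)
    return [t for t in tokens if t not in stop_words]
```

For example, calling `feature_words` on a page whose body reads "The earthquake in Sichuan" would keep only `earthquake` and `sichuan` as feature words.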

[0020] Weibo is a social network where pe...

Abstract

The invention discloses a microblog hot topic detection method based on multi-information fusion. First, the influence of each blogger is calculated from the social relations of the microblog publisher, and the sum of the weights of each feature word over all microblogs in a given time period is then calculated from the blogger influence and the frequency of the feature word. Next, bursty feature words are detected from the way each feature word's weight sum changes over time; the microblog data are expanded with a Web news corpus to calculate the association value between bursty feature words, and an association graph of the bursty feature words is constructed. Finally, the association graph is partitioned, and each strongly connected subgraph represents one topic, which completes the detection of microblog hot topics. The method jointly uses microblog feature word information, blogger social relation information and related Web news documents to detect microblog hot topics, and it improves the detection efficiency.
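The steps named in the abstract can be sketched end to end as below. This is a minimal sketch under loudly stated assumptions, since the excerpt does not give the patent's formulas. Blogger influence is approximated by a log of the follower count. A word counts as bursty when its weight exceeds the mean plus `k` standard deviations of its past weights. The association value is a Jaccard co-occurrence score over microblogs plus news documents. Topics are the connected components of an undirected thresholded graph. All function names, parameters and thresholds are hypothetical.

```python
import math
from collections import defaultdict
from itertools import combinations
from statistics import mean, pstdev


def blogger_influence(followers):
    """followers: {blogger: set of follower ids}. Assumed formula, not the patent's."""
    return {b: 1.0 + math.log1p(len(f)) for b, f in followers.items()}


def word_weights(posts, influence):
    """posts: iterable of (time_slot, blogger, [feature words]).

    Each occurrence of a word adds the posting blogger's influence
    (1.0 for unknown bloggers). Returns {word: {slot: summed weight}}.
    """
    weights = defaultdict(lambda: defaultdict(float))
    for slot, blogger, words in posts:
        for w in words:
            weights[w][slot] += influence.get(blogger, 1.0)
    return weights


def bursty_words(weights, slot, history, k=2.0):
    """Words whose weight in `slot` exceeds mean + k * std over `history`.

    `history` must be a non-empty list of earlier slots (assumed rule).
    """
    result = set()
    for w, series in weights.items():
        past = [series.get(s, 0.0) for s in history]
        if series.get(slot, 0.0) > mean(past) + k * pstdev(past):
            result.add(w)
    return result


def association_graph(words, documents, min_assoc=0.3):
    """Link two bursty words when their Jaccard co-occurrence >= min_assoc.

    documents: word collections from microblogs plus Web news texts.
    """
    doc_sets = {w: {i for i, d in enumerate(documents) if w in d} for w in words}
    graph = {w: set() for w in words}
    for a, b in combinations(sorted(words), 2):
        union = doc_sets[a] | doc_sets[b]
        if union and len(doc_sets[a] & doc_sets[b]) / len(union) >= min_assoc:
            graph[a].add(b)
            graph[b].add(a)
    return graph


def topics(graph):
    """Split the graph into connected components; each one is a topic."""
    seen, result = set(), []
    for start in sorted(graph):
        if start in seen:
            continue
        stack, component = [start], set()
        while stack:
            node = stack.pop()
            if node in seen:
                continue
            seen.add(node)
            component.add(node)
            stack.extend(graph[node] - seen)
        result.append(component)
    return result
```

Because the assumed association score is symmetric, the graph here is undirected and connected components stand in for the strongly connected subgraphs mentioned in the abstract. A directed association measure would call for a strongly connected components algorithm instead.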

Description

Technical field

[0001] The invention relates to a microblog hot topic detection method based on multi-information fusion, which can automatically detect new hot topics in microblogs. It can be applied to various types of social media data, and is suitable for data mining in social networks and for monitoring public opinion on social networks.

Background technique

[0002] With the development of Web 2.0 technology, Web-based social networks have become more and more popular. Especially in recent years, online social networks have attracted more and more users and have become the hottest network platforms. Information generated by social network users has become the main source of Internet content. For example, Sina Weibo gained nearly 200 million registered users in just over a year, with more than 80 million microblogs generated every day, and Sohu Weibo has more than 20 million users. With the increase in the number of users, Weibo has gradually become the main place to reflect social hot ...

Application Information

IPC IPC(8): G06F17/30
Inventor 张小明, 李舟军
Owner BEIHANG UNIV