Micro-blog hot word and hot topic mining system and method

A hot topic and hot word technology, applied in the field of social networks, can solve problems such as low topic recognition, undiscovered topics, and no intersection allowed

Inactive Publication Date: 2014-03-26
FUZHOU UNIV
View PDF3 Cites 33 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] For the hotspot detection problem of massive microblog data, the main problems of the existing hotword clustering methods are: firstly, the words involved in different topics in the clustering results are not allowed to overlap, which is not consistent with the actual situation, and it is easy to As a result, some topics have not been discovered, or the recognition of topics is very low
In addition, the traditional clustering algorithm has a high time complexity, and it is difficult to adapt to the clustering requirements of massive microblog data.
[0007] In summary, there have been relatively complete technologies and methods for the influence analysis of individual users in social networks, but there are relatively few methods for community-level influence analysis in social networks, and there is a lack of analysis of communities in social networks. Comprehensive analysis and evaluation of influence, in the face of large-scale social network scenarios, existing methods are difficult to meet the requirements in terms of analysis effect and efficiency

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Micro-blog hot word and hot topic mining system and method
  • Micro-blog hot word and hot topic mining system and method
  • Micro-blog hot word and hot topic mining system and method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0055] The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.

[0056] figure 1 It is a schematic diagram of the module structure of the microblog hot word and hot topic mining system of the present invention. Such as figure 1 As shown, the system includes: a preprocessing module 100 , a hot word screening module 200 , a hot word co-occurrence network construction module 300 and a hot word clustering module 400 .

[0057] The preprocessing module 100 is used to preprocess the content data released in the social network, obtain candidate hot words, and construct a candidate hot word set with this; The frequency of occurrence and suddenness of words at the current moment and within a given historical time window, calculate the vitality of each candidate hot word, filter out hot words, and build a hot word set with this; the hot word co-occurrence network construction module 300 uses To calculate the corr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the technical field of social networks, in particular to a micro-blog hot word and hot topic mining system and method. The method includes the following steps that content data released in a micro-blog are preprocessed to acquire a candidate hot word sequence; according to the frequency of occurrence and suddenness of candidate hot words in a candidate hot word set at the current moment and in a given historical time window, the vitality of each candidate hot word is worked out, and a hot word set is formed by screening the candidate hot words; according to the hot word set formed by screening the candidate hot words, hot word correlation is worked out, and a hot word co-occurrence network is constructed; according to the hot word co-occurrence network, the hot word set is partitioned through the hot word clustering algorithm based on multi-label propagation to acquire a hot topic set. By means of the micro-blog hot word and hot topic mining system and method, efficient micro-blog hot word and hot topic mining is achieved, and mining precision and processing efficiency are improved.

Description

technical field [0001] The invention relates to the technical field of social network, in particular to a microblog hot word and hot topic mining system and method. Background technique [0002] With the rise of Weibo, people's participation is constantly increasing. Users can publish what they see and hear anytime and anywhere through computers and mobile phones, and realize instant sharing. Now Weibo has become a fashion on the Internet, and it is also an important place for generating and discussing hot topics. Hot topics refer to topics that frequently appear on the Internet within a period of time and are widely concerned and discussed by people. The exponential growth of microblog information makes how to effectively control massive amounts of information and extract hot topics become an urgent problem to be solved. [0003] For hot topic detection, the traditional method is to cluster the text, but this method is not conducive to the user's intuitive identification o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/35G06F16/958
Inventor 陈羽中郭文忠陈国龙方明月
Owner FUZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products