A burst keyword detection method for microblog text stream

A burst keyword detection technology applied in the field of Internet information management. It addresses the high false positive rate and low detection rate of existing methods, which ignore the promotional effect of zombie fans, and it improves detection accuracy.

Active Publication Date: 2017-11-28
HARBIN ENG UNIV
2 Cites, 0 Cited by

AI Technical Summary

Problems solved by technology

[0005] Existing microblog burst keyword detection methods consider neither the role of zombie fans in the formation of burst topics nor the influence of human daily schedules on detection accuracy. When such methods are applied to actual microblog public opinion monitoring, keywords promoted by zombie fans or affected by human daily schedules may be misjudged as burst keywords, resulting in a higher false positive rate and a lower detection rate.



Examples


Embodiment Construction

[0031] The present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, but not to limit the present invention. In addition, it should be noted that, for the convenience of description, only parts related to the present invention are shown in the drawings but not all content.

[0032] This method uses a trust model to evaluate the credibility of microblog users' interaction behavior and obtain each user's trust value. Only microblog messages from credible users, whose trust value is higher than the set trust threshold, are used as input to the dynamics-based burst keyword discovery algorithm. Combining the trust model with the dynamics-based burst keyword discovery algorithm to detect burst keywords in Weibo reduces the impact of zombie fans in Weibo and human life work a...
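The trust-threshold gate described in [0032] can be illustrated with a minimal Python sketch. The `Message` type, the `filter_trusted` name, and the choice to treat users without a trust score as untrusted are assumptions made for illustration; the source does not specify them, and the trust values themselves are taken as given here.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class Message:
    """Hypothetical container for one microblog message."""
    user_id: str
    text: str
    timestamp: float  # seconds since some reference time


def filter_trusted(messages, user_trust, trust_threshold):
    """Keep only messages whose author's trust is strictly above the threshold.

    Assumption: a user with no entry in `user_trust` is treated as untrusted.
    """
    kept = []
    for message in messages:
        score = user_trust.get(message.user_id)
        if score is not None and score > trust_threshold:
            kept.append(message)
    return kept
```

Only the messages returned by such a filter would then be passed on to the burst keyword discovery step.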


Abstract

The invention relates to the field of Internet information management, in particular to a burst keyword detection method for microblog text streams. The method comprises the steps of: acquiring microblog data in real time and establishing a message session model based on a dynamic sliding window mechanism from the real-time microblog data stream; extracting user trust attributes from the message session model, establishing a dynamic trust model according to the set trust window size, and computing user trust; segmenting the real-time microblog data stream according to the set trust window size and incorporating user trust to compute the weight of each keyword in each time window, forming a weight sequence for the keyword; and applying a burst keyword discovery algorithm based on a dynamics model to compute the burst weight of each keyword from its weight sequence, a keyword being confirmed as a burst keyword if its burst weight is larger than the burst weight threshold set by the system. The method reduces the influence of human work and rest schedules and improves the accuracy of burst keyword detection.
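The last two steps of the abstract (trust-weighted keyword weights per time window, then a burst weight compared against a threshold) can be sketched as follows. This is only an illustration under stated assumptions: the source does not give the weighting formula or the dynamics model, so the sketch sums author trust per keyword occurrence and uses a MACD-style momentum (short exponential moving average minus long one) as a stand-in for the dynamics-based score. All function names, parameters, and default spans are hypothetical.

```python
from collections import defaultdict


def keyword_weight_series(messages, user_trust, window_size, num_windows, start_time=0.0):
    """Build {keyword: [weight per time window]}.

    `messages` is an iterable of (timestamp, user_id, keywords) tuples.
    Assumption: each message adds its author's trust to every distinct
    keyword it contains; unknown users contribute 0.
    """
    series = defaultdict(lambda: [0.0] * num_windows)
    for timestamp, user_id, keywords in messages:
        index = int((timestamp - start_time) // window_size)
        if 0 <= index < num_windows:
            for keyword in set(keywords):
                series[keyword][index] += user_trust.get(user_id, 0.0)
    return dict(series)


def ema(values, span):
    """Exponential moving average, seeded with the first value."""
    alpha = 2.0 / (span + 1)
    smoothed = []
    previous = values[0]
    for value in values:
        previous = alpha * value + (1.0 - alpha) * previous
        smoothed.append(previous)
    return smoothed


def burst_weight(weights, short_span=2, long_span=4):
    """Stand-in burst score: short EMA minus long EMA at the latest window."""
    if not weights:
        return 0.0
    return ema(weights, short_span)[-1] - ema(weights, long_span)[-1]


def detect_bursts(series, burst_threshold):
    """Return keywords whose burst score exceeds the threshold, sorted by name."""
    return sorted(kw for kw, weights in series.items()
                  if burst_weight(weights) > burst_threshold)
```

A keyword whose weight stays flat across windows scores near zero under this stand-in, while a keyword whose trusted mentions rise sharply in the latest window scores positive, which is the behavior the threshold comparison relies on.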

Description

Technical field

[0001] The invention relates to the field of Internet information management, in particular to a burst keyword detection method for microblog text streams.

Background

[0002] With the official launch of Twitter in 2006 and the rapid development of Web 2.0 technology, Internet-based social networking platforms have become the most representative applications of the Web 2.0 era. Among them, microblog (hereinafter referred to as Weibo), as a major platform, has attracted the attention of the majority of netizens. Major online media platforms in China, including Sina, Tencent, Sohu, and Netease, have launched their own microblog services since 2009, and microblog has officially entered the mainstream view of Chinese Internet users.

[0003] At present, Weibo has become one of the important ways for netizens to obtain information. Weibo has gradually evolved from meeting people's weak-tie social needs into a popular pu...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F17/30; G06F17/27
CPC: G06F16/951; G06F40/205
Inventor: 杨武, 董国忠, 王巍, 苘大鹏, 玄世昌
Owner: HARBIN ENG UNIV