Method for identifying microblog key users based on improved Page Rank

A key-user identification technology, applied in special data processing applications, website content management, instruments, etc. It addresses the problem that computing time and space consumption can hardly meet requirements, and achieves improved efficiency and quality together with strong robustness.

Status: Inactive
Publication Date: 2014-01-22
BEIHANG UNIV
Cites: 7; Cited by: 25

AI Technical Summary

Problems solved by technology

Because the Weibo platform holds massive amounts of data, the dynamically generated network is often very large, and results are needed immediately. The traditional power iteration algorithm can therefore hardly meet the demands on computation time and space.



Examples


Embodiment Construction

[0020] The present invention will be further described below in conjunction with the accompanying drawings and specific implementation examples.

[0021] The invention proposes an improved PageRank-based method for identifying key users of a microblog platform. With PageRank at its core, the method adopts MapReduce parallel computing to overcome the low computational efficiency on microblog big data, extracts forwarding information from the structure of microblog text to construct a network of forwarding relationships, and finally uses PageRank to obtain highly robust, high-quality key user identification results. Using the forwarding relationship yields a high-quality dynamic forwarding network related to the query, which can, to a certain extent, overcome the inferior solutions caused by short text, relevance, and static network structure; multiple dynamic forwarding networks are calculated separately by PageRank and combined to im...
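As a rough illustration of how forwarding information might be pulled out of microblog text, the sketch below assumes that each forwarded post embeds its upstream chain with `//@name:` markers, nearest upstream user first. That marker format, the function names `forwarding_chain` and `forwarding_edges`, and the choice to point edges from the forwarder to the user being forwarded are all assumptions made for this example; the text above does not specify them.

```python
import re
from collections import Counter

# Assumed repost marker: "//@name:" introduces each upstream hop,
# with the nearest upstream user appearing first in the text.
_HOP = re.compile(r"//@([^:：\s]+)[:：]")


def forwarding_chain(poster, text):
    """Return the users on the chain, from original author to `poster`."""
    upstream = _HOP.findall(text)  # nearest upstream user first
    return list(reversed(upstream)) + [poster]


def forwarding_edges(posts):
    """Count weighted edges (forwarder, source) over (poster, text) pairs."""
    edges = Counter()
    for poster, text in posts:
        chain = forwarding_chain(poster, text)
        # Each adjacent pair on the chain is one observed forwarding hop.
        for source, forwarder in zip(chain, chain[1:]):
            edges[(forwarder, source)] += 1
    return edges


posts = [
    ("carol", "agree //@bob: nice //@alice: original news"),
    ("dave", "wow //@alice: original news"),
]
edges = forwarding_edges(posts)
```

Here the edge weight is simply the number of times a hop is observed across the retrieved posts, so a hop that appears in several downstream chains is counted more than once. Whether that is the right weighting depends on the application.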



Abstract

The invention discloses a method for identifying microblog key users based on an improved PageRank. The method comprises the following steps: microblog information data comprising n microblogs are input; word segmentation is performed on the texts of the n microblogs; an inverted index is built from the word segmentation result so that microblogs can be retrieved by specified keywords; for the retrieved relevant microblogs, the forwarding hierarchy information is extracted and a weighted directed graph, the forwarding network G, is constructed; the forwarding network G is divided into a plurality of maximal connected subgraphs Gi; the PageRank algorithm is applied to each subnetwork Gi using parallel computing; the results of the subnetworks are combined to generate a ranking for the whole network G; and the top m users in the ranking are selected and output as the key users. By adopting parallel computing, the method ranks the dynamic forwarding network of a microblog platform in a big data environment, thereby identifying the key users in the information dissemination process. It can be applied in fields such as network public opinion analysis.
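The last steps of the abstract (split G into connected subgraphs, run PageRank on each, combine, take the top m) can be sketched in a few dozen lines. This is a minimal single-machine sketch, not the patented procedure: it uses weakly connected components, a plain power iteration with damping 0.85, and it combines the subnetwork results by scaling each local score by the subnetwork's share of nodes. The combination rule, the damping value, and all function names are assumptions for illustration; the abstract does not state how results are combined. The loop over components is the part that a parallel framework such as MapReduce would distribute.

```python
from collections import defaultdict


def weak_components(nodes, edges):
    """Group nodes into weakly connected components with union-find."""
    parent = {v: v for v in nodes}

    def find(v):
        while parent[v] != v:
            parent[v] = parent[parent[v]]  # path halving
            v = parent[v]
        return v

    for u, v in edges:
        parent[find(u)] = find(v)
    groups = defaultdict(list)
    for v in nodes:
        groups[find(v)].append(v)
    return list(groups.values())


def pagerank(nodes, edges, damping=0.85, iters=100):
    """Weighted PageRank by power iteration; edges maps (u, v) -> weight > 0."""
    n = len(nodes)
    rank = {v: 1.0 / n for v in nodes}
    out_weight = defaultdict(float)
    for (u, _), w in edges.items():
        out_weight[u] += w
    for _ in range(iters):
        # Rank held by nodes without out-links is spread uniformly.
        dangling = sum(rank[v] for v in nodes if out_weight[v] == 0)
        nxt = {v: (1 - damping) / n + damping * dangling / n for v in nodes}
        for (u, v), w in edges.items():
            nxt[v] += damping * rank[u] * w / out_weight[u]
        rank = nxt
    return rank


def top_key_users(edges, m):
    """Rank each component separately, combine, and return the top m users."""
    nodes = sorted({x for e in edges for x in e})
    total = len(nodes)
    scores = {}
    for comp in weak_components(nodes, edges):  # independent, parallelizable
        members = set(comp)
        sub = {e: w for e, w in edges.items() if e[0] in members}
        for v, r in pagerank(comp, sub).items():
            # Assumed combination rule: weight by the component's node share.
            scores[v] = r * len(comp) / total
    return sorted(scores, key=lambda v: (-scores[v], v))[:m]


edges = {("bob", "alice"): 1, ("carol", "bob"): 1,
         ("dave", "alice"): 1, ("x", "y"): 1}
print(top_key_users(edges, 2))
```

Because the components share no edges, each `pagerank` call touches only its own nodes, which is what makes the per-component computation easy to run in parallel.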

Description

Technical field

[0001] The invention relates to a method for identifying key microblog users, in particular an improved PageRank-based method. It belongs to the field of complex networks and data mining and is aimed especially at analyzing massive microblog data.

Background

[0002] The key users of the Weibo platform are those who play an important role in the dissemination and diffusion of information. They act as important intermediaries or filters in the formation of mass communication effects: they spread information to the audience and form a cascading dissemination of information. Identifying key users is therefore important for information discovery and dissemination analysis, and has great guiding significance for network public opinion analysis and similar work. However, content-based key user identification is often not accurate enough due to the characteristics of short texts on Weibo;...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
CPC: G06F16/958
Inventor: 程工, 刘春阳, 张旭, 庞琳, 吴俊杰, 韩洋, 刘洪甫, 韩小汀
Owner: BEIHANG UNIV