Microblog cell division method based on user comprehensive similarities

A technology of comprehensive similarity and community division, which is applied in the field of microblog community division based on user comprehensive similarity, can solve problems such as high complexity and inability to converge, and achieve the effect of improving stability and accuracy

Inactive Publication Date: 2018-03-30
JIANGSU UNIV +1
View PDF2 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In 2007, Raghavan et al. proposed the Label propagation algorithm (Label propagation Algorithm, LPA), which effectively solved the problem of high complex

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Microblog cell division method based on user comprehensive similarities
  • Microblog cell division method based on user comprehensive similarities
  • Microblog cell division method based on user comprehensive similarities

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0049] The present invention designs a microblog community division method based on user comprehensive similarity, and its flow chart is as follows figure 1 shown, including the following steps:

[0050] Step 1: Obtain microblog data, create a collection of blog posts, conduct LDA topic model training, and obtain a 'category-topic' matrix.

[0051] Obtain microblog data, including microblog user information and microblog post information, where microblog user information includes self attributes, fan attributes, and social attributes, and microblog post information includes basic attributes, time attributes, text attributes, and influence attributes.

[0052] Carry out LDA topic model training on the blog post collection, classify all the extracted topic words, and characterize the topic words, and use TF-IDF to calculate the feature weight value, which is the probability of the topic words appearing in the category, and get A 'category-topic' matrix for blog posts.

[0053]...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention designs a microblog cell division method based on user comprehensive similarities. According to the specific process of the method, 1, microblog data is acquired, LDA topic model training is performed on a blog article set, and a user topic similarity matrix is obtained through topic mining based on feature extension; 2, a network topological graph with users being nodes and user relations being edges is constructed, and a user comprehensive similarity matrix is obtained according to node link relevancy and topic similarities; and 3, a unique tag is allocated for each node first,the potential influence of each node is evaluated, then the descending order of the potential influences serves as a node selection order, the descending order of node comprehensive similarities serves as a tag update order of the nodes, and finally iterative update of the tags is performed. In this way, cell division can be performed on the microblog users through an improved tag propagation algorithm on the basis of considering the user comprehensive similarities, and the method has high application value for online public opinion monitoring, commercial user mining and the like.

Description

technical field [0001] The invention relates to the technical field of social networks, in particular to a method for dividing microblog communities based on user comprehensive similarity Background technique [0002] How to mine useful information from social networks has become a research hotspot in complex networks, which is of great significance both in theory and in social practical value. Because many social networks have a community structure, especially in large-scale social networks, a common feature of this network structure is that the connections between nodes in the community are very close, and the connections between communities are very sparse. As more and more scholars invest in the research of community division, it is found that the algorithm based on the community structure is characterized by greatly reducing the scale of network processing, which greatly improves the time performance efficiency and avoids the influence of nodes to a certain extent. The...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06Q50/00G06K9/62
CPCG06Q50/01G06F18/214
Inventor 郝梓琳周从华施化吉王润宇刘志锋李雷单田华
Owner JIANGSU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products