Specific group discovery and expansion method based on microblog data

A group and microblogging technology, applied in the field of social network analysis and data mining, can solve problems such as inability to accurately find groups, and achieve the effect of convenient and quick group discovery, reduced complexity, and high modularity

Inactive Publication Date: 2016-06-08
BEIJING INSTITUTE OF TECHNOLOGYGY
View PDF5 Cites 25 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] Considering that the most accessible and comprehensive information in massive data is the text data information published by social users, we propose a group discovery and expansion method based on text data, which mainly performs natural language processing on user text data and finally extracts The characteristic information of the user is obtained, and modeling is carried out according to the characteristic information. Finally, cluster analysis is carried out by comparing the similarities between users, and finally the group community is obtained, and the main signs of the group are extracted to expand the group, which solves the problem of user In the case of sparse relationship link data, it is impossible to accurately perform group discovery

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Specific group discovery and expansion method based on microblog data
  • Specific group discovery and expansion method based on microblog data
  • Specific group discovery and expansion method based on microblog data

Examples

Experimental program
Comparison scheme
Effect test

specific Embodiment approach

[0023] The overall process of the group discovery and expansion method based on Weibo text data is as follows: figure 1 As shown, taking Sina Weibo's "machine learning" field as an example, we have established a representative feature library of multiple categories of groups and groups according to steps 1 to 5, one of which is the "machine learning" category. , and then found 50 suspected relevant users, the goal is to find out the real relevant users, so as to expand the "machine learning" category. The specific method is to follow steps 1 to 3 of 50 suspected related users to obtain their respective feature vocabulary vectors, and then follow step 6 to perform category matching to determine whether the user belongs to the "machine learning" category. The following is a detailed implementation description according to each step of the algorithm.

[0024] Collect relevant information according to step 1:

[0025] Collect the Sina Weibo data we want to study or directly obta...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a specific group discovery and expansion method based on microblog data, and belongs to the field of social network analysis and data mining. The specific group discovery and expansion method comprises the following specific steps: collecting relevant group information; carrying out information integration and mapping; aiming at text data to carry out characteristic extraction; calculating a user similarity degree; carrying out the self-detection of a category group; and extracting the attributes of the specific group, judging a category, and carrying out group expansion. The specific group discovery and expansion method artfully avoids the problem that group identification can not be carried out since data is sparse or incomplete when a network model is used, inputs large-scale data calculation and is high in stability.

Description

technical field [0001] The method involves the discovery and expansion of some specific text groups in social networks, especially the discovery and expansion of specific groups of microblog data, and belongs to the field of social network analysis and data mining. Background technique [0002] In the social network, users can independently release their own information and also see other people's shared information, and then build a social network in the virtual age. This sharing platform has the characteristics of timely sharing, real-time, interactive, etc., and also has the communication characteristics of traditional social society, and has become an integral part of people's work and life. [0003] In the microblog platform, data mining can obtain high information value for a large amount of text data generated by users. Therefore, it is necessary to use efficient data mining methods and machine learning algorithms to mine useful information and fully extract valuable...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06F17/30G06Q50/00G06K9/62
CPCG06F16/9535G06Q50/01G06F40/289G06F18/23G06F18/24
Inventor 吴松泽张华平徐程程王洋王琦李高超付戈
Owner BEIJING INSTITUTE OF TECHNOLOGYGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products