Parallel community discovery method and device
A community discovery and social network technology, applied in the field of community discovery solutions, it can solve problems such as computing bottlenecks, and achieve the effects of high fault tolerance, improved stability, and good adaptability
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0058] An embodiment of the present invention provides a community discovery method including:
[0059] Step 1) The input module reads the original text file of social network data from HDFS (Hadoop Distributed File System), models it as an authorized undirected graph model or an unweighted undirected graph, and converts the graph similarity matrix S( Adjacency matrix W) distributed storage on HDFS;
[0060] Step 2) The Laplacian generation module of the community discovery system, the degree matrix D and the Laplacian matrix L of the adjacency matrix of the calculation graph are calculated on the computing cluster configured with the Hadoop environment sym =I-D -1 / 2 SD -1 / 2 ;
[0061] Step 3) The eigendecomposition module of the community discovery system uses the Haoop framework to solve the parallel Lanczos numerical values of the eigenvalues and eigenvectors of the Laplacian matrix, and obtains the first K largest eigenvalues I=λ of the matrix 1 ≥λ 2 ≥…≥λ K , a...
Embodiment 2
[0127] This embodiment provides a system for implementing community discovery, including:
[0128] The input module reads in the original social network data, converts it into the form of adjacency matrix and stores it on the HDFS file system;
[0129] Specifically, the input module reads in the original social network data, models it as a weighted undirected graph model or an unweighted undirected graph model, and distributes and stores the adjacency matrix W of the graph obtained after modeling on HDFS.
[0130] Among them, the original social network data is a text file stored on the HDFS file system, and the format of each line is "username 1 username 2 relationship weight", indicating the relationship strength between two users; or stored in the HDFS file Sequence files on the system; or relational data stored in the database of the Hadoop platform.
[0131] The Laplacian generation module calculates the degree matrix D and the Laplacian matrix L of the adjacency matrix ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com