Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Hadoop based method for analyzing large-scale social network and analysis platform thereof

A social network and analysis method technology, applied in the field of large-scale social network analysis methods and its analysis platform, can solve problems such as difficulties in social network analysis, insufficient support for social network analysis, unbalanced hardware processing capabilities and algorithm analysis capabilities, etc. , to achieve the effect of expanding the scope of application

Inactive Publication Date: 2016-02-24
CHANGCHUN UNIV OF SCI & TECH
View PDF3 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

With the continuous expansion of the scale of the Internet, the scale of user data continues to increase, making the traditional single-core processing model unsuitable for processing massive data. They often have problems such as insufficient storage and processing capabilities when processing massive data, which makes The analysis of large-scale data becomes more difficult. At the same time, the existing distributed processing platforms are not all open source, and the provided algorithms often have limitations in analysis capabilities.
[0003] At present, most big data analysis platforms operate under single-core conditions and rely on the algorithm itself for analysis. When processing massive data, problems such as insufficient storage and processing capabilities often occur, and system resources are not fully utilized. Efficiency not tall
At the same time, the existing distributed processing platforms are not all open source, and the provided algorithms often have limitations in analysis capabilities, and cannot fully support social network analysis, which brings difficulties to social network analysis.
And the existing big data platform does not have a balance of hardware processing capabilities and algorithm analysis capabilities

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Hadoop based method for analyzing large-scale social network and analysis platform thereof
  • Hadoop based method for analyzing large-scale social network and analysis platform thereof
  • Hadoop based method for analyzing large-scale social network and analysis platform thereof

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] The present invention will be described in detail below in conjunction with the accompanying drawings.

[0034] like figure 1 As shown, the present invention provides a large-scale social network analysis method based on Hadoop, comprising the following steps:

[0035] 1), obtain the original data in the social network, and store the original data;

[0036] The method of obtaining the original data in the social network in the step 1) is to crawl online or use an open source database to obtain, and the data formats obtained are different, and the data should be processed in a unified manner, and processed into data in a fixed format Files; raw data is stored in a distributed manner.

[0037] 2), carry out unified processing to described raw data, make raw data generate the data file of fixed format;

[0038] 3) Preprocess the data files so that the data files are converted into the data format of the HNAP system, and then establish a data model that supports graph th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention provides a Hadoop based method for analyzing a large-scale social network and an analysis platform thereof. The method comprises the following steps: 1), acquiring raw data on a social network, and storing the raw data; 2), performing unification processing on the raw data, so that a data file in a fixed format is generated by using the raw data; 3), preprocessing the data file to convert the data file into a data format of an HNAP system, and then establishing a data model that supports a graph theory model and a distributed environment; 4), using an algorithm in an algorithm library of the HNAP system to perform social network analysis on data of the data model in the step 3), integrating output results of the analysis, and generating a document file. According to the method and the platform provided by the present invention, the raw data is processed by using a distributed data processing and analysis tool, so that the data file in the fixed format is generated by using the raw data, and an analysis result is acquired through distributed computing, thereby solving the problem that a processing capability of a single core is insufficient.

Description

technical field [0001] The invention relates to the technical field of data batch processing, in particular to a Hadoop-based large-scale social network analysis method and an analysis platform thereof. Background technique [0002] The rise of social networks generates massive user data, which provides conditions for extracting further useful information and mining potential business opportunities. With the continuous expansion of the scale of the Internet, the scale of user data continues to increase, making the traditional single-core processing model unsuitable for processing massive data. They often have problems such as insufficient storage and processing capabilities when processing massive data, which makes The analysis of large-scale data becomes more difficult. At the same time, the existing distributed processing platforms are not all open source, and the provided algorithms often have limitations in analysis capabilities. [0003] At present, most big data analy...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/283G06F16/182
Inventor 王鹏杨迪李松江杨华民邱宁佳高铖
Owner CHANGCHUN UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products