Method, device and system for processing massive data of graph structure

A graph structure, large-scale technology, applied in the field of distributed computing, can solve the problems of reducing computing efficiency, destroying computing locality, increasing communication traffic, etc., and achieve the effect of improving computing efficiency, reducing communication traffic, and reducing the demand for bandwidth resources

Active Publication Date: 2014-03-12
SHENZHEN INST OF ADVANCED TECH CHINESE ACAD OF SCI
View PDF1 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Giraph relies on the effective partitioning of graph-structured data to reduce the amount of communication (mainly sending and receiving messages) and maintain load balance. However, the partitioning of graph-structured data is based on random partitioning of hash functions, thus destroying

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method, device and system for processing massive data of graph structure
  • Method, device and system for processing massive data of graph structure
  • Method, device and system for processing massive data of graph structure

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] An embodiment of the present invention provides a large-scale graph structure data processing method, including: reading the graph structure data from the node into the memory; performing preprocessing on the graph structure data in the memory to obtain the adjacent The vertices are in at least one data slice of the same data slice; the at least one data slice obtained by the preprocessing is mapped to a slave node; the slave node uses an iterative algorithm to calculate the data slice mapped to the slave node. Embodiments of the present invention also provide a corresponding large-scale graph structure data processing device and system. Each will be described in detail below.

[0032] The basic flow of the large-scale graph structure data processing method in the embodiment of the present invention can be referred to figure 1 , mainly including the following steps S101 to S104:

[0033] S101, the slave node reads the graph structure data into the memory.

[0034] In...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method, device and system for processing massive data of a graph structure. Data computational efficiency can be improved, and operational reliability of a system can be improved. The method for processing the massive data of the graph structure comprises the steps of using a slave node to read the graph structure data to a memory; preprocessing the graph structure data in the memory to obtain at least one data piece in the graph structure data, wherein every two adjacent vertexes of the data piece are arranged on the same data piece; mapping the data piece obtained through preprocessing to the slave node; using the slave node to compute the data piece which is mapped to the slave node through an iterative algorithm. Due to the fact that information of every two adjacent vertexes is mapped to the same slave node instead of different slave nodes, when the slave node computes the data piece which is mapped to the slave node through the iterative algorithm, the slave node does not need to be in communication with other slave nodes, the traffic is reduced, the requirement for bandwidth resources in a cluster is reduced, and computational efficiency of a cluster system and computational efficiency of nodes of the cluster system are improved.

Description

technical field [0001] The invention relates to the field of distributed computing, in particular to a large-scale graph structure data processing method, device and system. Background technique [0002] Graph is a data structure formed by a collection of vertices and the relationship between vertices, that is, a collection of edges. A graph can also be called a network. Many structures in practical applications can be expressed in the form of graphs. For example, in a matrix, each row and column corresponds to a vertex, and the value at the intersection of a row and a column is not zero. The edge exists, and the weight of the edge is the size of the value; another example is a city map, each city (or resident address) is a vertex, and the routes connecting different cities (or resident addresses) are called edges. A graph structure is an abstract data structure that is a model of objects (vertices, nodes) and their relationships (edges). In the real world, the distributio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30G06F12/02
CPCG06F16/51
Inventor 刘明君赵中英冯铮何一峰冯圣中
Owner SHENZHEN INST OF ADVANCED TECH CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products