Unlock instant, AI-driven research and patent intelligence for your innovation.

A Design Method for Parallel Processing Framework Supporting Large-Scale Dynamic Graph Data Query

A parallel processing and data query technology, applied in the design of parallel processing framework and graph data processing framework design, can solve the problems of reduced processing efficiency, high memory performance requirements, weak data computing throughput, etc., to achieve large data scale , the effect of many iterations

Active Publication Date: 2021-08-24
LIAONING UNIVERSITY
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Comparing the two basic frameworks, the MapReduce framework can perform batch processing on large blocks of files, such as figure 1 As shown, the framework has a relatively mature programming interface, is easy to program and implements, and has a high degree of versatility and abstraction. However, for iterative calculations, jobs can only be started multiple times in a chain, resulting in high network transmission overhead and greatly reducing It improves processing efficiency, so it is not suitable for iterative calculations and calculations with high real-time requirements; while the BSP framework introduces the concept of "super step", which is a state framework, such as figure 2 As shown, this framework can better solve the problem of iterative computing, and is suitable for iterative computing and matrix computing. However, because it is a framework based on memory computing, it has high requirements for the memory performance of each node in the cluster, and the existing implementation framework In terms of version stability and various mechanisms, etc., it is not mature enough, and the data calculation throughput is weak

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Design Method for Parallel Processing Framework Supporting Large-Scale Dynamic Graph Data Query
  • A Design Method for Parallel Processing Framework Supporting Large-Scale Dynamic Graph Data Query
  • A Design Method for Parallel Processing Framework Supporting Large-Scale Dynamic Graph Data Query

Examples

Experimental program
Comparison scheme
Effect test

example

[0076] (1) A small cluster is constructed, which consists of three identical PCs, one of which is used as the Master node, and the other two are used as Slave nodes. Use the classic Dijkstra single-source shortest path algorithm to process graph files. In the single-source shortest path algorithm, a given graph G=(V, E, W), where V is the set of vertices, E is the set of directed edges, and W is the set of non-negative weights. Select a vertex v in V as the source, and calculate the shortest path length from v to other vertices, that is, find the minimum value of the sum of the weights of each edge.

[0077] (2) Dijkstra's algorithm generates the shortest path from the source point to each vertex according to the increasing order of the weights between each vertex in the vertex set and the source point. Its algorithm is similar to the breadth-first search traversal algorithm of the graph, that is, to find the shortest path with the smallest weight first, and then refer to it ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention provides a parallel processing framework and design method supporting large-scale dynamic graph data query. The design method is as follows: the parallel processing framework combines the MapReduce framework and the BSP framework, takes the Hadoop framework as the underlying foundation, and adopts the "MapReduce+BSP" mode To meet the needs of large amount of data and multiple iterations of graph calculation, at the same time, the parallel processing framework realizes real-time monitoring function to check the dynamic changes of graph files by building iteration control module, real-time monitoring module, Map stage function design, and Reduce stage function design . The framework designed by the invention combines the advantages of the MapReduce framework and the BSP framework into one, and avoids the disadvantage that the MapReduce framework cannot perform multiple iteration calculations while solving the problem that the BSP framework cannot realize high-throughput calculations.

Description

technical field [0001] The invention belongs to the technical field of large-scale graphs, and in particular relates to a design of a graph data processing framework, in particular to a design of a parallel processing frame supporting large-scale dynamic graph data query. Background technique [0002] With the rapid development of "big data" and "cloud computing" technologies, a new generation of information and communication technologies has emerged. Intelligent transportation, social networking and other fields have been difficult to use traditional Internet technology to meet the needs of the public. How to deal with the complex graph structure behind these fields has become the focus of more and more research institutions. In an environment where the amount of information grows and the relationship between data is complex, the processing technology of large-scale dynamic graphs is changing with each passing day. Research institutions represented by Google, Facebook, Apa...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/27G06F16/28
CPCG06F16/27G06F16/284
Inventor 宋宝燕王俊陆单晓欢丁琳琳张毅隆尹东晓
Owner LIAONING UNIVERSITY