
Distributed graph data partitioning and copying method based on load driving

The invention relates to graph data replication in the computer field. It addresses disaster recovery for distributed graph data and the expensive network and storage overhead of full replication, and it achieves a reduced number of edge cuts, high throughput, and improved query efficiency.

Active Publication Date: 2021-05-07
WUHAN UNIV
Cites: 6; Cited by: 2

AI Technical Summary

Problems solved by technology

[0005] Another problem is disaster recovery for distributed graph data. To prevent data loss caused by operational errors or system failures, the data must be backed up, that is, replicated. The traditional solution performs a full backup of all data with a fixed replication factor, which incurs expensive network and storage overhead.
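To make the overhead concrete, the following is a minimal sketch that compares storage under a fixed replication factor with storage when only selected vertices receive extra copies. The function names, the per-vertex size, and the sample numbers are assumptions chosen for illustration; they do not come from the patent.

```python
def full_replication_storage(num_vertices, bytes_per_vertex, replication_factor):
    """Total bytes when every vertex is stored replication_factor times."""
    return num_vertices * bytes_per_vertex * replication_factor


def selective_replication_storage(num_vertices, bytes_per_vertex, replica_counts):
    """Total bytes when only some vertices have extra copies.

    replica_counts maps a vertex id to its total number of copies;
    vertices that are not listed are stored exactly once.
    """
    extra_copies = sum(count - 1 for count in replica_counts.values())
    return (num_vertices + extra_copies) * bytes_per_vertex


# Hypothetical numbers: 1000 vertices of 100 bytes each.
full = full_replication_storage(1000, 100, 3)
selective = selective_replication_storage(1000, 100, {1: 3, 2: 2})
print(full, selective)
```

With a fixed factor the cost grows with the whole data set, while in the selective case it grows only with the number of extra copies.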




Embodiment Construction

[0100] The present invention is based mainly on graph topology and takes into account the characteristics of dynamic graphs and of user workloads. It proposes an experimental method and system for adaptive partitioning and replication of distributed graph data. The method accounts for different users having different workloads. By adaptively adjusting where graph vertices are stored, it lets a single query visit as few server nodes as possible, which improves query efficiency and ensures low latency and high throughput. The results obtained by the invention are more scientific and more accurate.
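The goal of visiting as few server nodes as possible in a single query can be illustrated with a minimal sketch. The placement map, the vertex names, and the `servers_touched` helper are hypothetical and are not taken from the patent.

```python
def servers_touched(query_vertices, placement):
    """Count the distinct servers that hold at least one vertex of the query."""
    return len({placement[v] for v in query_vertices})


# Hypothetical placement: vertex id -> server id.
placement = {"a": 0, "b": 1, "c": 1}
query = ["a", "b", "c"]

before = servers_touched(query, placement)
placement["a"] = 1  # move vertex "a" onto the server that holds "b" and "c"
after = servers_touched(query, placement)
print(before, after)
```

In this toy case, moving one vertex turns a two-server query into a single-server query, which is the effect the adaptive placement is meant to achieve.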

[0101] The method provided by the invention can be implemented with computer software. Referring to Figure 1, the process is described in detail below, taking the partitioning and replication of distributed graph data on a cluster as an example:

[0102] A workload-driven di...


Abstract

The invention provides a workload-driven method for partitioning and replicating distributed graph data. The method comprises the following steps: storing the data of a graph data set across a server cluster using a hash table method; constructing an active vertex set from the vertex data set by combining the user's workload information with a load judgment on the source server; scoring each active vertex against the servers in the cluster with a scoring function, constructing a scoring list of target servers, and determining the target migration server through a load judgment on the target server; and, for each active vertex and its target migration server, judging against a threshold whether the vertex is a high-read vertex and, if it is not, further dynamically adjusting the vertex's replica data subject to the maximum replication factor. Because the method dynamically adjusts the placement of graph vertex data as the workload changes, it improves query efficiency and ensures low latency and high throughput for queries.
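The partitioning and migration steps of the abstract can be sketched roughly as follows. This is an illustration under stated assumptions, not the patented procedure: the modulo placement stands in for the hash table method, the "active vertex" rule (frequently accessed vertices on an overloaded server), the neighbour-count scoring function, and all names and thresholds are hypothetical. The replica adjustment step is left out because the abstract does not say how copies are changed beyond the maximum replication factor.

```python
def hash_partition(vertices, num_servers):
    """Initial placement: integer vertex id modulo server count (stand-in hash)."""
    return {v: v % num_servers for v in vertices}


def server_loads(placement, access_counts, num_servers):
    """Load of a server = sum of access counts of the vertices it stores."""
    loads = [0] * num_servers
    for v, s in placement.items():
        loads[s] += access_counts.get(v, 0)
    return loads


def rank_targets(v, edges, placement, num_servers):
    """Assumed scoring: servers holding more neighbours of v rank first."""
    scores = [0] * num_servers
    for n in edges.get(v, ()):
        scores[placement[n]] += 1
    return sorted(range(num_servers), key=lambda s: (-scores[s], s))


def rebalance(placement, edges, access_counts, num_servers, load_limit, min_access):
    """Migrate active vertices from overloaded servers; return {vertex: target}."""
    loads = server_loads(placement, access_counts, num_servers)
    # Assumed rule: active = frequently accessed vertex on an overloaded source.
    active = [v for v, s in placement.items()
              if access_counts.get(v, 0) >= min_access and loads[s] > load_limit]
    moves = {}
    for v in active:
        source, cost = placement[v], access_counts[v]
        if loads[source] <= load_limit:
            continue  # an earlier migration already relieved this server
        for target in rank_targets(v, edges, placement, num_servers):
            # Load judgment on the target: it must stay within the limit.
            if target != source and loads[target] + cost <= load_limit:
                loads[source] -= cost
                loads[target] += cost
                placement[v] = target
                moves[v] = target
                break
    return moves


# Hypothetical workload: vertex 0 is hot and its neighbours live on server 1.
placement = hash_partition(range(6), 2)
edges = {0: [1, 3], 1: [0], 3: [0]}
access_counts = {0: 6, 1: 1, 2: 3, 3: 1, 4: 3, 5: 1}
moves = rebalance(placement, edges, access_counts, 2, load_limit=10, min_access=5)
final_loads = server_loads(placement, access_counts, 2)
print(moves, final_loads)
```

In this toy run, server 0 starts with a load of 12 against a limit of 10, so the hot vertex 0 is moved to server 1, where both of its neighbours are stored and the resulting load of 9 stays within the limit.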

Description

Technical field

[0001] The invention belongs to the field of computers, and in particular relates to a load-driven method for partitioning and replicating distributed graph data.

Background

[0002] In recent years, as social network and World Wide Web graph data have kept growing and multi-user access demands have kept expanding, an ordinary single database server can no longer provide high-quality multi-user access with its existing resources. The traditional solution is to scale vertically and replicate the data completely, but the resulting cost is unacceptably high, so distributed data storage has emerged. Distributed data storage uses clusters of inexpensive servers for horizontal partitioned scaling and partial replicated backup, and it supports concurrent graph data processing. In the face of distributed parallel data storage, how to reasonably store graph data in different nodes, so that users can improve the...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/22, G06F16/2453, G06F16/2455, G06F16/27
CPC: G06F16/2255, G06F16/2453, G06F16/2455, G06F16/27
Inventor: 涂宏伟, 刘梦赤
Owner: WUHAN UNIV