Dynamic power law diagram storage method based on LSM tree

An LSM tree, a dynamic technology, applied in the computer field to achieve good practical value and prospects, reduce data merging overhead and storage resource waste

Active Publication Date: 2021-05-18
NANJING UNIV OF POSTS & TELECOMM
View PDF4 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Purpose of the invention: In order to solve the problems existing in the existing graph storage technology, the present invention provides a dynamic power-law graph storage method based on the LSM tree, which stores the low-degree vertices and the high-degree vertices separately, and according to the power of the current degree of the graph data Allocate memory with statistical information based on power-law distribution, and configure the corresponding number of concurrent overflow writing threads, which can alleviate the problems of data merging overhead and storage resource waste caused by frequent graph data updates based on power-law distribution, and is suitable for storing data with dynamic graph structures

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Dynamic power law diagram storage method based on LSM tree
  • Dynamic power law diagram storage method based on LSM tree
  • Dynamic power law diagram storage method based on LSM tree

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0022] Below in conjunction with accompanying drawing and specific embodiment, further illustrate the present invention, should be understood that these examples are only for illustrating the present invention and are not intended to limit the scope of the present invention, after having read the present invention, those skilled in the art will understand various aspects of the present invention All modifications of the valence form fall within the scope defined by the appended claims of the present application.

[0023] A dynamic power-law graph data storage method based on LSM tree, such as figure 1 As shown in , it mainly includes two memory vertex tables, vertex out-degree statistics table, corresponding disk partition files, and multiple concurrent overflow writing threads. The operating environment can be selected either in a single-node scenario or in a distributed scenario.

[0024] The specific process is as figure 2 As shown, it mainly includes the following steps...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a dynamic power law diagram storage method based on an LSM tree, and the method comprises the steps: carrying out the statistics of the out-degree information of the vertexes of a diagram during the dynamic updating of diagram data, and carrying out the separated storage of the large-degree vertexes and common vertexes in the form of a memory vertex table; allocating memories for the large vertex table and the common vertex table according to a power law distribution proportion; and when the data volume of each vertex table exceeds a threshold value, performing concurrent overwriting on the graph data according to the thread count corresponding to the memory allocation proportion. According to the method, the problems of data merging overhead, storage resource waste and the like caused by frequent updating of graph data based on power law distribution can be relieved, and the method has good practical value and prospect in the field of knowledge graphs and graph calculation.

Description

technical field [0001] The invention belongs to the technical field of computers, and in particular relates to a data storage method of a dynamic power-law graph based on an LSM tree. Background technique [0002] In recent years, graph computing has been increasingly used in fields such as complex networks, deep learning, brain science, and community discovery. With the increasing scale and complexity of graph datasets, how to design an efficient graph storage mechanism has attracted more and more attention. In response to this problem, there have been PB-level distributed file systems based on key-value pairs such as GFS (Google File System) and HDFS (Hadoop File System). The sequential reading of the hard disk is more efficient than random reading, so as to improve the throughput and have good scalability. [0003] However, in a real network, graph data is often updated in real time, and there is a scale-free phenomenon in the degree distribution, that is, the node degr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/22
CPCG06F16/2246G06F16/2282Y02D10/00
Inventor 刘强季一木刘尚东吴飞胡林汤淑宁刘凯航
Owner NANJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products