Graph data storing method and subgraph enquiring method based on external memory

A graph data technology applied to storing large-scale graph data, building indexes, and executing subgraph queries based on external memory. It addresses the storage and query requirements of large-scale graph data, among other issues, and achieves high efficiency and high scalability.

Active Publication Date: 2011-11-23
PEKING UNIV

AI Technical Summary

Problems solved by technology

[0005] However, most existing methods and systems for subgraph queries are memory-based, so they support operations only on small-scale graphs and cannot keep up with the current growth in data scale. On the other hand...

Embodiment Construction

[0052] The present invention is a method for storing and querying graph data according to its characteristics. For the overall framework, see Figure 1. The method has two parts: data preprocessing and query execution. Data preprocessing covers the storage and index construction designed for large-scale graph data. The constructed index comprises a cost estimation module based on data histograms, a bitmap index for graph data, and B+-Tree storage. Query execution implements subgraph queries on top of this index and storage. It uses the cost estimation module to decompose the query graph into several sub-patterns, uses the bitmap index to join the sub-patterns and obtain intermediate results, and then uses the graph data stored in the B+-Trees generated during preprocessing to find the matching results of the sub...
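
The query-side use of the bitmap index and the cost estimates can be illustrated with a minimal Python sketch. The source text is truncated and does not specify how sub-patterns are formed or how costs are computed, so everything here is an assumption made for illustration: each sub-pattern is a single query edge, the cost of an edge is the number of stored edges in its class, and the names `candidate_bits`, `order_by_estimated_cost`, `start_bitmap`, `end_bitmap`, and `class_sizes` are hypothetical.

```python
def candidate_bits(query_vertex, query_edges, query_tags, start_bitmap, end_bitmap):
    """Intersect (bitwise AND) the bitmaps of every edge class touching query_vertex.

    Assumption: start_bitmap / end_bitmap map an edge class (start_tag, end_tag)
    to an int whose set bits mark data points that appear as a start / end point
    of at least one edge in that class.
    """
    result = None
    for u, v in query_edges:
        cls = (query_tags[u], query_tags[v])
        if u == query_vertex:
            bits = start_bitmap.get(cls, 0)
            result = bits if result is None else result & bits
        if v == query_vertex:
            bits = end_bitmap.get(cls, 0)
            result = bits if result is None else result & bits
    return result  # None when no query edge touches query_vertex


def order_by_estimated_cost(query_edges, query_tags, class_sizes):
    """Order single-edge sub-patterns by an assumed cost: the size of the edge class."""
    def cost(edge):
        u, v = edge
        return class_sizes.get((query_tags[u], query_tags[v]), 0)
    return sorted(query_edges, key=cost)


# Hypothetical query x(A) -> y(B) -> z(A) against hand-written index contents.
query_tags = {"x": "A", "y": "B", "z": "A"}
query_edges = [("x", "y"), ("y", "z")]
start_bitmap = {("A", "B"): 0b011, ("B", "A"): 0b01}
end_bitmap = {("A", "B"): 0b11, ("B", "A"): 0b100}
class_sizes = {("A", "B"): 3, ("B", "A"): 1}

plan = order_by_estimated_cost(query_edges, query_tags, class_sizes)
y_candidates = candidate_bits("y", query_edges, query_tags, start_bitmap, end_bitmap)
```

In this sketch a data point can match `y` only if it is both an end point of some (A, B) edge and a start point of some (B, A) edge, which is what the bitwise AND expresses before any edge data is read from storage.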

Abstract

The invention discloses a graph data storing method and a subgraph querying method based on external memory, and belongs to the technical field of databases. The graph data storing method comprises the following steps: (1) graph data in different formats are unified into a standard graph data format; (2) the edges of the graph data are classified and stored according to the tag information of each edge's start point and end point, and a B+-Tree index is built for each class of edges; (3) the points of the graph are divided into several domains according to each point's tag information, with each point in a domain corresponding to one bit, in order of point identifier; a bitmap index is then built for each class of edges from step (2) according to the tag information of the edges' start points and end points; and (4) a start point data histogram and an end point data histogram are built for each class of edges. The subgraph querying method comprises the following steps: first, the subgraph to be queried is decomposed; then the sub-patterns obtained by decomposition are queried; finally, the query results are integrated. The methods offer high query efficiency and good scalability.
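
Steps (2) to (4) of the storing method can be sketched in memory as follows. This is an illustration under stated assumptions, not the patented implementation: a sorted Python list stands in for each on-disk B+-Tree, a Python integer stands in for each bitmap, and each histogram is assumed to count how many edges of a class touch a given point (the abstract does not define the histogram contents). The function name `build_indexes` and the input shapes are hypothetical.

```python
from bisect import insort
from collections import Counter, defaultdict


def build_indexes(vertex_tags, edges):
    """vertex_tags: {point_id: tag}; edges: iterable of (start_id, end_id)."""
    # Step (3), domains: group points by tag; a point's bit position is its
    # rank by identifier inside its own domain.
    domains = defaultdict(list)
    for point in sorted(vertex_tags):
        domains[vertex_tags[point]].append(point)
    bit_pos = {p: i for members in domains.values() for i, p in enumerate(members)}

    edge_classes = defaultdict(list)   # step (2); sorted list stands in for a B+-Tree
    start_bitmap = defaultdict(int)    # step (3); one bitmap per edge class
    end_bitmap = defaultdict(int)
    start_hist = defaultdict(Counter)  # step (4); assumed to be per-point edge counts
    end_hist = defaultdict(Counter)

    for start, end in edges:
        cls = (vertex_tags[start], vertex_tags[end])
        insort(edge_classes[cls], (start, end))
        start_bitmap[cls] |= 1 << bit_pos[start]
        end_bitmap[cls] |= 1 << bit_pos[end]
        start_hist[cls][start] += 1
        end_hist[cls][end] += 1

    return {
        "domains": dict(domains),
        "edge_classes": dict(edge_classes),
        "start_bitmap": dict(start_bitmap),
        "end_bitmap": dict(end_bitmap),
        "start_hist": dict(start_hist),
        "end_hist": dict(end_hist),
    }


# Hypothetical toy graph: points 1, 2, 5 carry tag "A"; points 3, 4 carry tag "B".
idx = build_indexes({1: "A", 2: "A", 3: "B", 4: "B", 5: "A"},
                    [(1, 3), (2, 3), (1, 4), (3, 5)])
```

Keeping one structure per (start tag, end tag) class means a query edge with known tags touches only the matching class, which is the motivation the abstract gives for classifying edges before indexing them.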

Description

Technical field

[0001] The invention belongs to the field of database technology and graph data management, and mainly relates to a method for storing large-scale graph data, constructing indexes, and executing subgraph queries based on external memory.

Background art

[0002] A graph database is a new type of database technology that uses graph structures and attributes to represent and store information. It is a type of NoSQL database. A general graph database should be able to store any form of graph, including geographic maps, social networks, and so on.

[0003] Graph databases are based on graph theory and use its concepts of points and edges. Points are often used to represent real-world entities, such as people, companies, accounts, and anything else you want to record. Edges connect two points and indicate the relationship between them. Generally, additional information is attached to a point or an edge....

Application Information

IPC(8): G06F17/30
Inventor: 彭鹏 (Peng Peng), 邹磊 (Zou Lei), 赵东岩 (Zhao Dongyan), 贾爱霞 (Jia Aixia)
Owner PEKING UNIV