An RDF distributed storage method based on a multi-layer partition framework

A distributed storage and partition framework technology, applied in text database clustering/classification, unstructured text data retrieval, semantic tool creation, etc. It addresses the problems of poorly balanced data and high communication cost between physical nodes, achieving low communication cost, a reduced RDF data scale, and improved storage and query efficiency.

Active Publication Date: 2022-02-22
XI AN JIAOTONG UNIV

AI Technical Summary

Problems solved by technology

[0008] The purpose of the present invention is to provide an RDF distributed storage method based on a multi-layer partition framework that overcomes the defects of the prior art, namely poorly balanced data and high communication cost between physical nodes.


Examples


Embodiment Construction

[0059] The present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments.

[0060] As shown in Figure 1, the RDF distributed storage method based on the multi-layer partition framework provided by the present invention coarsens the RDF graph through the MMA algorithm and the MSLM algorithm, and performs k-way partitioning of the RDF graph through the B_AP algorithm. The specific steps are as follows:

[0061] Step 1: perform the following initialization operations:

[0062] 101) Initialization of the RDF graph: let the subject set of the RDF triples be $T_s$, the predicate set be $T_p$, and the object set be $T_o$. The RDF graph is then $G = (V, E)$, where $V = \{v \mid v \in T_s \cup T_o\}$. Let $n = |V|$ denote the number of vertices in the RDF graph and $m = |E|$ the number of edges.
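The initialization in 101) can be illustrated with a minimal Python sketch. It assumes that each triple contributes one labeled, directed edge from its subject to its object; the excerpt does not define $E$ explicitly, so that reading is an assumption. The function name `build_rdf_graph` and the `ex:` sample triples are hypothetical.

```python
from typing import Iterable, Set, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)


def build_rdf_graph(triples: Iterable[Triple]) -> Tuple[Set[str], Set[Triple]]:
    """Build G = (V, E) from RDF triples.

    V is the union of all subjects and objects (T_s ∪ T_o).
    E is assumed here to hold one labeled edge per distinct triple.
    """
    vertices: Set[str] = set()
    edges: Set[Triple] = set()
    for s, p, o in triples:
        vertices.add(s)
        vertices.add(o)
        edges.add((s, p, o))
    return vertices, edges


# Hypothetical sample data, for illustration only.
triples = [
    ("ex:Alice", "ex:knows", "ex:Bob"),
    ("ex:Bob", "ex:knows", "ex:Carol"),
    ("ex:Alice", "ex:worksAt", "ex:XJTU"),
]

V, E = build_rdf_graph(triples)
n, m = len(V), len(E)  # n = |V|, m = |E|
print(n, m)
```

Predicates label edges rather than becoming vertices, which matches the definition of $V$ as drawn only from subjects and objects.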

[0063] 102) Data preprocessing: all N-Triples data sets of the RDF data are converted into graph format to...



Abstract

The invention discloses an RDF distributed storage method based on a multi-layer partition framework. The main steps are: (1) optimizing the movement of vertices in the RDF graph through the MMA algorithm, so as to protect small communities in the RDF graph; (2) coarsening the RDF graph with the MSLM algorithm, which discovers the community structure in the RDF graph and, on that basis, reduces the scale of the RDF data; (3) performing k-way partitioning of the RDF graph through the B_AP algorithm, so that the amount of data on each physical storage node is relatively balanced and the communication cost between nodes is reduced. The invention proposes a complete RDF distributed storage method, which lays a foundation for improving RDF query efficiency.
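The two goals named for the k-way partition, balanced data volume and low inter-node communication, are commonly measured by a balance ratio and an edge cut. The sketch below computes both for a given partition; it is not the B_AP algorithm, whose details are not in this excerpt. The function names, the toy graph, and the use of cut edges as a proxy for communication cost are assumptions.

```python
from collections import Counter
from typing import Dict, Iterable, Tuple


def edge_cut(edges: Iterable[Tuple[str, str]], part: Dict[str, int]) -> int:
    """Count edges whose endpoints lie in different partitions
    (used here as a proxy for inter-node communication cost)."""
    return sum(1 for u, v in edges if part[u] != part[v])


def balance(part: Dict[str, int], k: int) -> float:
    """Ratio of the largest partition size to the ideal size n / k.
    A value of 1.0 means perfectly balanced."""
    sizes = Counter(part.values())
    ideal = len(part) / k
    return max(sizes.get(i, 0) for i in range(k)) / ideal


# Hypothetical toy graph and 2-way partition, for illustration only.
edges = [("a", "b"), ("b", "c"), ("c", "d"), ("d", "a"), ("a", "c")]
part = {"a": 0, "b": 0, "c": 1, "d": 1}

cut = edge_cut(edges, part)
bal = balance(part, 2)
print(cut, bal)
```

A partitioner in this setting would aim to keep the balance ratio close to 1.0 while making the edge cut as small as possible.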

Description

Technical field

[0001] The invention belongs to the field of distributed storage, and in particular relates to an RDF distributed storage method based on a multi-layer partition framework.

Background

[0002] With the rapid development of the Semantic Web, the volume of data in RDF (Resource Description Framework), the core standard of the Semantic Web, has grown explosively, and the storage and query management of large-scale RDF data has become a research hotspot. Traditional single-machine RDF storage and query systems scale poorly and struggle to manage ultra-large-scale RDF data, so distributed RDF storage and query has become a research trend, and how to better partition RDF data and perform distributed queries has become the focus of research on distributed RDF systems.

[0003] Due to the advantages of large storage space and strong scalability, distributed systems and cloud computing platforms have made great progress ...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F16/36, G06F16/35
Inventor: 刘均, 王瑞杰, 晋毓泽, 张铎, 魏笔凡, 王萌, 姚思雨, 曾宏伟
Owner XI AN JIAOTONG UNIV