An RDF distributed storage method based on a multi-layer partitioning framework

A distributed storage framework technology, applied in text database clustering/classification, unstructured text data retrieval, semantic tool creation, etc. It addresses the problems of poor load balance and high communication cost, achieving low communication cost, improved storage and query efficiency, and reduced data scale.

Active Publication Date: 2019-02-15
XI AN JIAOTONG UNIV

AI Technical Summary

Problems solved by technology

[0008] The purpose of the present invention is to provide an RDF distributed storage method based on a multi-layer partition framework that overcomes the defects of the prior art, namely poor load balance and high communication cost between physical nodes.

Examples

Embodiment Construction

[0059] The present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments.

[0060] As shown in Figure 1, the RDF distributed storage method based on the multi-layer partition framework provided by the present invention coarsens the RDF graph using the MMA algorithm and the MSLM algorithm, and performs k-way partitioning of the RDF graph using the B_AP algorithm. The specific steps are as follows:

[0061] Step 1: perform the following initialization operations:

[0062] 101) Initialization of the RDF graph: let the subject set of the RDF triples be T_s, the predicate set be T_p, and the object set be T_o. The RDF graph is then G = (V, E), where V = {v | v ∈ T_s ∪ T_o}. Let n = |V| denote the number of vertices in the RDF graph and m = |E| the number of edges.
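
The initialization in 101) can be illustrated with a minimal Python sketch. The excerpt does not define E explicitly, so the sketch assumes that each triple (s, p, o) contributes one directed edge from s to o labeled with p. The function name build_rdf_graph and the sample triples are hypothetical and are not taken from the patent.

```python
from typing import Iterable, Set, Tuple

Triple = Tuple[str, str, str]  # (subject, predicate, object)


def build_rdf_graph(triples: Iterable[Triple]) -> Tuple[Set[str], Set[Triple]]:
    """Return (V, E) with V = subjects ∪ objects.

    Assumption: every triple becomes one labeled edge (s, p, o).
    Predicates label edges and are not added to V.
    """
    vertices: Set[str] = set()
    edges: Set[Triple] = set()
    for s, p, o in triples:
        vertices.add(s)
        vertices.add(o)
        edges.add((s, p, o))
    return vertices, edges


# Hypothetical sample data for illustration only.
triples = [
    ("ex:alice", "ex:knows", "ex:bob"),
    ("ex:bob", "ex:knows", "ex:carol"),
    ("ex:alice", "ex:worksAt", "ex:xjtu"),
]

V, E = build_rdf_graph(triples)
n, m = len(V), len(E)  # n = |V|, m = |E|
```

Here a vertex that appears both as a subject and as an object (such as ex:bob) is counted once, which matches the set union in the definition of V.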

[0063] 102) Data preprocessing: process all N-Triple data sets of RDF data int...

Abstract

The invention discloses an RDF distributed storage method based on a multi-layer partition framework. The main steps comprise: (1) optimizing vertex movement in the RDF graph with the MMA algorithm so as to protect small communities in the RDF graph; (2) coarsening the RDF graph with the MSLM algorithm, which finds the community structure of the RDF graph and reduces the scale of the RDF data; (3) performing k-way partitioning with the B_AP algorithm, so that the amount of data on each physical storage node is relatively balanced and the communication cost between nodes is reduced. The invention provides a complete RDF distributed storage method, which lays a foundation for improving RDF query efficiency.
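
The two goals named in step (3), balanced data volume and low communication cost, can be made concrete with a small Python sketch. It does not implement B_AP, which this excerpt does not describe. It only assumes two common proxies from the graph partitioning literature: the number of cut edges as a stand-in for communication cost, and the largest partition size relative to the ideal size n / k as a stand-in for balance. The names edge_cut and balance_ratio and the sample graph are hypothetical.

```python
from collections import Counter
from typing import Dict, Hashable, Iterable, Tuple

Edge = Tuple[Hashable, Hashable]


def edge_cut(edges: Iterable[Edge], part: Dict[Hashable, int]) -> int:
    """Count edges whose endpoints are assigned to different partitions."""
    return sum(1 for u, v in edges if part[u] != part[v])


def balance_ratio(part: Dict[Hashable, int], k: int) -> float:
    """Largest partition size divided by the ideal size n / k.

    A value of 1.0 means the vertices are spread perfectly evenly.
    """
    sizes = Counter(part.values())
    ideal = len(part) / k
    return max(sizes.values()) / ideal


# Hypothetical 4-vertex graph split into k = 2 partitions.
edges = [("a", "b"), ("b", "c"), ("c", "d"), ("d", "a"), ("b", "d")]
part = {"a": 0, "b": 0, "c": 1, "d": 1}

cut = edge_cut(edges, part)        # edges crossing between partitions
ratio = balance_ratio(part, k=2)   # 1.0 when both partitions hold 2 vertices
```

Under these assumptions, a partitioning is better when the cut is smaller at a balance ratio close to 1.0.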

Description

Technical field

[0001] The invention belongs to the field of distributed storage, and in particular relates to an RDF distributed storage method based on a multi-layer partition framework.

Background

[0002] With the rapid development of the Semantic Web, the volume of data in RDF (Resource Description Framework), the core standard of the Semantic Web, has grown explosively. The storage and query management of large-scale RDF data has become a research hotspot. Traditional single-machine RDF storage and query systems scale poorly and struggle to manage ultra-large-scale RDF data, so distributed RDF storage and query has become a research trend, and how to better partition RDF data and execute distributed queries has become the focus of RDF research on distributed systems.

[0003] Distributed systems and cloud computing platforms have made great progress in various fields because of their advantages such as large storage spa...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/36, G06F16/35
Inventor: 刘均, 王瑞杰, 晋毓泽, 张铎, 魏笔凡, 王萌, 姚思雨, 曾宏伟
Owner: XI AN JIAOTONG UNIV