Graph-based RDF data management method and apparatus, and storage medium

A data management and physical storage address technology, applied in the field of graph-based RDF data management methods, devices and storage media, can solve the problems of underdeveloped SSD performance and low data management performance, and achieve scalable RDF data management, Improve data management performance, the effect of high parallelism

Inactive Publication Date: 2019-08-09
天津大学深圳研究院
View PDF10 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The main purpose of the embodiments of the present invention is to provide a graph-based RDF data management method, device, and storage medium, which can at least solve the problem of poor SSD performance caused by random storage of RDF data on SSDs in the related art. Full performance, and issues with low SSD-based data management performance

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Graph-based RDF data management method and apparatus, and storage medium
  • Graph-based RDF data management method and apparatus, and storage medium
  • Graph-based RDF data management method and apparatus, and storage medium

Examples

Experimental program
Comparison scheme
Effect test

no. 1 example

[0028] In order to solve the technical problems that the RDF data is randomly stored in the idle storage unit on the SSD, the performance of the SSD is not fully utilized, and the performance of the data management based on the SSD is low. This embodiment proposes a graph-based The RDF data management method applied to SSDs with multiple storage units. Such as figure 1 Shown is a schematic flow chart of the graph-based RDF data management method provided in this embodiment. The graph-based RDF data management method proposed in this embodiment includes the following steps:

[0029] Step 101. Create an RDF graph based on the RDF data to be stored; each element of all triples in the RDF data to be stored corresponds to a node on the RDF graph.

[0030] Specifically, RDF data contains multiple resource descriptions, and a resource description is composed of multiple sentences, and a sentence is a triplet composed of resources, attributes, and attribute values. A statement in a ...

no. 2 example

[0050] In order to solve the technical problems in the related art that the RDF data is randomly stored in the idle storage unit on the SSD, the performance of the SSD is not fully utilized, and the performance of the data management based on the SSD is low. This embodiment shows a graph-based The RDF data management device of , applied to SSD with multiple storage units. For details, please refer to Figure 5 , the RDF data management device in this embodiment includes:

[0051] The creating module 501 is used to create an RDF graph based on the RDF data to be stored; each element of all triples in the RDF data to be stored corresponds to a node on the RDF graph;

[0052] The storage module 502 is configured to store each node corresponding to each element in the triplet in storage units with different physical storage addresses on the SSD;

[0053] The saving module 503 is used for storing the storage unit stored in the upper node, saving the physical storage address list ...

no. 3 example

[0063] This embodiment provides an electronic device, see Image 6 As shown, it includes a processor 601, a memory 602, and a communication bus 603, wherein: the communication bus 603 is used to realize connection and communication between the processor 601 and the memory 602; the processor 601 is used to execute one or more programs stored in the memory 602 A computer program to implement at least one step in the graph-based RDF data management method in the first embodiment above.

[0064] The present embodiment also provides a computer-readable storage medium, which includes information implemented in any method or technology for storing information, such as computer-readable instructions, data structures, computer program modules, or other data. volatile or nonvolatile, removable or non-removable media. Computer-readable storage media include but are not limited to RAM (Random Access Memory, random access memory), ROM (Read-Only Memory, read-only memory), EEPROM (Electric...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

Embodiments of the invention disclose a graph-based RDF data management method and apparatus, and a storage medium. The method comprises the steps of creating an RDF graph based on to-be-stored RDF data; respectively storing nodes corresponding to elements in the triple on the RDF graph in different storage units on the SSD; and storing a physical storage address list comprising the physical storage addresses of all the lower-level nodes corresponding to the upper-level node in a storage unit stored in the upper-level node, and storing a corresponding relation between each node and the physical storage addresses of the storage unit stored in each node in a node address index table. Through the implementation of the method and the device, the RDF data is converted into the graph data to bemanaged, the structural characteristics of the RDF data are better reserved, the data can be conveniently explored from any node, the realization of comprehensive and extensible RDF data management isfacilitated, the high parallelism of the SSD is fully utilized, and the data management performance on the SSD is greatly improved.

Description

technical field [0001] The invention relates to the field of data storage, in particular to a graph-based RDF data management method, device and storage medium. Background technique [0002] In the era of big data, information presents highly unstructured and free and rich correlations, and the data sets of many knowledge bases such as Weibo and Facebook are usually stored in the form of Resource Description Framework (RDF, Resource Description Framework). RDF data is actually composed of some columns of triplet data, where each triplet is composed of three elements: resource, attribute and attribute value, also known as subject (Subject), predicate (predicate) and the object (Object). [0003] With the popularity of RDF in recent years, the amount of RDF data has increased greatly, and billions of triples have been generated in many RDF datasets (such as Wikipedia). Therefore, how to effectively manage these huge RDF data has become a huge challenge. At present, RDF data...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/31G06F16/18G06F16/13
CPCG06F16/13G06F16/1847G06F16/31
Inventor 陈仁海燕国骅关启明冯志勇
Owner 天津大学深圳研究院
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products