RDF data storage and query method combined with star figure coding

A technology of data storage and query methods, which is applied in the field of RDF data storage and query combined with star graph coding, and can solve problems such as storage space becoming larger.

Inactive Publication Date: 2015-03-25
FUZHOU UNIV
View PDF3 Cites 28 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Multiple triples as a query subtask require redundant

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • RDF data storage and query method combined with star figure coding
  • RDF data storage and query method combined with star figure coding
  • RDF data storage and query method combined with star figure coding

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0060] Below in conjunction with accompanying drawing and embodiment the present invention will be further described

[0061] This embodiment provides a method for storing and querying RDF data combined with star diagram encoding, such as figure 1 shown, including the following steps:

[0062] Step S1: Preprocessing the RDF data, presenting the RDF data as an RDF data graph; the preprocessing includes a star data segmentation stage and a star graph encoding stage;

[0063] Step S2: Present the input SPARQL query statement in the form of a SPARQL query graph, decompose the query, parse it into star nodes, and form a query subgraph G;

[0064] Step S3: Preprocessing the SPARQL query statement to obtain the number of tasks in the entire query, the connection sequence of the query star child nodes, and the relevant information of the query star child nodes; the relevant information of the query star child nodes includes Subject type, query variable, connection variable, inde...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to an RDF data storage and query method combined with star figure coding. The RDF data storage and query method comprises the steps that S1, RDF data are preprocessed, and the RDF data are presented in an RDF data map mode; S2, an input SPARQL query statement is presented in an SPARQL query graph mode, and query decomposition is carried out; S3, the SPARQL query statement is preprocessed, and the task number of whole query, the connecting sequence of query star sub-nodes and relevant information of the query star sub-nodes are obtained; S4, the SPARQL query statement is executed, query connection planning is carried out, a Map Reduce parallel computation frame of Hadoop is adopted, and the number of times of starting a query task Job is decided according to the relevance of the SPARQL query statement; S5, subgraph query is carried out, and a Map function is adopted; S6, a result connecting algorithm is carried out, and a Reduce function is adopted. Due to the fact that a Hash coding index query strategy based on star configuration is adopted, stored data redundancy and the number of query tasks are reduced, and query efficiency is improved.

Description

technical field [0001] The invention belongs to the technical field of massive RDF data management, and in particular relates to a method for storing and querying RDF data combined with star diagram coding. Background technique [0002] At present, some researches have proposed RDF data storage and management based on cloud platform. For example: (1) Use SimpleDB (a Key-Value Store provided by AWS) to answer SPARQL queries, and propose a series of index methods to determine which RDF can be quickly determined by using its index structure after a given query Datasets may contain query results. (2) Using Hadoop to store and retrieve RDF datasets, and discuss the query plan generation algorithm under this platform. (3) Divide the data into multiple small files according to the type of attributes and objects and store them in HDFS. In terms of query processing, a greedy MapReduce job generation algorithm is adopted, and multiple jobs iteratively process the connection operatio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/30
CPCG06F16/81G06F16/8373
Inventor 汪璟玢卢桂芳
Owner FUZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products