SPARQL semantic data query optimization method based on connection cost

A data query and connection cost technology, which is applied in the direction of electrical digital data processing, special data processing applications, instruments, etc., can solve the problem of low efficiency of massive RDF semantic data query, and achieve the goal of improving user experience, improving efficiency, and rapid result feedback Effect

Inactive Publication Date: 2015-08-12
WUHAN UNIV
View PDF4 Cites 16 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] The invention aims to solve the problem of low efficiency of massive RDF semantic data query, and designs a SPARQL semantic query optimization method based on connection cost

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • SPARQL semantic data query optimization method based on connection cost
  • SPARQL semantic data query optimization method based on connection cost
  • SPARQL semantic data query optimization method based on connection cost

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] The method is described in detail in conjunction with accompanying drawings and implementation examples, figure 1 It is a flow chart of the method, and the specific steps are as follows:

[0038] Step 1, build RDF semantic data index, use B-tree structure to index and store RDF semantic data, choose spo, pos, osp three indexing methods; among them, s is the subject, p is the predicate, and o is the object; the generated in this step The object whose index data will be used for subsequent SPARQL semantic data queries;

[0039] In this embodiment, RDF can be represented by a triplet pattern, that is, in the form of subject-predicate-object (spo). Since the variable positions of the triplet pattern are only subject, predicate, and object, there are 8 triplet patterns situation. Remove the most specialized pattern (s p o) and the most generalized pattern (?s?p?o), where, ? s,? p and ? o represents the variable to be queried, and according to the subject, predicate and ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a SPARQL semantic data query optimization method based on connection cost. According to the method, the RDF mode information to simplify SPARQL basic graph pattern is used, then a B tree structure is used for rapidly estimating SPARQL connection diagram nose sizes and edge weight values, the connection cost is used for estimating and finding the optimal logical query plan by combining a dynamic planning method, and therefore the query efficiency for improving RDF semantic data is improved.

Description

technical field [0001] The invention belongs to the technical field of computer query optimization, and in particular relates to a connection cost-based SPARQL semantic query optimization method. Background technique [0002] At present, the scale of Linked Data is increasing year by year, and the efficiency of semantic query based on Linked Data still needs to be improved. Linked data is generally expressed by RDF (Resource Description Framework). At present, the research on RDF document query optimization is mainly divided into two aspects: one is to establish an effective index mechanism for RDF documents, and Oracle, Mysql and other relational databases for RDF documents The serialization index mechanism; the other is the optimization of the RDF standard query language SPARQL. The former mainly relies on the RDF index structure, disk index storage method or database characteristics to achieve high I / O throughput performance; the latter studies its query mechanism from t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30
CPCG06F16/24544G06F16/2246
Inventor 徐雷方卿袁小群
Owner WUHAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products