A sparql query optimization method and system based on predicate association

A query optimization and predicate technology, applied in instrumentation, computing, electrical digital data processing, etc., to achieve the effect of distributed SPARQL query
CN110032676BActive Publication Date: 2022-08-05CENT SOUTH UNIV

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Patents(China)
Current Assignee / Owner
CENT SOUTH UNIV
Publication Date
2022-08-05

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The invention relates to the technical field of storage and query oriented to big data association, and discloses a SPARQL query optimization method and system based on predicate association, which can realize distributed SPARQL query more quickly and effectively. In the RDF triplet, use the predicate to name the RDF triplet to get the original RDF data set; divide the RDF data set to get the VP table, count the number of subjects and predicates connected by the predicate in the RDF data according to the VP table, and define the four elements of the predicate. Connectivity characteristics, and prioritize the predicates according to the strength of the connectivity characteristics; build the correlation between the predicates, and convert the historical SPARQL query graph into a tree-like predicate graph according to the correlation, optimize the tree-like predicate graph, according to the optimization The resulting tree-like predicate graph generates related tables and converts SPARQL into query commands; query commands are used to query the table to be queried.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The invention relates to the technical field of storage and query oriented to big data association, in particular to a SPARQL query optimization method and system based on predicate association. Background technique

[0002] Resource Description Framework (RDF) is a W3C standard for describing network resources. It uses Internationalized Resource Identifier (IRI) to identify resources, and uses triples consisting of subject s, predicate p and predicate o to describe a metadata for the data. More and more fields describe data in the form of RDF datasets, such as biological sciences, social networks, and search engines, whose datasets contain billions of triples. The huge and ever-increasing RDF data set puts forward higher requirements for data query and information retrieval. In this environment, the SPARQL query language based on Basic Graph Pattern (BGP) is proposed by W3C to facilitate query and Retrieve RDF data.

[0003] At present, the existin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More