Distributed storage and indexing method based on knowledge graph RDF data characteristics

A distributed storage and data feature technology, applied in text database indexing, unstructured text data retrieval, text database query, etc., can solve the problems of low execution efficiency, small storage capacity of a single machine, and high maintenance cost

Pending Publication Date: 2020-03-24
TIANJIN UNIV
View PDF7 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0012] In order to overcome the deficiencies of the existing technology, the present invention aims to solve the problems of small storage capacity, high maintenance cost, and low execution efficiency of a single machine, and give full play to the advantages of large distributed stora

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Distributed storage and indexing method based on knowledge graph RDF data characteristics
  • Distributed storage and indexing method based on knowledge graph RDF data characteristics
  • Distributed storage and indexing method based on knowledge graph RDF data characteristics

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] The basic technical scheme that the present invention adopts is:

[0044] 7) Process the stored data, count the data information and mine the associated information in the data set;

[0045] 8) By using the data information in step 1), an effective entity aggregation index is constructed between triplet classes;

[0046] 9) Carry out ontology division processing for each entity class based on statistical information, increase the aggregation degree of predicates contained in the entity class, and establish a predicate pointing index;

[0047] 10) Carry out S-S connection operation on the data set, save the connection specific category, so as to improve the efficiency of star query with extremely high frequency;

[0048] 11) Divide the data set into levels based on the aforementioned steps to ensure that the data is stored in descending order according to the defined levels;

[0049] 12) For the input query, introduce query optimization to correspond to the above steps...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the field of distributed graph storage, and aims to solve the problems of small storage capacity, high maintenance cost, low execution efficiency and the like of a single machine and improve the execution efficiency of multiple types of queries. The distributed storage and indexing method based on the knowledge graph RDF data characteristics comprises the following steps of processing stored data, counting data information, and mining associated information in a data set; through the data information in the step 1), constructing an effective entity aggregation index between the triple classes; performing ontology division processing on each entity class based on the statistical information, increasing a predicate aggregation degree contained in the entity class, and establishing a predicate pointing index; carrying out connection operation on the data set, and storing a connection special class so as to improve the star query efficiency with extremely high occurrence frequency; grading the data set based on the above steps, and ensuring that the data is stored in a descending order according to a defined grade; for an input query, introducing query optimization. The method is mainly applied to distributed graph storage occasions.

Description

technical field [0001] The invention relates to the field of distributed graph storage, in particular to the field of storage for large-scale RDF knowledge graphs. Background technique [0002] RDF (Resource Description Framework), a resource description framework, is a markup language used to describe Web resources. It can also be said to be a standard data model for representing and exchanging machine-understandable information in the Semantic Web. RDF uses a triplet of subject, predicate, and object to describe the metadata of a data, which is (s, p, o), where s is the subject, p is the predicate, and o is the object object. RDF data is used in many fields because of its simplicity, openness, and scalability. With the popularity of the Internet, the scale of RDF data continues to increase. The efficient storage and query of RDF data has become a research hotspot. RDF graph is the most intuitive representation of RDF data. [0003] Knowledge graph is an important part of...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F16/31G06F16/33G06F16/36
CPCG06F16/316G06F16/367G06F16/33Y02D10/00
Inventor 王鑫徐炜淇
Owner TIANJIN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products