Method for processing data of resource description framework

A resource description framework data processing technology, applied in the field of data storage, that addresses problems such as data redundancy, the inability to quickly locate RDF triple data, and compression that hinders fast decompression, achieving high compression efficiency and fast decompression speed.

Inactive Publication Date: 2012-06-27
HUAZHONG UNIV OF SCI & TECH

AI Technical Summary

Problems solved by technology

BitMat uses the D-Gap compression method; it compresses well, but the method is not conducive to fast decompression. RDF-3X uses a block-based Delta compression method, which cannot quickly locate a specific RDF triple, and it stores a large amount of redundant data in order to improve query speed.
Overall, these systems do not strike a good balance between compression efficiency and decompression speed.
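
The two general ideas named above can be illustrated with a minimal Python sketch. It shows textbook D-Gap (run-length) and delta encoding only; it is not the actual storage format of BitMat or RDF-3X, and the function names are hypothetical.

```python
def dgap_encode(bits):
    """Encode a 0/1 sequence as [first_bit, run_length, run_length, ...]."""
    if not bits:
        return []
    encoded = [bits[0]]
    run = 1
    for prev, cur in zip(bits, bits[1:]):
        if cur == prev:
            run += 1
        else:
            encoded.append(run)  # close the current run
            run = 1
    encoded.append(run)  # close the final run
    return encoded


def delta_encode(sorted_ids):
    """Store the first ID, then the gap to each following ID."""
    if not sorted_ids:
        return []
    return [sorted_ids[0]] + [b - a for a, b in zip(sorted_ids, sorted_ids[1:])]


def delta_decode(deltas):
    """Rebuild the IDs with a running sum over the stored gaps."""
    ids, total = [], 0
    for d in deltas:
        total += d
        ids.append(total)
    return ids


runs = dgap_encode([1, 1, 0, 0, 0, 1])
gaps = delta_encode([3, 7, 8, 15])
restored = delta_decode(gaps)
```

In this simplified form, recovering the k-th ID from the delta list requires summing every gap before it, which is one way to see why locating a single value inside a delta-compressed block is slow.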


Examples

Embodiment Construction

[0051] The present invention will be described in detail below in conjunction with the accompanying drawings and examples.

[0052] As shown in Figures 1-4, the method for processing resource description framework data of the present invention includes the following steps:

[0053] (1) Process the resource description framework data using a hash algorithm to generate an N*3 matrix, where N is an integer greater than 1, and the three columns of the matrix represent the subject array, the predicate array, and the object array respectively;

[0054] (2) Determine the maximum value eidmax in the subject array and the object array, and the maximum value pidmax in the predicate array;

[0055] (3) Establish the association matrix M of the resource description framework data, wherein the size of the association matrix is (eidmax+1)*N, and initialize all bits of the association matrix M to 0;

[0056] (4) Set the bit values in the association matrix according to the matrix in step (1), and convert t...
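
Steps (2) to (4) can be sketched in Python as follows. The text of step (4) is truncated here, so the bit-setting rule is an assumption: column j is taken to stand for triple j, and the bits in the rows of its subject ID and object ID are set to 1. The function name and the sample ID matrix are hypothetical.

```python
def build_association_matrix(id_triples):
    """id_triples: N rows of (subject_id, predicate_id, object_id)."""
    n = len(id_triples)

    # Step (2): largest entity ID over subjects and objects, largest predicate ID.
    eid_max = max(max(s, o) for s, _, o in id_triples)
    pid_max = max(p for _, p, _ in id_triples)

    # Step (3): (eid_max + 1) rows by N columns, every bit initialised to 0.
    matrix = [[0] * n for _ in range(eid_max + 1)]

    # Step (4), ASSUMED rule: column j represents triple j; mark the rows
    # of its subject and its object.
    for j, (s, _, o) in enumerate(id_triples):
        matrix[s][j] = 1
        matrix[o][j] = 1

    return matrix, eid_max, pid_max


# Hypothetical 3*3 ID matrix, as step (1) might produce it.
sample = [(1, 0, 2), (1, 1, 3), (2, 0, 3)]
M, eid_max, pid_max = build_association_matrix(sample)
```

Under this assumed rule every column holds at most two set bits, so the matrix is very sparse, which is what makes a later compression step worthwhile.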

Abstract

The invention discloses a method for processing resource description framework data. The method comprises the following steps: (1) processing the resource description framework data using a hash algorithm to generate an N*3 matrix, wherein N is an integer greater than 1, and the three columns of the matrix respectively represent a subject array, a predicate array and an object array; (2) determining the maximum value eidmax in the subject array and the object array, and the maximum value pidmax in the predicate array; (3) establishing an association matrix M of the resource description framework data, wherein the size of the association matrix is (eidmax+1)*N, and initializing all bits of the association matrix M to 0; (4) setting the bit values in the association matrix according to the N*3 matrix, thereby converting the resource description framework data; and (5) compressing the association matrix M. With this method, a large amount of resource description framework data can be stored efficiently.

Description

Technical field

[0001] The invention relates to the field of data storage and, more specifically, to a method for processing resource description framework data.

Background

[0002] Resource Description Framework (RDF) has become one of the standard formats for data exchange. It describes the properties of a resource on the Internet and its relationships to other resources. Formally, RDF data can be represented as triples: subject, predicate, and object.

[0003] In RDF data, entities are usually represented by a Uniform Resource Identifier (URI) or a literal, and many of them are repeated, so these URIs and literals are usually converted into integer IDs for storage. This reduces the storage space and facilitates query processing. On this basis, various compression methods have been proposed that exploit the distribution characteristics of the IDs to reduce the space they occupy. In addition, compressing the ID in data que...
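
The conversion of repeated URIs and literals into integer IDs described in the background can be sketched in Python as below. This is an illustrative assumption rather than the patent's hash algorithm: it uses Python dictionaries (hash tables) and keeps separate ID spaces for entities and predicates. The function name and the `ex:` terms are hypothetical.

```python
def encode_triples(triples):
    """Map (subject, predicate, object) strings to integer ID rows."""
    entity_ids = {}     # URI or literal -> entity ID
    predicate_ids = {}  # predicate URI  -> predicate ID

    def lookup(table, term):
        # A repeated term reuses its ID; a new term gets the next unused integer.
        return table.setdefault(term, len(table))

    rows = [
        (lookup(entity_ids, s), lookup(predicate_ids, p), lookup(entity_ids, o))
        for s, p, o in triples
    ]
    return rows, entity_ids, predicate_ids


sample = [
    ("ex:alice", "ex:knows", "ex:bob"),
    ("ex:bob", "ex:knows", "ex:carol"),
    ("ex:alice", "ex:name", '"Alice"'),
]
rows, entity_ids, predicate_ids = encode_triples(sample)
```

Each repeated string is stored once in a dictionary, and the triples themselves shrink to rows of small integers.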

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/30
Inventor: 袁平鹏, 金海, 赵峰, 刘谱, 吴步文
Owner: HUAZHONG UNIV OF SCI & TECH