Detection method for RDF data redundancy semantics

This detection method and data-redundancy technique applies to semantic analysis, electrical digital data processing, and natural language data processing. It addresses the lack of structural design for large-scale data and the resulting loss of time performance: it speeds up similarity calculation and avoids the high time complexity of linear search.

Pending Publication Date: 2022-07-01
NANJING UNIV OF AERONAUTICS & ASTRONAUTICS

AI Technical Summary

Problems solved by technology

However, existing approaches slow down sharply when faced with massive data sets, and they lack a structural design for large-scale data.

Embodiment Construction

[0040] To help those skilled in the art better understand the technical problems, technical solutions, and technical effects of the present invention, the invention is further described below with reference to the accompanying drawings.

[0041] The present invention designs a redundant semantics detection method for RDF data. The RDF model is shown in Figure 1. RDF data always takes the form of triples; a triple is also called a statement. Each triple contains three components: the subject being described (Subject), a predicate of that subject (Predicate), and the corresponding value (Object). A triple can be represented simply by its three components, (Subject, Predicate, Object). A data set composed of RDF contains three types of node: URI (Uniform Resource Identifier), blank node (Blank Node), and literal node (Literal Node). Based on the data characteristics of RDF, the present invention designs a method for mas...
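The triple model described in [0041] can be sketched as a small data structure. This is a minimal illustration, not the patent's implementation: the names `NodeKind`, `Node`, and `Triple` are hypothetical, and the two validity checks come from the general RDF data model (a literal cannot be a subject, and a predicate must be a URI) rather than from this text.

```python
from dataclasses import dataclass
from enum import Enum


class NodeKind(Enum):
    """The three node types named in [0041]."""
    URI = "uri"
    BLANK = "blank"
    LITERAL = "literal"


@dataclass(frozen=True)
class Node:
    kind: NodeKind
    value: str


@dataclass(frozen=True)
class Triple:
    """One RDF statement: (Subject, Predicate, Object)."""
    subject: Node
    predicate: Node
    obj: Node

    def __post_init__(self):
        # Assumption: constraints taken from the RDF data model, not from the patent text.
        if self.subject.kind is NodeKind.LITERAL:
            raise ValueError("a literal cannot be the subject of a triple")
        if self.predicate.kind is not NodeKind.URI:
            raise ValueError("the predicate of a triple must be a URI")


# Hypothetical example data: "alice has the name 'Alice'".
alice = Node(NodeKind.URI, "http://example.org/alice")
name = Node(NodeKind.URI, "http://xmlns.com/foaf/0.1/name")
statement = Triple(alice, name, Node(NodeKind.LITERAL, "Alice"))
```

Frozen dataclasses make triples hashable, so a data set can be held as a Python `set` of `Triple` values and exact duplicate statements collapse automatically.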

Abstract

The invention discloses a method for detecting redundant semantics in RDF (Resource Description Framework) data. RDF is a representation model for knowledge graphs. To detect redundant semantics in a knowledge graph represented in RDF, the invention summarizes and analyzes existing RDF redundant-semantics detection methods and, on that basis, improves the RDF similarity algorithm by weighting the contribution of different semantic information to the similarity. The weights are set automatically and are domain independent. A pruning technique is also designed into the representation of the semantic information, which effectively speeds up similarity calculation. In addition, building on the similarity algorithm, the invention provides a selection method for screening candidate objects, which searches the data set for approximately matching candidates. The selection method is based on a locality-sensitive hashing algorithm, which avoids the high time complexity of linear search over large-scale RDF data and gives good time performance.
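The abstract does not say which locality-sensitive hashing scheme is used for candidate selection. As one possible illustration, the sketch below uses MinHash with banding, a common LSH scheme for set similarity. All names and parameters here (`minhash_signature`, `candidate_pairs`, `num_hashes`, `bands`, and the `predicate=value` token encoding) are assumptions made for the example, not details from the patent.

```python
import hashlib
from collections import defaultdict
from itertools import combinations


def _hash(seed: int, token: str) -> int:
    """Deterministic 64-bit hash of a token under a given seed."""
    data = f"{seed}:{token}".encode("utf-8")
    return int.from_bytes(hashlib.blake2b(data, digest_size=8).digest(), "big")


def minhash_signature(tokens, num_hashes=32):
    """MinHash signature: the minimum hash of the token set under each seed."""
    return tuple(min(_hash(seed, t) for t in tokens) for seed in range(num_hashes))


def candidate_pairs(entities, num_hashes=32, bands=8):
    """Return pairs of entity URIs that share at least one LSH bucket.

    `entities` maps a URI to a set of string tokens describing it
    (assumed encoding: one "predicate=value" token per triple).
    """
    rows = num_hashes // bands
    buckets = defaultdict(set)
    for uri, tokens in entities.items():
        if not tokens:
            continue  # an empty description has no signature
        sig = minhash_signature(tokens, num_hashes)
        for band in range(bands):
            # Entities whose signatures agree on a whole band land in the same bucket.
            key = (band, sig[band * rows:(band + 1) * rows])
            buckets[key].add(uri)
    pairs = set()
    for uris in buckets.values():
        pairs.update(combinations(sorted(uris), 2))
    return pairs


# Hypothetical data: two URIs from different sources describing the same city.
entities = {
    "http://a.example/berlin": {"type=City", "label=Berlin", "country=Germany"},
    "http://b.example/Berlin": {"type=City", "label=Berlin", "country=Germany"},
    "http://a.example/paris": {"type=Town", "label=Paris", "country=France"},
}
pairs = candidate_pairs(entities)
```

Only the pairs returned here would need to be scored by the full similarity algorithm, which is how bucketing avoids comparing every entity against every other one. Raising `bands` for a fixed `num_hashes` makes the filter more permissive; lowering it makes the filter stricter.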

Description

Technical field

[0001] The invention discloses a method for detecting redundant semantics in RDF data and belongs to the field of RDF semantic redundancy.

Background

[0002] Linked Data is widely used to publish, deploy, and share Internet resources. It names entities and expresses the relationships between them using the Resource Description Framework (RDF) data model. The development and standardization of the Semantic Web has led to a large amount of data being published on the web as Linked Data. However, because data sources are diverse, Linked Data covering different domains is usually generated independently, distributed across many locations, and heterogeneous. When data from different sources is integrated, duplication can be introduced into Linked Data. That is, different URIs in Linked Data may point to the same real-world entity. This phenomenon is also ...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F40/30, G06F40/289, G06K9/62
CPC: G06F40/30, G06F40/289, G06F18/23, G06F18/22
Inventors: 陈一鸣, 严丽
Owner: NANJING UNIV OF AERONAUTICS & ASTRONAUTICS