Parallel reasoning algorithm for streaming RDF data

A data and algorithm technology, applied in the field of stream RDF data parallel reasoning algorithm, can solve problems such as difficult to obtain triples and inapplicability, and achieve the effect of reducing storage space and reasoning time, improving efficiency, and improving efficiency

Active Publication Date: 2017-07-25
FUZHOU UNIV
View PDF2 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, since Slider is only designed for RDFS rules, it is not suitable for complex OWL Horst rule reasoning
[0004] Today's challenges in large-scale RDF file reasoning are: distributed data on the network is difficult to obtain appropriate triples; the growing amount of data requires scalable computing power for large data sets; existing reasoning methods are Designed for static ontologies where data is usually changing in the real world

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Parallel reasoning algorithm for streaming RDF data
  • Parallel reasoning algorithm for streaming RDF data
  • Parallel reasoning algorithm for streaming RDF data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0017] The present invention will be further explained below in conjunction with the accompanying drawings and specific embodiments.

[0018] The streaming parallel reasoning proposed by the present invention is mainly divided into three parts: constructing a pseudo-bidirectional network, classifying streaming data and reasoning of OWLHorst rules. According to the characteristics of Spark Streaming and Redis, combined with the HAL algorithm, OWLHorst rules and RDF data ontology, a regular pseudo-bidirectional network is constructed, which contains class nodes and rule nodes corresponding to pattern triples. If there are class connection variables in the rule nodes, then Establish an intermediate node; then, regularly obtain batches of new data in the Streaming data stream and the data generated by the previous reasoning as input data, classify the input data or create a new corresponding node and store it in the corresponding Redis cluster; then, for the input The triplet data...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a parallel reasoning algorithm for streaming RDF data, which comprises the steps of building a pseudo bidirectional network of a rule, and building an intermediate node if a class link variable exists in rule nodes; acquiring a batch of new data in a streaming data flow and data generated by previous reasoning at regular time to act as input data, performing classification on the input data or newly building corresponding nodes, and storing the nodes into corresponding Redis clusters; judging whether antecedents monitored by the corresponding intermediate nodes or rule nodes are all satisfied or not by combining the pseudo bidirectional network for the inputted triple data, thus performing reasoning on the rule, and generating reasoning data; and realizing parallel streaming reasoning for an OWL Horst rule of the RDF data integrally and efficiently through deleting repeated reasoning data in real time and storing all data generated by the current time of reasoning into the Redis clusters to act as input data of the next time of reasoning.

Description

technical field [0001] The invention belongs to the technical field of semantic web, and specifically relates to a parallel reasoning algorithm for streaming RDF data. Background technique [0002] In recent years, researchers have gradually realized the importance of research on parallel reasoning algorithms for real-time streaming data, but there are still few related algorithms proposed in this field, and further research is needed. At the same time, there are quite a lot of researches on reasoning in intelligent technology, such as knowledge discovery and case reasoning. It has become a consensus in academia and industry to solve large-scale RDF streaming data related problems through distributed parallel computing. [0003] Research on RDFS / OWL streaming parallel reasoning is a relatively new field at present. Barbieri D F et al. proposed an incremental reasoning algorithm based on streaming and rich background knowledge. This algorithm adds expiration time informatio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N5/04
Inventor 汪璟玢叶怡新
Owner FUZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products