Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Multi-source data and time sequence processing method and device for construction of industry knowledge graph

A knowledge graph and time series technology, applied in the field of multi-source data and time series processing for building industry knowledge graphs, can solve the problems of manual participation, low degree of automation, difficult time series data processing, etc. Guaranteed effect of accuracy

Pending Publication Date: 2020-04-10
SHANGHAI GEOTECHN INVESTIGATIONS & DESIGN INST
View PDF0 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] Theoretically, knowledge graphs can solve the problems of multi-source heterogeneous data extraction, fusion and storage, but there is currently no effective multi-source data processing method for knowledge graphs in vertical fields, and some well-known domain knowledge bases are mostly manually constructed by domain experts , although the quality is high, there are problems such as low degree of automation, low knowledge coverage, and slow update. If the industry knowledge graph is constructed in a semi-automatic way, and the knowledge extraction and fusion methods of the general knowledge graph are used, the quality of the knowledge base is difficult to guarantee, and which Issues such as the need for manual participation in the link and specific processing methods have always been key issues in the implementation of domain knowledge graphs; in addition, the skyrocketing and updating of data shows that knowledge graphs are dynamic, and many knowledge in vertical fields involve time series, such as daily observations, Daily sales volume, daily stock closing price, monthly average temperature, etc. These types of data are not suitable for direct storage in graph databases, but they are necessary for analysis and research. How should knowledge graphs handle this part of time series data?
[0004] To sum up, the construction of industry knowledge graphs has disadvantages such as low efficiency of multi-source massive data processing and difficulty in time series data processing. Those skilled in the art should seek multi-source data and time series processing methods for building industry knowledge graphs

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-source data and time sequence processing method and device for construction of industry knowledge graph
  • Multi-source data and time sequence processing method and device for construction of industry knowledge graph
  • Multi-source data and time sequence processing method and device for construction of industry knowledge graph

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0031] Example: such as Figure 1-4 As shown, this embodiment specifically relates to a multi-source data and time series processing method for constructing an industry knowledge graph, and the processing method mainly includes the following steps:

[0032] S1: Combining domain knowledge and expert experience to construct the ontology layer of the industry knowledge graph, the ontology layer includes ontology, ontology attributes, and ontology relationships, such as figure 2 As shown, the box represents the ontology, the ellipse represents the ontology attribute, and the rhombus represents the ontology relationship between ontologies; among them, the ontology, ontology attributes and ontology relationships included in the ontology layer have been manually reviewed and verified by experts to ensure the rationality and completeness. cover.

[0033] S2: if image 3 As shown, entities and entity attributes are extracted from multiple data sources, and entities are checked for i...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a multi-source data and time sequence processing method and device for construction of an industry knowledge graph. The method comprises the following steps: constructing an ontology layer of the knowledge graph, wherein the ontology layer comprises ontologies, ontology attributes and an ontology relation; extracting entities and entity attributes from multiple data sources, and conducting inconsistency check on the entities; conducting inconsistency check on the entity attributes of all the entities, wherein an entity relationship between the entities inherits the ontology relationship between the ontologies corresponding to the entities; and establishing indexes of a knowledge graph database and a time sequence database. The method has the advantages that throughthe uniqueness of a standard naming table and the relational database, entity conflict resolution accuracy is ensured to the greatest extent; by utilizing the advantage that the knowledge graph has the ontology layer, accurate classification of the entity attributes is realized by establishing similar labels, and fusion efficiency is effectively improved; and the problem that the knowledge graph is difficult to process time sequence data is solved by establishing indexes between knowledge graph database ontologies and a time sequence database form.

Description

technical field [0001] The invention belongs to the technical field of knowledge graphs, and in particular relates to a multi-source data and time series processing method and device for constructing industry knowledge graphs. Background technique [0002] In the context of the era of big data, with the emergence of massive data and the cross-application of multi-data source fusion, the problem of multi-source heterogeneous data fusion has become increasingly prominent, and data association is particularly important. When big data is processed and analyzed, it is limited to a small area and cannot dig out more valuable information. In recent years, knowledge graph, as a new knowledge representation method and data management model, establishes various entities or concepts and their relationships in the real world, and realizes data integration from different data sources. For knowledge graphs in vertical fields, data sources mainly include three types: one is the data of th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F16/36G06F16/31G06F16/28G06F16/25G06F16/2458
CPCG06F16/367G06F16/316G06F16/313G06F16/284G06F16/254G06F16/2474
Inventor 杜续苏辉张静许丽萍杨石飞雷丹
Owner SHANGHAI GEOTECHN INVESTIGATIONS & DESIGN INST
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products