Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A method for automatically acquiring knowledge from multi-source heterogeneous data

A multi-source heterogeneous data, automatic acquisition technology, applied in the field of knowledge acquisition, can solve the problems of multi-source heterogeneous data source knowledge acquisition methods that do not form a complete system, the degree of comprehensiveness, convenience and intelligence are not enough, and the data sources are different.

Active Publication Date: 2022-07-29
10TH RES INST OF CETC
View PDF8 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Because no matter whether it is manual construction or automatic construction of knowledge graphs, there will be such a problem: either the data sources are different, or the construction personnel are different and the methods are different, which will inevitably lead to some conflicts. These conflicts themselves are difficult to intuitive go
The disadvantage is that it does not provide special support for concept, entity and event extraction, and requires a large amount of labeling corpus support, and manually sets the labeling rules
[0006] At present, there is no research on the unified integration and knowledge acquisition of multi-source heterogeneous data in the existing literature.
At the same time, the research on knowledge acquisition methods of multi-source heterogeneous data sources has not formed a perfect system, and many times still rely on the "patchwork" of independent algorithms
The general knowledge acquisition method is often a simple accumulation of data, and its comprehensiveness, convenience, and intelligence are far from enough

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method for automatically acquiring knowledge from multi-source heterogeneous data
  • A method for automatically acquiring knowledge from multi-source heterogeneous data
  • A method for automatically acquiring knowledge from multi-source heterogeneous data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0024] see figure 1 . According to the present invention, the heterogeneous data sources are first determined, and the different data sources are converted into heterogeneous knowledge sources through methods such as OCR identification software, crawler, direct acquisition, etc.; To solve the problem of cross-existence of information and unstructured information, conduct knowledge modeling and analysis, and build a knowledge model and a multi-source heterogeneous data integration and knowledge extraction platform. The collected multi-source heterogeneous data sources and multi-source heterogeneous data integration and extraction platform are used as the data source and platform support of the framework, and the multi-source heterogeneous data knowledge is acquired in three steps. One is to convert the multi-source heterogeneous data sources. The second is to generate structured knowledge based on heterogeneous knowledge sources, and the third is to update knowledge and knowle...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A method for automatically acquiring multi-source heterogeneous data knowledge disclosed in the present invention aims to provide a method that is more complete, versatile and convenient, and is beneficial to the acquisition of knowledge transfer. The present invention is realized by the following technical solutions: define concept-entity-attribute-relation-label in a top-down or bottom-up manner, obtain the knowledge model of the entity object, and then directly save the data with crawler software, OCR, etc. Recognition software obtains data, obtains knowledge data, and completes the conversion of heterogeneous data sources to heterogeneous knowledge sources; obtains triple instantiation of entity-attribute-relationship in known knowledge mode through structured knowledge generation method; The short-term memory network model (LSTM model) and the publisher-completer collaboration model update knowledge and knowledge models, and obtain the workflow of expanding and supplementing new knowledge. Using the knowledge model formed by knowledge modeling, obtain concepts, entities, relationships, The attribute value instantiates a stream of triples.

Description

technical field [0001] The invention relates to knowledge acquisition technology in many information processing fields such as knowledge engineering, knowledge expression, natural language understanding, information retrieval, information integration and knowledge management, in particular to multi-source heterogeneous data acquisition technology. Background technique [0002] In recent years, with the rapid development of computer and network technology, information has exploded. In the face of massive amounts of information, analysts are often faced with the dilemma of "hungry men don't know how to choose from a buffet". In the process of enterprise informatization construction, due to the phased, technical, and other economic and human factors of the construction of various business systems and the implementation of data management systems, enterprises have accumulated a large number of business data in different storage methods during the development process. Including ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/21G06F16/25G06F16/28G06F16/90G06N3/04G06N3/08
CPCG06F16/212G06F16/258G06F16/284G06F16/90G06N3/049G06N3/08G06N3/045
Inventor 黄细凤廖泓舟代翔彭易锦杨露
Owner 10TH RES INST OF CETC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products