Method for automatically acquiring multi-source heterogeneous data knowledge

An automatic acquisition technology for multi-source heterogeneous data, applied in the field of knowledge acquisition, addresses insufficient comprehensiveness, convenience, and intelligence, as well as the conflicts that arise from differing methods and personnel, and achieves the effect of improving display quality and recognition rate.

Active Publication Date: 2019-11-22
10TH RES INST OF CETC
8 Cites, 40 Cited by

AI Technical Summary

Problems solved by technology

Whether a knowledge graph is constructed manually or automatically, the same problem arises: the data sources differ, or the construction personnel and their methods differ, which inevitably leads to conflicts. These conflicts themselves are difficult to ...
The disadvantage is that it provides no special support for concept, entity, and event extraction, requires a large labeled corpus, and relies on manually set labeling rules.
[0006] At present, there is no research on the unified integration and knowledge acquisition of multi-source heterogeneous data in the existing literature.
At the same time, research on knowledge acquisition methods for multi-source heterogeneous data sources has not yet formed a complete system and often still relies on a "patchwork" of independent algorithms.
General knowledge acquisition methods are often a simple accumulation of data, and their comprehensiveness, convenience, and intelligence fall far short of what is needed.


Embodiment Construction

[0024] Refer to Figure 1. According to the present invention, the heterogeneous data sources are first determined, and the different data sources are converted into heterogeneous knowledge sources by OCR recognition software, crawlers, direct acquisition, and other methods. Then, because structured, semi-structured, and unstructured information coexist in the heterogeneous knowledge sources, knowledge modeling analysis is performed, and a knowledge model and a multi-source heterogeneous data integration and knowledge extraction platform are built. The collected multi-source heterogeneous data sources and the integration and extraction platform serve as the data source and platform support of the framework, and multi-source heterogeneous data knowledge is acquired in three steps. The first is to convert the multi-source heterogeneous data sources into heterogeneous knowledge sources. The second is to generate structured knowledge based on heterogeneous knowledge sources, an...
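The first step, turning differently formatted data sources into uniform knowledge-source records, could be sketched roughly as follows. This is only an illustration under stated assumptions: the `KnowledgeSource` record, the converter functions, and the `"csv"` / `"json"` / `"text"` format labels are hypothetical names that do not appear in the patent, and plain text stands in for OCR or crawler output.

```python
import csv
import io
import json
from dataclasses import dataclass


@dataclass
class KnowledgeSource:
    """Uniform wrapper around content taken from one data source (hypothetical)."""
    origin: str      # where the data came from
    kind: str        # "structured", "semi-structured" or "unstructured"
    content: object  # parsed rows, parsed JSON, or cleaned text


def from_csv(origin: str, payload: str) -> KnowledgeSource:
    # Structured data: each row becomes a dict keyed by the header line.
    rows = list(csv.DictReader(io.StringIO(payload)))
    return KnowledgeSource(origin, "structured", rows)


def from_json(origin: str, payload: str) -> KnowledgeSource:
    # Semi-structured data: keep the nested structure as parsed.
    return KnowledgeSource(origin, "semi-structured", json.loads(payload))


def from_text(origin: str, payload: str) -> KnowledgeSource:
    # Unstructured data, e.g. text produced by OCR or a crawler (assumed input).
    return KnowledgeSource(origin, "unstructured", payload.strip())


# Hypothetical registry mapping a format label to its converter.
CONVERTERS = {"csv": from_csv, "json": from_json, "text": from_text}


def convert(origin: str, fmt: str, payload: str) -> KnowledgeSource:
    """Route one raw payload to the converter registered for its format."""
    if fmt not in CONVERTERS:
        raise ValueError(f"unsupported source format: {fmt}")
    return CONVERTERS[fmt](origin, payload)
```

Keeping the converters in a registry means a new source type only needs one more entry, while later steps see a single record type regardless of where the data originated.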


Abstract

The invention discloses a method for automatically acquiring multi-source heterogeneous data knowledge, and aims to provide a method that offers better integrity, universality, and convenience and that benefits knowledge transmission. The method is realized through the following technical scheme. First, a concept-entity-attribute-relation-label structure is defined from top to bottom or from bottom to top to obtain a knowledge model of the entity objects; data is then obtained through direct data storage, crawler software, and OCR and other recognition software, yielding knowledge data and completing the conversion from heterogeneous data sources to heterogeneous knowledge sources. Next, entity-attribute-relationship triples are instantiated under the known knowledge schema through a structured knowledge generation method. Finally, the knowledge and the knowledge model are updated using a long short-term memory (LSTM) network model and a publisher-accomplisher cooperation mode, giving a workflow for expanding and supplementing new knowledge, and the knowledge model formed by knowledge modeling is used to obtain a data flow that accommodates instantiated triples of concepts, entities, relationships, and attribute values.
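As a rough illustration of instantiating entity-attribute-relationship triples under a known knowledge model, the sketch below turns one structured record into triples. The `ConceptSchema` and `Triple` types, the `is_a` predicate, and the sample field names are assumptions made for this example and are not taken from the patent.

```python
from dataclasses import dataclass, field
from typing import NamedTuple


class Triple(NamedTuple):
    subject: str
    predicate: str
    value: str


@dataclass
class ConceptSchema:
    """Hypothetical knowledge-model entry for one concept."""
    concept: str                                   # concept the entities belong to
    key: str                                       # field that names the entity
    attributes: set = field(default_factory=set)   # fields treated as attributes
    relations: set = field(default_factory=set)    # fields pointing at other entities


def instantiate(schema: ConceptSchema, row: dict) -> list:
    """Turn one structured record into triples allowed by the schema."""
    entity = row[schema.key]
    # Link the entity to its concept; "is_a" is an assumed predicate name.
    triples = [Triple(entity, "is_a", schema.concept)]
    for name, value in row.items():
        if name == schema.key or value in (None, ""):
            continue  # skip the key itself and empty values
        if name in schema.attributes or name in schema.relations:
            triples.append(Triple(entity, name, str(value)))
        # Fields unknown to the schema are ignored in this sketch.
    return triples


schema = ConceptSchema("Sensor", "name", {"range_km"}, {"mounted_on"})
row = {"name": "radar-1", "range_km": 120, "mounted_on": "ship-7", "comment": "n/a"}
triples = instantiate(schema, row)
```

In this sketch a field missing from the schema, such as `comment`, is dropped silently; a fuller system might instead queue such fields as candidates for extending the knowledge model.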

Description

Technical field

[0001] The invention relates to knowledge acquisition technology in many information processing fields such as knowledge engineering, knowledge expression, natural language understanding, information retrieval, information integration and knowledge management, and in particular relates to multi-source heterogeneous data acquisition technology.

Background technique

[0002] In recent years, with the rapid development of computer and network technology, information has exploded. In the face of massive amounts of information, analysts are often faced with the dilemma of "the hungry man eats the buffet and doesn't know how to choose". In the process of enterprise information construction, due to the phased, technical, and other economic and human factors in the construction of various business systems and the implementation of data management systems, the enterprise has accumulated a large amount of business data in different storage methods during the developmen...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F16/21, G06F16/25, G06F16/28, G06F16/90, G06N3/04, G06N3/08
CPC: G06F16/212, G06F16/258, G06F16/284, G06F16/90, G06N3/049, G06N3/08, G06N3/045
Inventor: 黄细凤, 廖泓舟, 代翔, 彭易锦, 杨露
Owner: 10TH RES INST OF CETC