Cross-language multi-source vertical domain knowledge graph construction method

A vertical domain, knowledge graph technology, applied in cross-language multi-source vertical domain knowledge graph construction, cross-language knowledge graph construction field, can solve the problem of no knowledge graph construction method, single data source, etc., to improve the extraction efficiency and accuracy performance, enriching graph information, and improving usability

Active Publication Date: 2021-01-08
10TH RES INST OF CETC
View PDF8 Cites 25 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] To sum up, the existing knowledge graphs mainly involve open domains, single data sources, and are mainly single-language graphs; knowledge graphs related to vertical fields are mainly concentrated in the financial field, and most of their data are structured texts; knowledge graphs have great potential in the field of public security. Potential application value, there is currently no relevant cross-language multi-source vertical domain knowledge map construction method

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cross-language multi-source vertical domain knowledge graph construction method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0017] refer to figure 1 . According to the present invention, the following steps are included: vertical domain translation, domain knowledge pre-labeling training, domain knowledge extraction, domain knowledge fusion and disambiguation, wherein the vertical domain translation is based on the input cross-language text, domain dictionary, domain term base, domain Materials and data, complete the construction of parallel corpora through content and link analysis, use active learning technology to realize cross-language automatic corpus annotation, and then complete the construction and training of translation models based on statistical models and neural network models based on parallel corpora. , on the basis of preprocessing, based on the trained translation model, the automatic translation of foreign texts is realized; the domain knowledge pre-labeling training is based on the text data that needs to be labeled, and the active learning labeling based on text segmentation and...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a cross-language multi-source vertical domain knowledge graph construction method, and relates to the technical field of knowledge engineering. According to the technical scheme, the method comprises the steps that vertical domain translation completes parallel corpus construction through content and link analysis according to input cross-language texts, domain dictionaries, domain term libraries, domain materials and data, and automatic translation of foreign language texts is achieved based on a trained translation model on the basis of preprocessing; domain knowledgepre-annotation training realizes active learning annotation based on text word segmentation and text clustering, to-be-annotated corpus screening based on an analysis topic is completed, and a confirmed service annotation data set is generated; an optimal algorithm is selected, and semantic feature extraction and entity relationship extraction are completed based on deep learning in combination with vertical domain translation data and an actual scene; and domain knowledge fusion and disambiguation are performed on knowledge from different sources through network equivalent entity combination, so that the cross-language multi-source vertical domain knowledge graph can be obtained.

Description

technical field [0001] The invention relates to the construction of cross-language knowledge graphs in the field of knowledge engineering technology, in particular to a method for constructing cross-language multi-source vertical domain knowledge graphs. Background technique [0002] Knowledge Graph originates from related academic research fields such as semantic web and graph database. Different fields have different emphasis on knowledge graph research, such as natural language processing, knowledge engineering, machine learning, database and data management. Different research and application. The construction of knowledge graph first needs to determine the available data sources, such as structured data, machine-readable open ontology or dictionary, open link data and open knowledge base, industry knowledge base and industry vertical website, online encyclopedia (wiki, interactive, Baidu) and data such as text. Then, effectively collect data, such as open link data co...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/36G06F40/49G06F40/58G06F40/295G06F16/35
CPCG06F16/367G06F40/49G06F40/58G06F40/295G06F16/353
Inventor 崔莹代翔杨露李春豹刘鑫黄刘潘磊
Owner 10TH RES INST OF CETC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products