Cross-language multi-source vertical domain knowledge graph construction method

A vertical field and knowledge map technology, applied in cross-language knowledge map construction, cross-language multi-source vertical field knowledge map construction field, can solve problems such as no knowledge map construction method, single data source, etc., to improve extraction efficiency and accuracy Sexuality, enriching map information, and reducing work costs

Active Publication Date: 2022-07-08
10TH RES INST OF CETC
View PDF8 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] To sum up, the existing knowledge graphs mainly involve open domains, single data sources, and are mainly single-language graphs; knowledge graphs related to vertical fields are mainly concentrated in the financial field, and most of their data are structured texts; knowledge graphs have great potential in the field of public security. Potential application value, there is currently no relevant cross-language multi-source vertical domain knowledge map construction method

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cross-language multi-source vertical domain knowledge graph construction method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0017] see figure 1 . According to the present invention, it includes the following steps: vertical domain translation, domain knowledge pre-labeling training, domain knowledge extraction, domain knowledge fusion and disambiguation, wherein the vertical domain translation is based on the input cross-language text, domain dictionary, domain term base, domain For materials and data, the parallel corpus construction is completed through content and link analysis, and active learning technology is used to realize cross-language automatic corpus labeling, and then based on the parallel corpus, the translation model construction and training based on statistical models and neural network models are completed. , On the basis of preprocessing, based on the trained translation model, automatic translation of foreign texts is realized; domain knowledge pre-labeling training is to realize active learning and labeling based on text segmentation and text clustering according to the text da...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a cross-language multi-source vertical domain knowledge map construction method, which relates to the technical field of knowledge engineering. The present invention is achieved through the following technical solutions: vertical domain translation completes parallel corpus construction through content and link analysis according to input cross-language texts, domain dictionaries, domain termbases, domain materials and data; The translation model realizes automatic translation of foreign texts; domain knowledge pre-labeling training realizes active learning and labeling based on text segmentation and text clustering, completes the selection of corpus to be labeled based on analysis topics, and generates confirmed business labeling datasets; Algorithms, combining vertical domain translation data and actual scenes to complete semantic feature extraction and deep learning-based entity relationship extraction; domain knowledge fusion and disambiguation For knowledge from different sources, the network equivalent entities are merged for fusion and disambiguation to obtain cross-language multi-source knowledge Vertical domain knowledge graph.

Description

technical field [0001] The invention relates to the construction of a cross-language knowledge map in the technical field of knowledge engineering, and in particular to a method for building a cross-language multi-source vertical domain knowledge map. Background technique [0002] Knowledge Graph (Knowledge Graph) originated from related academic research fields such as semantic web and graph database. Different fields have different emphasis on knowledge graph research, such as natural language processing, knowledge engineering, machine learning, database and data management. Different research and applications. Knowledge graph construction first needs to identify available data sources, such as structured data, machine-readable open ontology or thesaurus, open linked data and open knowledge bases, industry knowledge bases and industry vertical websites, online encyclopedias (Wiki, Interactive, Baidu) and data such as text. Then, collect data effectively, such as open lin...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/36G06F40/49G06F40/58G06F40/295G06F16/35
CPCG06F16/367G06F40/49G06F40/58G06F40/295G06F16/353
Inventor 崔莹代翔杨露李春豹刘鑫黄刘潘磊
Owner 10TH RES INST OF CETC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products