Cross-language knowledge graph entity alignment method based on GCN twinning network

A knowledge map and twin network technology, applied in the field of natural language processing, can solve problems such as low alignment accuracy, neglect of related properties, underutilization of attribute information and relationship information interaction, etc.

Active Publication Date: 2019-11-19
UNIV OF ELECTRONICS SCI & TECH OF CHINA
View PDF2 Cites 24 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Most of the traditional entity alignment methods adopt the method of predicting through the respective attributes of the entities, ignoring the correlation between the entities themselves, or not making full use of the interaction between attribute information and relationship information, or in the negative sampling stage The correct negative samples are not obtained for learning, so the overall alignment accuracy rate is not high

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cross-language knowledge graph entity alignment method based on GCN twinning network
  • Cross-language knowledge graph entity alignment method based on GCN twinning network
  • Cross-language knowledge graph entity alignment method based on GCN twinning network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0038] figure 1 It is a flowchart of a specific implementation of the method for aligning entities in cross-language knowledge graphs based on the GCN twin network of the present invention. Such as figure 1 As shown, the specific steps of the cross-language knowledge map entity alignment method based on the GCN twin network of the present invention include:

[0039] S101: Knowledge map information extraction:

[0040] For the knowledge graph KG of two languages 1 、KG 2 , to extract the information of each knowledge graph separately, the specific method is as follows:

[0041] For knowledge map KG i , i=1,2, extract its relation triplet and attribute triplet, and the relation triplet is recorded as [a i (j),b i (j, j′), a i (j')], a i (j), a i (j′) respectively represent the knowledge map KG i The jth and j′th entities in , 1≤j≠j′≤N i , N i Represents the knowledge map KG i The number of entities in b i (j, j′) represents entity a i (j), a i The relationship be...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a cross-language knowledge graph entity alignment method based on a GCN twinning network. The method comprises the steps: firstly respectively extracting an adjacent matrix andan attribute information matrix for knowledge graphs of two languages, enabling the known alignment entity pairs of the knowledge graphs of the two languages to serve as positive samples, and forminga positive sample set; respectively constructing a GCN twin network for the relationship structure information and the attribute information, respectively recording the relationship structure information and the attribute information as GCN _ SE and GCN _ AE, and training the GCN twin network GCN _ SE by taking adjacent matrixes of the two knowledge maps as inputs of the GCN twin network GCN _ SE; taking the adjacency matrix and the attribute information matrix of the two knowledge maps as the input of a GCN twin network GCN _ AE, and training the GCN twin network GCN _ AE; and obtaining a potential alignment entity result based on the trained GCN twin network. According to the method, only relation structure information and attribute information between entities in the multi-language knowledge graph and part of aligned entities are needed as a training set, and multiple potential aligned entity pairs can be inferred at the same time without acquiring extra entity feature data.

Description

technical field [0001] The invention belongs to the technical field of natural language processing, and more specifically, relates to a method for aligning cross-language knowledge graph entities based on GCN twin networks. Background technique [0002] With the development of the Internet, the amount of data information has exploded, the data is redundant, and the scale is huge. In order to solve this series of problems, the knowledge graph technology aimed at describing the entities existing in the real world and the relationship between entities was born. In order to obtain a more complete knowledge map, a method of integrating multiple knowledge bases in different languages ​​can be used to obtain a multilingual knowledge map containing more information and entities. There are some known cross-language links in this kind of knowledge graph, connecting the same entity pointed to by multiple languages, that is, entity alignment. For example, there is an entity "Shanghai P...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/36G06N3/04G06N3/08
CPCG06F16/367G06N3/08G06N3/045
Inventor 罗绪成谭俊杰
Owner UNIV OF ELECTRONICS SCI & TECH OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products