Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Entity alignment method in four-insurance-one-fund domain knowledge graphs

A technology of knowledge graph and domain knowledge, applied in neural learning methods, finance, instruments, etc., can solve problems such as poor versatility, limited alignment quality, and ignoring word order information

Active Publication Date: 2020-12-25
HARBIN ENG UNIV
View PDF12 Cites 28 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Matthias[18] and others proposed to align the entities of the knowledge graph with the described text. First, both the text and the knowledge graph are mapped to the vector space through representation learning, and the vector representations of the text and entities are learned, and regularization is added in the learning process. Constraints make the entity vector and description text representing the same entity as close as possible in the vector space, and the representation learning models of knowledge graph and text are trained alternately, so as to achieve alignment without affecting the respective representation learning effects of text and knowledge graph , this method relies on the text information describing the entity, and has poor versatility
Fan et al. [19] proposed the DKRL model modeling triplet in 2017. This model not only considers the energy function based on the relationship representation, but also considers the energy function based on the description information, and proposes two methods for encoding the description information of entities. Method: The continuous bag of words model accumulates the first k keywords of the description information, but this method ignores the word order information, so a method of encoding through a convolutional neural network is proposed
[0013] In the task of large-scale entity matching, due to the imbalance of knowledge graph data, the entity alignment quality of the automatic entity alignment method is limited. Therefore, Zhuang et al. [20] proposed to use the crowdsourcing platform to improve the alignment effect in 2017. First, Partition the knowledge map and perform rough entity alignment, then put aligned entity pairs and unaligned entity pairs into the crowdsourcing platform, and finally establish a partial order relationship between entities to eliminate potential errors and optimize crowdsourcing alignment. Entity effect, but the method still requires a lot of manual operation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Entity alignment method in four-insurance-one-fund domain knowledge graphs
  • Entity alignment method in four-insurance-one-fund domain knowledge graphs
  • Entity alignment method in four-insurance-one-fund domain knowledge graphs

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0114] The present invention provides a knowledge map entity alignment method suitable for the field of four insurances and one fund, which can effectively solve the problem that existing methods cannot capture equivalent information from full names and abbreviations due to inconsistencies in entity names. The invention includes (1) building a knowledge map for alignment (2) dividing the original data into relational triples and attribute triplets (3) automatically marking equivalent entities for training according to entity names (4) setting attributes Threshold to filter attribute triplets (5) Train knowledge map joint embedding representation (6) Calculate LCS similarity of entities (7) Select candidate entity pairs, and disambiguate entity pairs that need to be disambiguated; specifically include the following steps :

[0115] Step 1: Input the relation triplet and attribute triplet of the knowledge graphs S and T to be aligned, and convert the data into n-triple format; ...

Embodiment 2

[0152] The present invention discovers equivalent entities from two heterogeneous knowledge graphs, and the specific process is as follows figure 1 As shown, the corpus used is the four insurances and one housing fund knowledge map and the Chinese knowledge map cn-dbpedia constructed by applying entity recognition and relationship classification technology from the text of policies and regulations. This invention will take this as an example to introduce the specific implementation of entity alignment .

[0153] Step 1: Acquisition and normalization of knowledge map data:

[0154] Step 1.1: Input the relational triples and attribute triplets of the knowledge graphs S and T to be aligned, and convert the source data into n-triple format, where each row contains a triplet, between the head entity, relation, and tail entity Use the '\t' symbol to separate. Save to rel_triples_1, rel_triples_2, attr_triples_1, attr_triples_2 four files;

[0155] Step 1.2: Mark the entities with...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention belongs to the technical field of knowledge maps, and particularly relates to an entity alignment method in four-insurance-one-fund domain knowledge graphs. A strategy of dynamically adjusting an embedding loss function according to a relationship type in a TransE model is provided, the TransE loss function under different attributes and relationship conditions is optimized by addingan embedding layer, embedding results of an LSTM network and a Bert to attribute values are fused, semantic information of the attribute values is considered in embedding of the attribute values. Structure embedding and attribute embedding joint training are carried out to finally obtain joint embedding representation of the knowledge graphs. Semantic information and character information of entity embedding are combined, the entity embedding distance and the LCS similarity are combined, and candidate entity pairs are selected under the condition of considering the two similarities. The problem of difficult alignment caused by insufficient link number in the Chinese knowledge graph and inconsistent names of entities in different knowledge graphs can be effectively solved.

Description

technical field [0001] The invention belongs to the technical field of knowledge graphs, and in particular relates to a method for aligning entities in knowledge graphs in the field of four insurances and one fund. Background technique [0002] Since Google proposed the concept of knowledge graphs in 2012, knowledge graphs have developed rapidly, and a number of large-scale knowledge graphs represented by DBpedia, Freebase, Wikidata, YAGO, etc. have emerged; however, the data sources of knowledge graphs are extensive and the data quality is uneven. , leading to the diversity and heterogeneity of different knowledge graphs, there are multiple different entities pointing to the same real-world object in different knowledge graphs, so knowledge fusion has become an important link in the research of knowledge graphs. As a key technology in knowledge fusion, entity alignment, also known as entity matching, is a technique to infer whether different entities from different knowledg...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/36G06F16/35G06F40/166G06F40/189G06F40/211G06F40/30G06N3/04G06N3/08G06Q10/10G06Q40/08
CPCG06F16/367G06F16/355G06F40/189G06F40/211G06F40/166G06F40/30G06N3/049G06N3/08G06Q10/1057G06Q40/08
Inventor 黄少滨何荣博申林山李熔盛
Owner HARBIN ENG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products