A Method of Entity Linking Based on Graph Model

A graph model, entity technology, applied in text database query, unstructured text data retrieval, text database indexing and other directions, can solve the problems of insufficient semantic information and insufficient use of unambiguous entities.

Active Publication Date: 2021-07-27
SOUTHEAST UNIV
View PDF8 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] Although there have been a lot of researches on entity linking and graph-based entity disambiguation methods, the common problem of existing methods is that they do not give full play to the role of unambiguous entities, and the semantic information in entity association graphs is not enriched with the addition of unambiguous entities

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Method of Entity Linking Based on Graph Model
  • A Method of Entity Linking Based on Graph Model
  • A Method of Entity Linking Based on Graph Model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0050] The implementation process of the present invention will be described in detail below in conjunction with the embodiments and the accompanying drawings.

[0051] The entity linking method based on graph model of the present invention comprises the following steps:

[0052] 1) Offline data processing. It is divided into two parts: one is to establish an inverted index for all entity information in the knowledge base, and the other is to perform vectorized representation for each entity in the knowledge base.

[0053] 1a) Build knowledge base entity index. The entity information to be stored in the knowledge base includes title (Title), category (Category), information box (Infobox) key-value pair and abstract (Abstract), etc., corresponding to each entity is a Document object, each Document object Contains fields such as title, table of contents, information box, abstract, etc.

[0054] 1b) Obtain the semantic vector representation of knowledge base entities. It is d...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a graph model-based entity linking method, which is mainly used for dealing with the entity linking problem of unstructured text. The present invention first constructs an entity association graph of the text from all entity reference items and corresponding candidate entity sets obtained in the same text, as the basis of the dynamic entity disambiguation algorithm. Then, using the dynamic entity disambiguation algorithm based on graph and PageRank, the undisambiguated candidate entity with the highest score is selected in each round as the target entity of the entity reference, and the disambiguation selection process of the entity reference corresponding to multiple candidate entities is gradually completed. Finally, use XGBoost in the field of machine learning to judge the target entity referred to by the entity, correctly link the registered target entities in the knowledge base, and correctly identify the unregistered target entities in the knowledge base.

Description

technical field [0001] The invention belongs to the field of entity linking and relates to an entity linking method based on a graph model. Background technique [0002] Since the concept of the Semantic Web was proposed, more and more open link data and user-generated content have been published on the Internet, and the Internet has gradually changed from a document that only contains hyperlinks between web pages to a World Wide Web that contains a large number of descriptions of various entities and The World Wide Web of Data rich in relationships between entities. While Internet webpages, such as news, blogs, etc., involve a large number of entities, most of the webpages themselves do not have relevant descriptions and background introductions about these entities. In order to help people better understand the content of the webpage, many websites or authors will establish a link relationship between the entities that appear in the webpage and the corresponding knowledge...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06F16/31G06F16/33G06F16/35G06F16/36
Inventor 邢昊天漆桂林高桓
Owner SOUTHEAST UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products