Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Form entity linking method based on multiple knowledge bases

A knowledge base and entity technology, applied in the field of table entity linking based on multiple knowledge bases, can solve problems such as non-universal, affecting the quality of table entity linking, unreasonable, etc., and achieve a strong practical effect

Active Publication Date: 2017-03-15
SOUTHEAST UNIV
View PDF4 Cites 20 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, there are two main problems in existing methods and systems for linking table entities: 1) Many methods or systems rely on features based on specific information, such as column headings and entity types in the knowledge base, but most of them are extracted from the World Wide Web There are no column headers in the tables, and many knowledge bases do not have semantic information such as entity types, which makes these methods and systems not universal and less practical; 2) All current methods and systems are based on a single knowledge base. Table entity links, but this does not guarantee the quality of table entity links. Many entities in tables do not exist in a single knowledge base, so it is unreasonable to link entities only for a single knowledge base
TabEL is more advanced than LIEGE, because TabEL can perform entity linking based on any single knowledge base for tables with multiple rows and columns, but the system still cannot complete the task of linking table entities based on multiple knowledge bases, because many strings should be Linked entities do not exist in a given single knowledge base, resulting in unsatisfactory quality of tabular entity linking using the TabEL system
In addition, the system relies on prior probabilities calculated from different sources, and each source has its own emphasis, resulting in unobjective prior probabilities obtained and easily affecting the quality of table entity links

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Form entity linking method based on multiple knowledge bases
  • Form entity linking method based on multiple knowledge bases
  • Form entity linking method based on multiple knowledge bases

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] The implementation process of the present invention will be described in detail below in conjunction with the embodiments and the accompanying drawings.

[0045] The present invention is based on the table entity linking method of multi-knowledge base, comprises following 3 steps:

[0046] 1) Every time from the knowledge base collection K={KB 1 , KB 2 ,...,KB z ..., KB n} to select a single knowledge base KB z , from the single knowledge base KB according to the following method z Extract candidate entities from , construct a list of candidate entities, and finally obtain a list of candidate entities constructed by each single knowledge base. The detailed steps are as follows:

[0047] Since it is impractical to use millions of entities in the knowledge base as candidate entities for each string, it is necessary to use an efficient and low-cost method to quickly select several possible candidate entities for each string , in order to use more complex methods to f...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a form entity linking method based on multiple knowledge bases. The method is mainly used for solving the problem of entity linking in a form. The method includes the steps that firstly, a candidate entity is generated for a character string in each cell in a given form, and the candidate entities are extracted from the self-given knowledge bases; then a universal probability propagation algorithm based on a map is provided to rank the candidate entities corresponding to the character strings in the cells. The method can act on any single knowledge base. According to the ranking result of the candidate entities based on the different single knowledge bases, by means of the equivalence relation among the entities from the different knowledge bases, the ranked candidate entities, extracted from the different knowledge bases, corresponding to the character strings in the cells are divided. Finally, three heuristic rules are used for finally determining the entities, existing in the different knowledge bases, needing to be linked with the character strings in the cells, and therefore tasks of form entity linking based on the multiple knowledge bases are completed.

Description

technical field [0001] The invention belongs to the field of entity linking and relates to a table entity linking method based on multiple knowledge bases. Background technique [0002] There are a large number of HTML tables with high-quality relational data in the current World Wide Web, and these tables are regarded as important sources for knowledge extraction from the World Wide Web. In order to realize the vision of the Semantic World Wide Web, many works try to mine the latent semantic information in the tables, and represent the content in a given table as RDF triples. The first step in semantic information mining of table content is entity linking. Entity linking is to identify the true meaning of the strings in each cell in the table, and link these strings to entities in a given knowledge base. If the potential entities in the table cannot be correctly identified, it will be difficult to mine the correct RDF triples from the content of the given table, so linking...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/30G06N5/02
CPCG06F16/288G06F16/367G06N5/022
Inventor 吴天星漆桂林刘太云严晟嘉朴智新许亮王瑞明
Owner SOUTHEAST UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products