Method and system for linking entities

A technology for entities and entity names, which is applied in special data processing applications, instruments, unstructured text data retrieval, etc. It can solve the problems of reducing accuracy and increasing the difficulty of entity linking, and achieves the effect of fast and accurate linking
CN106202382AActive Publication Date: 2016-12-07南京柯基数据科技有限公司

Patent Information

Authority / Receiving Office
CN · China
Patent Type
Applications(China)
Current Assignee / Owner
南京柯基数据科技有限公司
Publication Date
2016-12-07

Smart Images

  • Figure 1
    Figure 1
  • Figure 2
    Figure 2
  • Figure 3
    Figure 3
Patent Text Reader

Abstract

The invention discloses a method and a system for linking entities. The method includes acquiring to-be-linked entities from given texts; acquiring entity names and abbreviation word banks from preset knowledge bases and establishing synonym banks of the entity names on the basis of the preset knowledge bases; carrying out searching in the synonym banks by the aid of entity keywords; linking the entity keywords for searching and the entity names in the preset knowledge bases if a certain entry matched with the synonym banks is found by means of searching; generating candidate entities if the certain entry is not found by means of matching and carrying out disambiguation linking in context similarity evaluation modes. The synonym banks contain the entity names acquired from the preset knowledge bases and information data related to the entity names. The entity keywords are acquired by means of word segmentation and are used as search terms. The entity names in the knowledge bases correspond to the entry. The method and the system in an embodiment of the invention have the advantage that the entity linking accuracy can be improved.
Need to check novelty before this filing date? Find Prior Art

Description

technical field

[0001] The invention relates to the field of unstructured text processing, in particular to a method and system for linking entities. Background technique

[0002] With the widespread use of computers and the rapid development of the Internet, the Internet has become a very important channel for people to obtain information. Wikipedia, Interactive Encyclopedia and Baidu Encyclopedia are knowledge bases that are continuously developed on the Internet and edited and constructed by countless netizens. They contain a large amount of structured knowledge, and the pages in the encyclopedias are linked through a special structure to represent the interaction between pages. relation. This kind of knowledge base jointly maintained by netizens has surpassed traditional encyclopedias edited by some experts in terms of quantity, quality, and update frequency, and has become one of the main sources of knowledge for people.

[0003] Among the rapidly increasing data info...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More