Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Entity disambiguation method and device based on UCL knowledge space

An entity and knowledge technology, applied in the field of knowledge graph construction in the Internet, can solve the problem of low disambiguation accuracy, and achieve the effect of improving accuracy and effect.

Pending Publication Date: 2021-07-13
SOUTHEAST UNIV
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

This can effectively solve the problem of low disambiguation accuracy caused by insufficient entity context information in short texts, and improve the effect of entity disambiguation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Entity disambiguation method and device based on UCL knowledge space
  • Entity disambiguation method and device based on UCL knowledge space
  • Entity disambiguation method and device based on UCL knowledge space

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0031] The technical solutions provided by the present invention will be described in detail below in conjunction with specific examples. It should be understood that the following specific embodiments are only used to illustrate the present invention and are not intended to limit the scope of the present invention.

[0032] Such as figure 1 As shown, an entity disambiguation method based on UCL knowledge space disclosed in the embodiment of the present invention, the specific implementation steps are as follows:

[0033] Step 1, the construction of UCL knowledge space. Use information extraction related technologies to obtain entities, basic attributes of entities and associations between entities from open offline databases, and build a basic knowledge base; acquire network news, use UCL to index network news, as a supplement to the knowledge base, and complete the construction of UCL knowledge space . This step is the prerequisite work of the present invention, and concre...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides an entity disambiguation method and device based on a UCL knowledge space, and the method comprises the steps: firstly constructing a basic knowledge base, and completing the construction of the UCL knowledge space; then obtaining a candidate entity set related to the entity to be disambiguated from the UCL knowledge space, and generating an embedded representation of the candidate entity and the entity to be disambiguated by using a word vector representation method; then extracting concept features of an entity to be disambiguated and contexts thereof, and extracting features of contexts of candidate entities; finally, the four vector representations generated previously are used as input, a self-attention matching network based on a deep structured semantic matching model DSSM is adopted, and the matching degree is obtained; and obtaining a final disambiguation result according to the sorting of matching results, and completing entity linking between the entities in the text and the entities in the UCL knowledge space. According to the method and the device, the problem of less entity related information in the short text can be solved, and the entity disambiguation accuracy can be improved.

Description

technical field [0001] The invention relates to a UCL knowledge space-based entity disambiguation method and device, and belongs to the technical field of knowledge map construction in the Internet. Background technique [0002] With the rapid development of the Internet, the number of online news has increased sharply, and the knowledge information contained in the news has become more and more complicated. There is an urgent need for a suitable carrier to effectively store and manage news information. The knowledge map can associate entities together to form a graph database by constructing "entity-relationship-entity" triples and "entity-attribute (value)" key-value pairs. The unified content label UCL (Uniform Content Label) defined by the national standard "Uniform Content Label Format Specification" (GB / T 35304-2017) can provide rich semantic information, and its content format includes who, what, where, Elements that are highly consistent with news events such as wha...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F16/28G06N3/04G06N3/08
CPCG06F16/288G06N3/049G06N3/08G06N3/045
Inventor 杨鹏常欣辰范路平于晓潭
Owner SOUTHEAST UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products