Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Entity Translation Method Based on Web Retrieval

An entity and historical technology, applied in the field of entity translation based on web retrieval, can solve the problems of low translation efficiency, low translation accuracy, and inability to return accurate information, so as to eliminate ambiguity and improve accuracy and translation efficiency. Effect

Active Publication Date: 2020-06-12
INST OF SOFTWARE - CHINESE ACAD OF SCI
View PDF7 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Because the retrieval results of web retrieval usually cannot return enough accurate information, the existing entity translation based on web retrieval has the defects of low translation accuracy and low translation efficiency

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Entity Translation Method Based on Web Retrieval
  • Entity Translation Method Based on Web Retrieval
  • Entity Translation Method Based on Web Retrieval

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is only some embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0024] The present invention provides a method for entity translation based on web retrieval, such as figure 1 As shown, the method includes:

[0025] S11. Using the entity description information in the knowledge base and the entity to be translated to perform a web search.

[0026] S12. Using the entity description information in the know...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a web retrieval based entity translation method. The method comprises the steps that entity description information and an entity to be translated in a knowledge base are utilized for performing web retrieval; the entity description information in the knowledge base is used for performing sequence labeling on a historical retrieval result to obtain at least one candidate entity translation; according to a current character / word TF-IDF value in the historical retrieval result and the occurrence probability of the translation entity and the current character / word, at leastone candidate enhancing word is obtained; statistics is performed on the related statistic amount between the candidate entity translation and the candidate enhancing word, and a retrieval state table is generated or updated; the retrieval state table is adopted as a reinforcement learning state set, the at least one candidate enhancing word and special 'ending' motions are adopted as a reinforcement learning motion set, the optimal retrieval enhancing word selective strategy is obtained through a reinforcement learning mechanism, and the candidate entity translation with the highest occurrence frequency is adopted as the final entity translation in the end. The entity translation accuracy and translation efficiency can be improved, and meanwhile the problems of unlisted words and entityname ambiguity can be avoided.

Description

technical field [0001] The invention relates to the technical field of natural language processing, in particular to a method for entity translation based on web retrieval. Background technique [0002] In recent years, with the emergence and development of a large number of open knowledge bases, the imbalance of knowledge base construction has become prominent. This imbalance is mainly reflected in: the coverage of different knowledge bases is not the same; the knowledge of different languages There are huge gaps in orders of magnitude between libraries. For the construction of a knowledge base in a new domain or language, entity translation technology has the advantages of rapid construction and excellent structural compatibility. [0003] The core of the construction of the translation knowledge base is the translation of knowledge base entities. However, due to the rich connotation of the entity concept, entity translation based on rules or statistical methods often en...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F40/216
CPCG06F40/44
Inventor 颜令勇孙乐韩先培
Owner INST OF SOFTWARE - CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products