Neural machine translation method based on similarity perception

A machine translation and similarity technology, which is applied in the field of natural language processing machine translation, can solve the problems of high-potential sentence recognition and optimization of untranslated sentences, redundant review and heavy post-translation editing workload, etc., to reduce review time, The effect of easy recognition

Active Publication Date: 2020-09-01
BEIJING INSTITUTE OF TECHNOLOGYGY
View PDF2 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] The purpose of the present invention is to solve the technical problem that the existing machine translation system cannot identify and optimize high-potential sentences for sentences to be translated in practical applications, resulting in redundant review and heavy post-translation editing workload for human translators. A neural machine translation method based on similarity perception

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Neural machine translation method based on similarity perception
  • Neural machine translation method based on similarity perception
  • Neural machine translation method based on similarity perception

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] The following is attached with the manual figure 1 The present invention is described further.

[0027] A neural machine translation method based on similarity perception, comprising the following steps:

[0028] Step 1: Build a structural translation memory and a corresponding structural similarity algorithm for retrieving high-potential structural sentences.

[0029] Specifically, firstly, using the component syntax tree analysis method, the parallel syntax tree pairs are extracted from the parallel sentence pairs in the training corpus. Then, parallel syntactic tree pairs are used to construct a structural translation memory, and a corresponding structural similarity algorithm is designed to retrieve high potential structural sentences and calculate structural similarity.

[0030] Such as figure 1 As shown, the structural translation memory includes two data composition methods:

[0031] (1) In the recognition module, a structural translation memory library consi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a neural machine translation method based on similarity perception, and belongs to the technical field of natural language processing machine translation. The method comprises the following steps of firstly, creating a structure translation memory bank and a corresponding structure similarity algorithm; constructing a template translation memory bank and a corresponding template similarity algorithm; and then pre-identifying high-potential statements of character strings, structures and template dimensions in the test set; constructing multi-dimensional similarity prioriknowledge, and performing multi-dimensional similarity retrieval on all parallel statements in the training set. A posteriori regular objective function is utilized, discrete similarity priori knowledge is fused into a neural machine translation objective function, parameters of the priori knowledge are continuously iterated and updated, and a training process is guided. Finally, the multi-dimensional high-potential to-be-translated statements are translated by utilizing the trained neural translation model. The most similar sentences can be obtained in a finer-grained manner, and the reviewtime of human translators is shortened.

Description

technical field [0001] The present invention relates to a technique for modeling the multidimensional similarity of parallel corpora in neural machine translation and can identify high-potential sentences in a test set and optimize their corresponding translation performance, in particular to a neural machine translation system based on similarity perception The invention and method belong to the technical field of natural language processing machine translation. Background technique [0002] At present, neural machine translation is widely used in multi-scenario computer-aided translation tasks because it is better than traditional statistical machine translation in a variety of natural languages. However, most existing neural machine translation methods focus on how to improve the overall translation performance, paying little attention to the workload of human translators. [0003] In computer-assisted translation scenarios, human translators receive translations generat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/58G06F40/47
CPCG06F40/58G06F40/47
Inventor 冯冲张天夫
Owner BEIJING INSTITUTE OF TECHNOLOGYGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products