Method and device for long statement segmentation aiming at neural machine translation

A technology of machine translation and segmentation device, applied in the field of language translation, can solve the problems of decreased translation effect and poor translation effect, etc.

Active Publication Date: 2016-08-31
IOL WUHAN INFORMATION TECH CO LTD
View PDF5 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Although the NMT model based on the encoder-decoder structure can achieve good translation results, when the source sentence is too long, its translat

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and device for long statement segmentation aiming at neural machine translation
  • Method and device for long statement segmentation aiming at neural machine translation
  • Method and device for long statement segmentation aiming at neural machine translation

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0090] The technical solutions in the embodiments of the present application will be clearly and completely described below with reference to the drawings in the embodiments of the present application. Obviously, the described embodiments are only a part of the embodiments of the present application, but not all of the embodiments. Based on the embodiments in the present application, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present application.

[0091] see figure 1 , which shows the flow of Embodiment 1 of the neural machine translation-oriented long sentence segmentation method provided in this application. like figure 1 As shown, this embodiment may specifically include steps S101 to S104.

[0092] Step S101: After obtaining the source sentence to be translated, determine the length of the source sentence.

[0093] The sentence to be translated may be referred to as the sou...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The application provides a method for long statement segmentation aiming at neural machine translation. The method comprises the steps that before statement translation based on an NMT model, direct input of source statements into the NMT is replaced by segmentation of statements into short sub-statements, and each sub-statement is successively input the NMT model, so that each segmented sub-statement is translated successively by the NMT model respectively; and then the translated sub-statements are directly spliced into a complete sub-statement. The sub-statements which are input into the NMT model for translation are short and translation accuracy of the NMT model is high, so that the accuracy for statement translation is increased. In addition, the application also provides a device for the long statement segmentation aiming at the neural machine translation so as to ensure application and implementation of the method in practice.

Description

technical field [0001] This application relates to the technical field of language translation, and more specifically, to the long sentence segmentation technology for neural machine translation. Background technique [0002] At present, Neural Machine Translation (Neural Machine Translation, abbreviated as NMT) based on deep learning has attracted more and more attention. In the NMT field, a common NMT model is a model based on the encoder-decoder structure. The NMT model mainly translates a sentence in a certain language (hereinafter referred to as a source sentence) into a sentence in another language (hereinafter referred to as a target sentence). [0003] Taking Chinese-English translation as an example, the model based on the encoder-decoder structure mainly obtains the encoding vector after the source sentence is encoded by the encoder, and then uses the decoder to decode the encoding vector to translate into the corresponding English sentence. In fact, the translat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/28G06F17/24G06F17/27
CPCG06F40/166G06F40/211G06F40/58
Inventor 熊德意邝少辉
Owner IOL WUHAN INFORMATION TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products