The present invention relates to a neural
machine translation method for introducing source language block information to
encode. The method comprises: inputting bilingual
sentence-level parallel data, and carrying out word segmentation on the source language and the target language respectively to obtain bilingual parallel
sentence pairs after being subject to word segmentation; encoding the source
sentence in the bilingual parallel sentence pairs after being subject to word segmentation according to the
time sequence, obtaining the state of each
time sequence on the
hidden layer of the lastlayer, and segmenting the input source sentence by blocks; according to the state of each
time sequence of the source sentence and the segmentation information of the source sentence, obtaining the block encoding information of the source sentence;
combing the time sequence encoding information with the block encoding information to obtain final source sentence memory information; and by dynamically querying the source sentence memory information, using attention mechanism to generate a
context vector at each moment through a decoder network, and extracting feature vectors for word prediction.According to the method provided by the present invention, block segmentation is automatically carried out on the source sentence without the need of any pre-divided sentence to participate in the training, and the method can capture the latest and the best block segmentation manner of the source sentence.