A neural machine translation method based on a user dictionary

A machine translation and dictionary technology, applied in the field of machine translation, can solve problems such as not being able to get a good response, and achieve the effects of flexible and convenient technology, high-precision requirements, and low coupling

Active Publication Date: 2019-06-04
沈阳雅译网络技术有限公司
View PDF3 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0014] Aiming at the deficiencies in the prior art that directly introducing the user dictionary into the neural network-based machine translation model does not get a go

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A neural machine translation method based on a user dictionary
  • A neural machine translation method based on a user dictionary
  • A neural machine translation method based on a user dictionary

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0063] The present invention will be further elaborated below in conjunction with the accompanying drawings of the description.

[0064] The present invention is used to solve the high-precision requirements of different users for translation translations through user-defined dictionaries. User dictionaries include noun phrases such as professional nouns or named entities, such as figure 1 shown.

[0065] figure 2 For the application of user dictionary technology in the training process of the neural machine translation system, the neural machine translation model is composed of an encoder and a decoder. A method for neural machine translation based on the user dictionary of the present invention includes the following steps:

[0066] 1) Construct user dictionary: Crawl massive data from the network through web crawler technology, and then use named entity recognition technology and named entity extraction technology to obtain bilingual word pairs from the corpus to construc...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a neural machine translation method based on a user dictionary, and the method comprises the steps: crawling mass data from a network through a web crawler technology, and obtaining a bilingual word pair from a corpus through an extraction technology to construct the user dictionary; Using a user dictionary to perform dictionalization on the training corpus, extracting thetraining corpus, and mixing the extracted training corpus with the original corpus to serve as input of neural network model training; Performing consistency detection on the user dictionary placeholders contained in the sentence pairs; Processing the training data by using a user dictionary, inputting the training data into a neural network model, and starting to train the model until the model converges; And inputting a sentence containing the user dictionary, obtaining dictionary information to replace the placeholder, and translating at the same time to obtain a high-precision translated text matched with the information in the user dictionary. During translation, the high-precision requirements of different users on noun phrases and named entities are well met, and the user-defined dictionary library is added according to the requirements and translation habits of the users, so that the high-quality requirements of the different users on translated texts are met.

Description

technical field [0001] The invention relates to a machine translation method, in particular to a neural machine translation method based on a user dictionary. Background technique [0002] Machine translation realizes the function of mutual conversion between two languages, and its translation ability is automatically learned from a large number of parallel corpora. In the neural machine translation task, a high-performance neural machine translation system needs a large amount of bilingual corpus as input. The bilingual parallel sentence pairs in these corpora are processed into multiple continuous word sequences after word segmentation, and then converted into The vector form is input into the encoder for learning and training. This method can learn the connection between the source language and the target language very well, but it cannot meet all translation needs. In some scenarios that require high-precision translations, the following translations may appear inaccura...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06F17/28G06N3/04G06N3/08
CPCY02D10/00
Inventor 杜权徐萍朱靖波肖桐
Owner 沈阳雅译网络技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products