Multi-language-pair neural network machine translation method and system

A neural network and machine translation technology, applied in the field of multilingual neural network machine translation methods and systems, can solve problems such as wasting server resources, increasing translation speed, and poor translation quality, so as to save server resources, improve translation quality, Effect of reducing vocabulary size

Inactive Publication Date: 2018-09-21
GLOBAL TONE COMM TECH
View PDF3 Cites 56 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] (1) High-quality bilingual parallel corpus is very scarce, especially parallel corpus between minor languages
For machine translation, there is a lack of high-quality bilingual parallel corpus, and it is very difficult to train a usable machine translation system. The translation quality is poor, and the translation performance is poor, especially in terms of fidelity.
Although using English as the interlanguage can solve some problems, the translation effect is not ideal, and the translation speed is doubled
Moreover, a machine translation system can only translate language pairs in one direction, and language pairs that are used infrequently will waste server resources

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-language-pair neural network machine translation method and system
  • Multi-language-pair neural network machine translation method and system
  • Multi-language-pair neural network machine translation method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0052] In order to make the object, technical solution and advantages of the present invention more clear, the present invention will be further described in detail below in conjunction with the examples. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0053] The neural network machine translation method for multilingual pairs provided by the embodiment of the present invention uses multiple bilingual parallel corpora of the same language family, and after byte pair coding, maps them to the same high-dimensional vector space, so that multiple languages ​​can share the same semantic space ; Vocabularies of the same language family are in the same vector space, and in the translation of language directions without direct bilingual parallel corpora, learn information from each other; for translation directions with low usage rates, use the same model for translation.

[0054...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the technical field of computer software and discloses a multi-language-pair neural network machine translation method and system. A plurality of bilingual parallel corpora ofa same language system are utilized and mapped to a same high-dimensional vector space after byte pair encoding, so that multiple languages share a same semantic space, the size of a word list is reduced, model parameters are reduced, and convergence of a model is accelerated. Words of a same language family are in the same vector space, more information can be learned mutually, the information which can not be learned through only certain bilingual parallel corpora can be learnt, and the quality of word vectors is improved. The machine translation system can be used for translation in the language direction without direct bilingual parallel corpora, and the translation quality in the scarce parallel corpus translation direction is greatly improved through mutual information learning. Meanwhile, the same model is used for translation for the translation direction low in utilization rate, occupation of a server is reduced, and the utilization rate of the server is increased.

Description

technical field [0001] The invention belongs to the technical field of computer software, and in particular relates to a neural network machine translation method and system for multilingual pairs. Background technique [0002] At present, the existing technologies commonly used in the industry are as follows: [0003] Machine translation is a process of translating one natural language into another natural language using machine learning techniques. As an important branch of computational linguistics, it involves cognitive science, linguistics and other disciplines, and is one of the ultimate goals of artificial intelligence. [0004] The existing mainstream machine translation model uses an encoding-decoding structure based on a self-attention mechanism, which consists of an encoder and a decoder. Both are dominated by the self-attention layer. [0005] The translation process mainly includes: first, map the input word to a high-dimensional vector space to obtain a wor...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/27G06F17/28G06N3/08
CPCG06N3/08G06F40/216G06F40/289G06F40/58
Inventor 贝超程国艮
Owner GLOBAL TONE COMM TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products