Translation processing method and system

A translation processing method, applied in the field of machine translation, that addresses problems such as the lack of high-quality parallel corpora and the poor translation performance of translation models, and achieves better translation quality, good consistency, and practicability.

Active Publication Date: 2018-11-23
TSINGHUA UNIV +1

AI Technical Summary

Problems solved by technology

[0004] Therefore, machine translation based on neural networks faces a major problem: most language pairs do not have high-quality, large-scale parallel corpora.
However, since each language has its own unique characteristics, such as word order and vocabulary, training a multilingual translation model with only a "shared" neural network may ignore the characteristics of each language and degrade the translation quality of the model.

Examples

Embodiment Construction

[0033] The specific implementation manners of the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. The following examples are used to illustrate the present invention, but are not intended to limit the scope of the present invention.

[0034] In the prior art, multilingual neural machine translation models have been proposed to alleviate the problem of data sparsity. The core idea of these methods is "sharing": the parallel corpora of multiple language pairs are used to train the neural machine translation model, and some sub-nodes of the neural network, or even the entire neural network, are shared, which mitigates the problem of sparse training corpora to a certain extent. However, since each language has its own unique characteristics, such as word order and vocabulary, training a multilingual translation model with only a "shared" neural network may ignore the characteristics of each language, resu...

Abstract

Embodiments of the invention provide a translation processing method and system. The method comprises: obtaining a sentence in a source language; encoding the sentence to obtain a vector sequence, wherein the vector sequence comprises word vectors converted from the words obtained by segmenting the sentence; predicting the corresponding candidate words in a target language word by word according to the vector sequence; and generating a sentence in the target language from the predicted candidate words. In predicting any one of the candidate words, a plurality of preliminarily selected words are obtained from a preset translation vocabulary, the translation probability of each preliminarily selected word is calculated with a pre-trained machine translation model, and the candidate word is selected from the preliminarily selected words according to these translation probabilities. With this translation processing method and system, language pairs with sparse data can obtain better translation quality.
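The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the whitespace segmentation, the one-hot toy embeddings, the `toy_score` function standing in for the pre-trained translation model, the `</s>` end marker, and the choice to preselect the whole vocabulary are all assumptions made purely for illustration.

```python
import math
from typing import Callable, Dict, List, Sequence

Vector = List[float]
EOS = "</s>"  # hypothetical end-of-sentence marker, not named in the source


def encode(sentence: str, embeddings: Dict[str, Vector]) -> List[Vector]:
    """Segment the sentence (here: by whitespace) and map each word to a vector."""
    unk = embeddings["<unk>"]
    return [embeddings.get(word, unk) for word in sentence.split()]


def softmax(scores: Sequence[float]) -> List[float]:
    """Turn raw scores into probabilities that sum to 1."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def decode(
    vectors: List[Vector],
    vocabulary: Sequence[str],
    score: Callable[[List[Vector], List[str], str], float],
    max_len: int = 20,
) -> List[str]:
    """Predict target words one at a time, picking the most probable each step."""
    output: List[str] = []
    while len(output) < max_len:
        # Assumption: every vocabulary entry is a preliminarily selected word.
        preselected = list(vocabulary)
        probs = softmax([score(vectors, output, w) for w in preselected])
        best = max(range(len(preselected)), key=probs.__getitem__)
        if preselected[best] == EOS:
            break
        output.append(preselected[best])
    return output


# Toy data standing in for trained parameters (illustrative only).
SOURCE_EMB = {"hallo": [1.0, 0.0, 0.0], "welt": [0.0, 1.0, 0.0], "<unk>": [0.0, 0.0, 1.0]}
TARGET_EMB = {"hello": [1.0, 0.0, 0.0], "world": [0.0, 1.0, 0.0]}


def toy_score(vectors: List[Vector], prefix: List[str], word: str) -> float:
    """Stand-in for the translation model: dot product with the aligned source vector."""
    step = len(prefix)
    if step >= len(vectors):
        return 1.0 if word == EOS else 0.0
    if word == EOS:
        return 0.0
    return sum(a * b for a, b in zip(vectors[step], TARGET_EMB[word]))


def translate(sentence: str) -> str:
    vectors = encode(sentence, SOURCE_EMB)
    return " ".join(decode(vectors, list(TARGET_EMB) + [EOS], toy_score))
```

Calling `translate("hallo welt")` walks through the same three stages as the abstract: encoding into a vector sequence, word-by-word prediction over the preselected words, and assembly of the target sentence. A real system would replace `toy_score` with a trained neural model and would typically narrow the preselected set at each step.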

Description

Technical field

[0001] The present invention relates to the technical field of machine translation, and more specifically, to a translation processing method and system.

Background

[0002] With the deepening of international exchanges, people's demand for language translation is increasing day by day. However, there are many kinds of languages in the world, each with its own characteristics and flexible forms, making the training of machine translation models between all language pairs an unsolved problem.

[0003] In order to realize automatic machine translation, current technologies are usually based on neural network methods. Neural networks are data-driven, and for this, large-scale, high-quality parallel corpora need to be collected to obtain reliable translation models. However, high-quality parallel corpora often only exist among a small number of languages, and are often limited to certain specific fields, such as government documents, news, etc. [...

Application Information

IPC(8): G06F17/28, G06F17/27
CPC: G06F40/279, G06F40/44, G06F40/58
Inventors: 刘洋, 丁延卓, 栾焕博, 孙茂松, 翟飞飞, 许静芳
Owner: TSINGHUA UNIV