Method and system for deep nerve translation based on character encoding

A deep neural and character encoding technology, applied in the field of deep neural translation methods and systems, can solve problems such as increasing the difficulty of training, destroying model integrity, high-dimensional and sparse data, and enhancing long-term memory function.

Active Publication Date: 2016-11-16
HARBIN INST OF TECH SHENZHEN GRADUATE SCHOOL
View PDF4 Cites 123 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, the traditional statistics-based machine translation model has the following problems: First, we need to preprocess the source language and target language, which is prone to high-dimensional and sparse data problems; each module in the translation model is relatively independent, which increases the difficulty of training ; After the model is generated, if there are uncommon words, it needs to be reprocessed, destroying the integrity of the model

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for deep nerve translation based on character encoding
  • Method and system for deep nerve translation based on character encoding
  • Method and system for deep nerve translation based on character encoding

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0054] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0055] The character encoding-based deep neural translation method and system of the present invention. The main innovative work of the present invention is the following two parts: 1) translation model encoder module; 2) translation model decoder module. The first part encodes the input data, and uses the recurrent neural network to segment the data input at the character level and establish a language model. The second part, the decoder part, uses the source language and the target language to establish a word alignment model, and selects the optimal result for the output of the calculated candi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method and a system for deep nerve translation based on character encoding. A combined nerve network model is established by using an RNN to cover the whole translation process, and translation tasks are directly completed from the perspective of an encoder-decoder framework. The method comprises the following steps: A, word vector generation: performing word segmentation on character-level input data through neural network modeling and generating a word vector; B, language model generation: establishing grammar rules by utilizing the characteristic of memory of the recurrent neural network in time; C, word alignment model generation: obtaining the probability of translating multiple words in a source language statement into target language words; D, output: translating an inputted source language into a target language; E, translation model combination: establishing a deep nerve translation model (RNN-embed) based on character encoding in combination with neural network models in the four steps and accelerating model training by using CPU parallel computation.

Description

technical field [0001] The invention relates to the technical field of machine translation, in particular to a method and system for deep neural translation based on character coding. Background technique [0002] Machine translation is often called automatic translation technology. By using the programming ability of computers, one language is automatically converted into another language. The former is called the source language and the latter is called the target language. Today, the broad prospects of the subject of machine translation have been recognized, and there is no doubt that it will be a hot spot of applied technology. [0003] At present, machine translation can be divided into rule-based and corpus-based methods as a whole. Among them, the rule-based method has been studied to the syntactic stage, but its application in the general field is not strong, and it is often limited to specialized fields. There are applications. The corpus-based method can be subdi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/28
CPCG06F40/44G06F40/45
Inventor 张海军李婧萱
Owner HARBIN INST OF TECH SHENZHEN GRADUATE SCHOOL
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products