Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

XML document translation and evaluation method based on attention mechanism

An evaluation method and attention technology, applied in the field of neural machine translation, can solve the problem of not being able to process well, and achieve the effect of improving the effect of XML translation

Inactive Publication Date: 2021-01-22
沈阳雅译网络技术有限公司
View PDF6 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] Aiming at the deficiency that the neural machine model in the prior art cannot handle the relationship between tags and translations in XML well, the technical problem to be solved by the present invention is to provide an XML document translation and evaluation method based on the attention mechanism, using human Translation habits, data preprocessing, postprocessing and other means have improved the translation of XML documents, and completed a new evaluation index to help evaluate the translation effect of XML documents and help to correct the translation effect more accurately

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • XML document translation and evaluation method based on attention mechanism
  • XML document translation and evaluation method based on attention mechanism
  • XML document translation and evaluation method based on attention mechanism

Examples

Experimental program
Comparison scheme
Effect test

example 1

[0077]Original English:

[0078]

[0079] every students

[0080] Mr. Zhang

[0081] You are required to confirm receipt of the announcement

[0082] Students who go out of the city submit their own and parents'application to take photos and send them to me.The application indicates that the epidemic prevention and control and safety problems after leaving schoolshall be in my charge,and my signature shall indicate the basic information , the specific address, the time of going and returning, and the specific trainnumber.

[0083]

[0084]Chinese translation:

[0085]

[0086] All students

[0087] Teacher Zhang

[0088] The announcement requires you to confirm receipt

[0089] The student who goes out of the city office submits the application for me and his parents to take photos and send it to me. The application states that I am responsible for epidemic prevention and control and safety issues after leaving school, and signs, indicating basic information, specific addresses to go to, and time...

example 2

[0092]Original English:

[0093]

[0094] Xiao Li

[0095] Miss Wang

[0096] written request for leave

[0097] I have a fever due to a cold.The doctor suggested that I stayin bed for three days.If I can't insist on studying in school, I'd like to ask for leave for three days(August 8to August 10), please approve .

[0098]

[0099]Chinese translation:

[0100]

[0101] Xiao Li

[0102] Miss Wang

[0103] written request for leave

[0104] I have a fever and a cold. The doctor advised me to stay in bed for three days. I couldn't insist on studying at school. I specifically asked for three days of leave (August 8-August 10). Please approve.

[0105]

[0106]The present invention uses wmt English-Chinese bilingual data to construct an XML parallel corpus. For tags that cannot be processed by traditional models, the present invention uses methods such as user dictionaries to try to preprocess the data, and then use traditional models for training and translation. According to the evaluation system of th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an XML document translation and evaluation method based on an attention mechanism. The method comprises the following steps of adopting a translation model based on a self-attention mechanism for model training; constructing a parallel corpus based on an XML language, constructing a word list, and converting the obtained corresponding words from a one-hot vector to a word embedding vector; constructing an XML document preprocessing system, judging and setting whether sentences under various label types need to be translated or not, and replacing labels in the XML sentences needing to be translated with unified symbols; constructing an XML document post-processing system, and restoring uniform symbols in sentences generated by the translation model into labels corresponding to the input files; creating an XML document translation quality evaluation system and determining the accuracy of the label content and number of sentences translated by the model, the sentence translation quality and whether the positions of labels in the sentences are correct or not. Whether sentences under each label type are translated or not is set and selected, and the XML labels are used for constructing the user dictionary, so that the label translation problem is solved.

Description

Technical field[0001]The invention relates to a neural machine translation technology, in particular to an XML document translation and evaluation method based on an attention mechanism.Background technique[0002]Machine translation refers to the process of using a computer to convert one natural language into another natural language. With the development of technology, neural machine translation technology based on deep learning has become the mainstream. Compared with traditional statistical-based machine translation methods, neural machine translation translations perform better in terms of sentence fluency and sentence meaning accuracy. Judging from the BLEU value of the machine translation quality evaluation index, in the case of abundant corpus, the effect of neural machine translation is far better than statistical machine translation. And in some tasks, neural machine translation has achieved results comparable to professional translators.[0003]The achievements of neural mac...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06F40/58G06F40/51G06F40/289G06F40/211
CPCG06F40/58G06F40/51G06F40/211G06F40/289
Inventor 刘兴宇杜权
Owner 沈阳雅译网络技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products