Automatic editing-after-translating system and method for multisource neural network based on splicing-remixing mode
A post-translation editing and neural network technology, which is applied in the fields of natural language processing and machine translation, can solve problems such as missing translations, improve the overall quality, improve translation fidelity, and improve the overall translation quality
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Example Embodiment
[0058] Example 1
[0059] This embodiment combines with figure 1 , Describes the detailed composition and training and decoding process of a multi-source neural network post-editing system and method based on splicing and remixing of the present invention.
[0060] From figure 1 It can be seen that the training module is connected to the decoding module.
[0061] The training process of the training module includes the following steps:
[0062] Step A: Collect all the corpus needed in the training process of this system;
[0063] Among them, each corpus mainly includes training original corpus and reference translation corpus; among them, training original corpus and reference translation corpus are parallel corpus; assuming N=600000, that is, the training original has 60,000 sentences;
[0064] Training original corpus, denoted as: {source 1 ,source 2 ,…,Source 600000 },
[0065] Training target corpus, denoted as {ref 1 ,ref 2 ,...,Ref 600000 },
[0066] The preliminary translation res...
Example Embodiment
[0093] Example 2
[0094] This embodiment uses specific sentences as examples to illustrate the effects of the system and method.
[0095] In a specific example, the quality of translation is intuitively reflected in fidelity and fluency, where the increase in fidelity is refined to the accuracy of word selection.
[0096] Assume that the original translation reads "However, the challenges of the past are not limited to subsidizing public housing. Private housing is also full of major challenges."
[0097] The preliminary machine translation system uses the Moses statistical machine translation system. The translation result is "however, the pastchallenge, not in the funding of public housing, private housing is full of challenge." In this sentence, the keyword "funding" in the original translation is Translated into "funding", which means "to provide funds for...", it lacks the meaning of help and is not accurate enough. At the same time, the sentence pattern of the original translat...
Example Embodiment
[0100] Example 3
[0101] This embodiment explains in a statistical sense that the system and method directly use the preliminary translation result as the source language training single-source neural network post-editing system compared with the original translation without adding the original text. The advantages of neural network automatic translation editing system in overall translation quality.
[0102] Assume that the training source and reference translation dataset used for the training module has 600,000 sentences, and the translation source dataset used for the test module has 1597 sentences. The preliminary machine translation system uses the Moses statistical machine translation system, the score uses the multi-bleu script, and the BLEU value Represents the overall translation quality. The scores of one yuan to four yuan are the quantitative indicators of fidelity and fluency. The specific scores are described in Table 1 below:
[0103] Table 1: The statistical compari...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap