Unlock instant, AI-driven research and patent intelligence for your innovation.

A translation checking method and system thereof

A checking method and checking system technology, applied in the field of translation checking method and its system, can solve problems such as inability to judge whether it is correct or not

Inactive Publication Date: 2018-08-07
NEW FOUNDER HLDG DEV LLC +2
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0008] The technical problem to be solved by the present invention is that the translation detection method in the prior art depends to a large extent on bilingual experts, and at the same time cannot judge whether it is correct or not, but can only judge the quality of the translation, so as to provide a learning corpus, A translation checking method that trains a bigram model and automatically filters out "incorrect" or "wrong" phrase translations in a large number of related translations

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A translation checking method and system thereof
  • A translation checking method and system thereof
  • A translation checking method and system thereof

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0115] In this embodiment, a translation checking method is provided. The method flow chart is as follows: figure 1 As shown, it includes initialization processing and inspection evaluation processing.

[0116] The initialization process is based on a batch of corpus, the corpus is a corpus of a certain domain, and the binary grammar model is obtained by counting the probability information of the binary segmentation entry of the domain corpus, which is used as an inspection scoring model for the inspection and determination process Provide a basis for scoring the translation.

[0117] In the inspection evaluation process, the domain translation is scored by the inspection scoring model, the translation score is compared with a preset judgment threshold, and the translation with a score less than the preset judgment threshold is judged as "error", otherwise it is judged as "correct" .

[0118] The steps of the initialization process are as follows:

[0119] S11: Obtain a batch of te...

Embodiment 2

[0167] In this embodiment, except that step S13 is different from embodiment 1, other steps are the same as embodiment 1. In step S13, w i At the beginning w j Probability of occurrence f p (w i ,w j ) Method is:

[0168] Get all items with w from list L i The first binary segmentation entry, the second character w in the binary segmentation entry j Join the string S;

[0169] Store each character in the character string S into a set T;

[0170] Count the length n of the character string S, for each character w in the set T j , Count the character w j The number of occurrences m in the string S;

[0171] Then w i At the beginning w j Probability of occurrence f p (w i ,w j )for

[0172] f p (w i ,w j )=m / n

[0173] Among them, the initial value of m and n is zero.

[0174] First, count the length n of the character string S and each character w in the set T j , Count the character w j The number of occurrences m in the string S, and then take the ratio of n and m as w i At the beginning w...

Embodiment 3

[0176] In this embodiment, except that step S23 is different from embodiment 1, other steps are the same as embodiment 1. The method for scoring and evaluating the translation in step S23 is:

[0177] Score Translate =avg{Score i ,i=1,2,…,n-1}

[0178] Among them, Score i Is the score of a binary segmentation item, Score i =f p (w i ,w j ), f p (w i ,w j ) Is the binary segmentation item in the translation (w i w j ) The corresponding value in the model.

[0179] In the translation check method provided in this embodiment, the method for scoring the translation adopts the average value of each binary entry in the translation, which can effectively avoid the mis-scoring caused by some binary entries not included in the model.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention provides a method and device for checking a translation, comprising initialization and determination processing. Said initialization processing: training a bigram model on the basis of a batch of corpora, the bigram model serving as a scoring model and providing a basis for scoring a translation during said determination processing. Said determination processing: scoring a translation by means of said scoring model, comparing the translation score to a preset determination threshold, and determining a translation having a score of less than said preset threshold value to be "incorrect", otherwise, determining the translation to be "correct". The described technical solution effectively avoids the problem in the prior art of translation testing methods relying heavily on bilingual specialists or high-quality manual translations for reference, while also being unable to determine whether a translation is correct or not and only being able to judge the extent to which a translation is good or bad.

Description

Technical field [0001] The invention relates to a translation checking method and a system thereof, in particular to a translation checking method and a system based on a binary grammar model, belonging to the technical field of electric digital data processing methods. Background technique [0002] In recent years, the application of machine translation (Machine Translation) has become more and more extensive, and the quality requirements for machine translation translation have become higher and higher. In the field of translation, even the best translators can hardly meet the highest standards "trustworthiness, expressiveness, and elegance" required by the translation industry. Therefore, machine translation evaluation has become an important and difficult subject. [0003] Since it is necessary to know at least two languages ​​to evaluate translation quality, translation quality evaluation has become a very difficult intellectual activity. Therefore, the evaluation of the trans...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06F17/27G06F17/28
CPCG06F40/51
Inventor 叶茂王元龙金立峰汤帜徐剑波
Owner NEW FOUNDER HLDG DEV LLC
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More