Document character processing method

A document text processing method, applied in the field of translation, that addresses the problems of increased translator workload, inconsistent translations, and reduced translation efficiency, and achieves the effects of low typesetting difficulty, shortened translation work time, and reduced flexibility.

Inactive Publication Date: 2016-07-27
张广睿 +1

AI Technical Summary

Problems solved by technology

[0003] In the existing technology, a project or a long document is often divided into multiple parts and translated by a team. Because translators have different translation habits, different translators often render sentences with the same meaning differently, which results in inconsistent translations.
In addition, dividing the work among a team causes translators to repeatedly translate words, phrases, or single sentences with the same meaning, which greatly increases the translators' workload and greatly reduces translation efficiency.

Examples

Embodiment Construction

[0032] A document text processing method, comprising the following steps:

[0033] (1) Extract the text information from the document to be translated and convert the document into a Word document, an Excel document, or the like. Then process the extracted text with the clear-formatting function or the copy-and-replace function to unify its format, thereby obtaining a document in a uniform format, as shown in attached Figure 1.
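
Step (1) can be approximated in code for text that has already been extracted as a plain string. The sketch below is only an illustration, not the patent's implementation: the function name `unify_format` is hypothetical, and Unicode NFKC normalization plus whitespace cleanup is an assumed stand-in for the "clear formatting" function of a word processor. Conversion to Word or Excel is out of scope here.

```python
import re
import unicodedata


def unify_format(raw_text: str) -> str:
    """Hypothetical helper: bring extracted text into one plain, uniform format."""
    # Normalize Windows and old Mac line endings to "\n".
    text = raw_text.replace("\r\n", "\n").replace("\r", "\n")
    # Assumption: NFKC folding (e.g. full-width letters to ASCII) is acceptable
    # for the document at hand; it can change some symbols.
    text = unicodedata.normalize("NFKC", text)
    # Drop control and invisible format characters, keeping newlines and tabs.
    text = "".join(
        ch for ch in text
        if ch in "\n\t" or unicodedata.category(ch) not in ("Cc", "Cf")
    )
    # Collapse runs of spaces and tabs into a single space.
    text = re.sub(r"[ \t]+", " ", text)
    # Trim each line and discard lines that are now empty.
    lines = [line.strip() for line in text.split("\n")]
    return "\n".join(line for line in lines if line)
```

Calling `unify_format` on text pasted from differently formatted sources yields one line per paragraph with consistent spacing, which makes the later splitting step more predictable.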

[0034] (2) Use one or a combination of line breaks, punctuation marks, spaces, etc. to automatically split the uniformly formatted document into a set of text data to be translated, whose smallest units are any one or more of words, phrases, and single sentences, as shown in attached Figure 2. After splitting, classify the content by type (text, punctuation marks, numbers, letters, etc.) and remove the non-translatable text from the document, as shown in attached Figure 3. The...
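
The splitting, classification, and filtering in step (2) might look like the following minimal sketch. All names (`split_units`, `classify`, `build_units`) and the exact delimiter set are assumptions made for illustration; the patent does not specify them. Removal of repeated units, which the abstract also lists under this step, is included.

```python
import re

# Assumed delimiters: line breaks, common sentence-ending punctuation, and a
# period followed by whitespace or end of text (so "3.14" is not split).
SPLIT_PATTERN = re.compile(r"\n+|[。！？；!?;]+|\.(?=\s|$)")


def split_units(text: str) -> list[str]:
    """Split the uniformly formatted text into candidate translation units."""
    return [part.strip() for part in SPLIT_PATTERN.split(text) if part.strip()]


def classify(unit: str) -> str:
    """Rough type classification of a unit (an assumed, simplified scheme)."""
    if not any(ch.isalnum() for ch in unit):
        return "punctuation"  # no letters or digits at all
    if not any(ch.isalpha() for ch in unit):
        return "number"       # digits with separators, e.g. dates
    return "text"             # contains letters or CJK characters


def build_units(text: str) -> list[str]:
    """Keep translatable units only and remove repeats, preserving order."""
    translatable = [u for u in split_units(text) if classify(u) == "text"]
    return list(dict.fromkeys(translatable))
```

For the input `"Open the file. Save the file!\n2016-07-27\nOpen the file.\n***"`, `build_units` keeps `"Open the file"` and `"Save the file"` once each and discards the date and the row of asterisks.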

Abstract

The invention discloses a document text processing method. The method comprises the following steps: (1) extracting the text information from the document to be translated and unifying its format; (2) automatically splitting the document into a set of text data to be translated in its smallest units, and removing the non-translatable text and the repeated text data from that set; (3) creating a pre-translation processing document, copying the text data to be translated into a source text list, and writing the source and target forms of the related specialized terms into the corresponding term lists, thereby obtaining a pre-translation processing document matched with specialized terms; (4) having a translator translate the text data in the source text list of the pre-translation processing document, thereby obtaining a translated processing document; and (5) using a replacement function to replace the source text with the translated text, thereby obtaining the translated document. With this method, repeated words, phrases, or simple sentences can be removed before the document is translated, which reduces the amount of text the translator must handle and improves translation efficiency.
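
Steps (3) to (5) of the abstract can be sketched as a small table of source units with matched terms, followed by a replacement pass. This is an illustrative sketch under stated assumptions: the row layout (`source`, `terms`, `target`), the function names, and the glossary dictionary are all hypothetical, and the human translation of step (4) is represented simply by filling in the `target` field.

```python
import re


def build_pretranslation_table(units: list[str], glossary: dict[str, str]) -> list[dict]:
    """Step (3): one row per unique source unit, with the glossary terms it contains."""
    rows = []
    for unit in units:
        matched = {src: tgt for src, tgt in glossary.items() if src in unit}
        rows.append({"source": unit, "terms": matched, "target": ""})
    return rows


def apply_translations(document: str, rows: list[dict]) -> str:
    """Step (5): replace each source unit in the document with its translation."""
    mapping = {row["source"]: row["target"] for row in rows if row["target"]}
    if not mapping:
        return document
    # Longest sources first so a short unit never pre-empts a longer one;
    # a single regex pass avoids re-replacing already translated text.
    ordered = sorted(mapping, key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(src) for src in ordered))
    return pattern.sub(lambda match: mapping[match.group(0)], document)
```

Because each unique unit appears only once in the table, a translator fills in each `target` once, and `apply_translations` then propagates it to every occurrence in the document, which is where the consistency and time savings described above come from.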

Description

Technical field

[0001] The invention relates to the technical field of translation, in particular to a method for processing document text.

Background technique

[0002] Since the mid-1980s, with the extensive use of corpora and multi-engine machine translation methods, the performance and efficiency of translation software have improved significantly, and many translation software products have emerged. Translating with pre-written software programs greatly improves the speed of text translation. However, because of the particular nature of language expression, the translation quality of such software has been repeatedly criticized. Translation software works by storing the semantics of the two languages in one-to-one correspondence and substituting them mechanically during translation. Because language expression is diverse, each character, word, phrase, or single sentence often corresponds to more than one meaning, and the translation obtained by complet...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/28, G06F17/25, G06F40/189
CPC: G06F40/189, G06F40/45, G06F40/47, G06F40/55
Inventor 张广睿
Owner 张广睿