Deep processing method for characters of document

A method for in-depth processing of document text, applied in the field of translation. It addresses problems such as inconsistent translations, increased workload for translators, and reduced translation efficiency, with the stated effects of shortening translation working time, reducing the difficulty of typesetting, and reducing flexibility.

Inactive Publication Date: 2016-07-13
张广睿 +1

AI Technical Summary

Problems solved by technology

[0003] In the prior art, a project or long document is often divided into several parts and translated by a team. Because translators have different habits, different translators often render sentences with the same meaning differently, which results in inconsistent translations.
In addition, dividing the work among a team causes translators to repeatedly translate words, phrases, or single sentences with the same meaning, which greatly increases their workload and greatly reduces translation efficiency.


Embodiment Construction

[0037] A method for in-depth processing of document text, comprising the following steps:

[0038] (1) Extract the text information from the document to be translated and convert the document into a Word document, an Excel document, or a similar format. Then process the extracted text with the clear-formatting function or the copy-and-replace function to unify its format, yielding a document in a uniform format, as shown in attached Figure 2.
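
The source performs this step with the formatting tools of Word or Excel. As a rough analogue for text that has already been extracted to a plain string, the following minimal sketch unifies line endings, invisible characters, and whitespace. The function name `unify_format` and the specific cleanup rules are assumptions made for illustration and are not taken from the patent.

```python
import re
import unicodedata


def unify_format(raw_text: str) -> str:
    """Return plain text with uniform line endings and whitespace.

    Hypothetical helper: the cleanup rules below are illustrative
    assumptions, not the procedure described in the patent.
    """
    # Normalize Windows and old Mac line endings to "\n".
    text = raw_text.replace("\r\n", "\n").replace("\r", "\n")
    # Drop control (Cc) and invisible format (Cf) characters,
    # keeping newlines and tabs so the line structure survives.
    text = "".join(
        ch for ch in text
        if ch in "\n\t" or unicodedata.category(ch) not in ("Cc", "Cf")
    )
    # Collapse runs of spaces, tabs, and no-break spaces inside each line.
    lines = [re.sub(r"[ \t\u00a0]+", " ", line).strip() for line in text.split("\n")]
    # Discard lines that are empty after cleanup.
    return "\n".join(line for line in lines if line)
```

For example, `unify_format("Hello\u200b  world\r\n\r\n  Second\tline ")` yields two clean lines, `Hello world` and `Second line`.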

[0039] (2) Automatically split the uniformly formatted document using one or a combination of line breaks, punctuation marks, spaces, and so on, producing a data set of text to be translated whose smallest unit is any one or several of words, phrases, and single sentences, as shown in attached Figure 3. After splitting, classify the content by type (text, punctuation marks, numbers, letters, etc.) and remove the non-translated text in the document, as shown in the attached Figur...
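
The splitting and classification in step (2) can be sketched as follows. This is a minimal illustration under stated assumptions: the delimiter set in `SPLIT_RE`, the four category names, and the rule that a lone ASCII letter counts as "letter" are all hypothetical choices, since the source does not specify them.

```python
import re

# Assumed delimiters: line breaks, common sentence-ending punctuation,
# and a period only when followed by whitespace or end of text
# (so that decimals such as "3.14" stay in one piece).
SPLIT_RE = re.compile(r"[\n。！？；!?;]+|\.(?=\s|$)")
NUMBER_RE = re.compile(r"[\d\s.,%+-]+")


def split_units(text: str) -> list[str]:
    """Split unified text into candidate translation units."""
    return [part.strip() for part in SPLIT_RE.split(text) if part.strip()]


def classify(unit: str) -> str:
    """Assign a unit to one of four illustrative categories."""
    if not any(ch.isalnum() for ch in unit):
        return "punctuation"
    if NUMBER_RE.fullmatch(unit):
        return "number"
    if re.fullmatch(r"[A-Za-z]", unit):
        return "letter"  # assumption: a lone letter, e.g. a list label
    return "text"


def translatable_units(text: str) -> list[str]:
    """Keep only the units that actually need translation."""
    return [unit for unit in split_units(text) if classify(unit) == "text"]
```

Given the input `"Open the valve. 3.14\nA\n--\nClose the valve!"`, the number, the lone letter, and the dashes are classified as non-translated content and dropped, leaving `Open the valve` and `Close the valve`.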

Abstract

The invention discloses a deep processing method for the text of a document. The method comprises the following steps: (1) extracting the text information of the document to be translated; (2) splitting the document into a data set of text to be translated in minimum units, and eliminating non-translated text and repeated text to be translated; (3) establishing the pre-translation document: first copying the text to be translated into an original-text list, then writing the original and translated text of special terms into a corresponding special-term list, thereby obtaining a pre-translation document with special terms; (4) replacing the special-term original text that appears in the original-text list of the pre-translation document with the corresponding translated text from the special-term list, and processing the text data a second time to obtain the final pre-translation document; (5) having a translator translate the original-text list; and (6) replacing the original text with the translated text to obtain the translated document. The advantage of the method is that repeated content in the document is thoroughly eliminated before translation, which improves translation efficiency.
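
The deduplication, term-list substitution, and final replacement described in the abstract can be sketched as below. This is a hedged illustration, not the patented implementation: the function names, the use of plain dictionaries for the term list and the translations, and the longest-match-first replacement rule are all assumptions.

```python
import re


def _replace_all(text: str, mapping: dict[str, str]) -> str:
    """Replace every key of mapping in one pass, longest keys first."""
    if not mapping:
        return text
    keys = sorted(mapping, key=len, reverse=True)
    pattern = re.compile("|".join(re.escape(key) for key in keys))
    return pattern.sub(lambda match: mapping[match.group(0)], text)


def deduplicate(units: list[str]) -> list[str]:
    """Drop repeated units while keeping first-seen order."""
    return list(dict.fromkeys(units))


def build_pretranslation_table(
    units: list[str], terms: dict[str, str]
) -> list[tuple[str, str]]:
    """Pair each unique unit with a copy whose special terms are pre-filled."""
    return [(unit, _replace_all(unit, terms)) for unit in deduplicate(units)]


def replace_in_document(document: str, translations: dict[str, str]) -> str:
    """Substitute each original unit in the document with its translation."""
    return _replace_all(document, translations)
```

A short usage example with hypothetical data:

```python
units = ["Open the valve", "Close the valve", "Open the valve"]
terms = {"valve": "阀门"}
table = build_pretranslation_table(units, terms)
# table holds two rows: each sentence is translated only once.

translations = {"Open the valve": "打开阀门", "Close the valve": "关闭阀门"}
result = replace_in_document("Open the valve. Close the valve. Open the valve.", translations)
```

Replacing in a single regular-expression pass avoids re-substituting text that an earlier replacement has already produced, which a chain of `str.replace` calls could do.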

Description

Technical field

[0001] The invention relates to the technical field of translation, in particular to a method for in-depth processing of document text.

Background

[0002] Since the mid-1980s, the extensive use of corpora and multi-engine machine translation methods has significantly improved the performance and efficiency of translation software, and many translation programs have appeared. Translating with pre-written software greatly increases the speed of text translation. However, because of the particular nature of language expression, the translation quality of such software has been repeatedly criticized. Translation software works by storing the semantics of the two languages in one-to-one correspondence and replacing them mechanically during translation. Because language expression is diverse, each character, word, phrase, or single sentence often corresponds to more than one meaning, and the translation obtaine...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F17/28, G06F17/21, G06F17/22, G06F17/27
CPC: G06F40/103, G06F40/131, G06F40/247, G06F40/58
Inventor: 张广睿
Owner: 张广睿