Sentence-level parallel corpus making method and system realized based on man-machine collaboration concept

This invention applies human-machine collaboration and parallel corpus techniques to natural language translation, digital data processing, and natural language data processing. It addresses translations that are inaccurate or deviate from the actual meaning of the original text, and aims to combine speed with accuracy.

Pending Publication Date: 2021-11-12
路米科技江苏有限公司

AI Technical Summary

Problems solved by technology

[0003] Existing Chinese-English translation software generally translates word by word. As a result, the output often deviates substantially from the actual meaning of the original text, and the translation is inaccurate.

Examples

Embodiment

[0033] Before the embodiments of the present invention are described, the terms used in this embodiment are explained.

[0034] Explanation of terms: A parallel corpus (plural: parallel corpora) is a bilingual or multilingual corpus composed of source texts and their corresponding target-language texts. Alignment can be at the word, sentence, paragraph, or article level.
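As an illustration of the definition in [0034], the following is a minimal sketch of how a parallel corpus with its four alignment levels might be represented. The names `AlignmentLevel`, `AlignedUnit`, and `ParallelCorpus` are hypothetical and do not come from the patent text.

```python
from dataclasses import dataclass, field
from enum import Enum
from typing import List


class AlignmentLevel(Enum):
    """The four alignment granularities named in [0034]."""
    WORD = "word"
    SENTENCE = "sentence"
    PARAGRAPH = "paragraph"
    ARTICLE = "article"


@dataclass(frozen=True)
class AlignedUnit:
    """One source text paired with its target-language counterpart (hypothetical type)."""
    source: str
    target: str
    level: AlignmentLevel


@dataclass
class ParallelCorpus:
    """A bilingual corpus holding aligned units (hypothetical type)."""
    source_lang: str
    target_lang: str
    units: List[AlignedUnit] = field(default_factory=list)

    def at_level(self, level: AlignmentLevel) -> List[AlignedUnit]:
        """Return only the units aligned at the requested granularity."""
        return [u for u in self.units if u.level is level]


corpus = ParallelCorpus("zh", "en")
corpus.units.append(AlignedUnit("你好。", "Hello.", AlignmentLevel.SENTENCE))
corpus.units.append(AlignedUnit("你好", "hello", AlignmentLevel.WORD))
sentence_units = corpus.at_level(AlignmentLevel.SENTENCE)
```

The invention described here works at the sentence level only, so in this sketch it would produce units tagged `AlignmentLevel.SENTENCE`.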

[0035] It should be noted that, in addition to Chinese and English, the present invention can also process other language pairs, such as Chinese and Japanese, Chinese and Korean, or Japanese and Korean. It can also handle pairs involving minority languages, such as Chinese and Tibetan or Chinese and Uyghur. This embodiment uses Chinese and English as the example.

[0036] Please refer to Figures 1-3. In the first aspect, the embodiment of the present invention provides a method for making sentence-level parallel corpus based on the con...

Abstract

The invention discloses a method and system for making a sentence-level parallel corpus based on the concept of man-machine collaboration, and relates to the technical field of sentence-level parallel corpus production. The method comprises the following steps: loading a Chinese-English bilingual article into a source data textbox; scanning and identifying the Chinese and English sentences in the source data textbox; automatically copying the currently identified Chinese and English sentences to a Chinese-English sentence textbox; manually checking the current sentence pair in that textbox to obtain a manual judgment result, and entering the result into the man-machine collaboration system; and proceeding to the next sentence. The man-machine collaboration system combines human initiative with the regularity of the computer, with the aim of achieving both speed and accuracy.
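The steps in the abstract can be sketched roughly as a loop in which the machine proposes a sentence pair and a human accepts or rejects it. This is a minimal sketch under stated assumptions, not the patented implementation: the punctuation-based splitters, the pairing of sentences by position, and the names `split_zh`, `split_en`, `SentencePair`, and `make_corpus` are all hypothetical, and the textboxes are replaced by plain strings.

```python
import re
from dataclasses import dataclass
from typing import Callable, List


@dataclass(frozen=True)
class SentencePair:
    zh: str
    en: str


def split_zh(text: str) -> List[str]:
    """Assumed splitter: cut after Chinese sentence-ending punctuation."""
    parts = re.findall(r"[^。！？]+[。！？]?", text)
    return [p.strip() for p in parts if p.strip()]


def split_en(text: str) -> List[str]:
    """Assumed splitter: cut after '.', '!' or '?'.
    Naive on purpose; abbreviations and decimals would be split incorrectly."""
    parts = re.findall(r"[^.!?]+[.!?]?", text)
    return [p.strip() for p in parts if p.strip()]


def make_corpus(
    zh_text: str,
    en_text: str,
    judge: Callable[[SentencePair], bool],
) -> List[SentencePair]:
    """Propose sentence pairs one at a time and keep those the reviewer accepts.

    Assumption: the i-th Chinese sentence corresponds to the i-th English
    sentence. zip() stops at the shorter list, so surplus sentences are dropped.
    """
    accepted: List[SentencePair] = []
    for zh, en in zip(split_zh(zh_text), split_en(en_text)):
        pair = SentencePair(zh, en)   # machine step: identify and copy the pair
        if judge(pair):               # human step: manual judgment result
            accepted.append(pair)
        # loop continues to the next sentence
    return accepted


# Stand-in for the human reviewer: reject pairs whose English side is "Goodbye."
def demo_judge(pair: SentencePair) -> bool:
    return pair.en != "Goodbye."


all_pairs = make_corpus("你好。再见。", "Hello. Goodbye.", lambda p: True)
kept_pairs = make_corpus("你好。再见。", "Hello. Goodbye.", demo_judge)
```

In an interactive tool, `judge` could instead prompt the reviewer (for example via `input()`), which is where the human judgment described in the abstract would enter the loop.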

Description

Technical field

[0001] The present invention relates to the technical field of sentence-level parallel corpus production, and in particular to a sentence-level parallel corpus production method and system based on the concept of man-machine collaboration.

Background

[0002] With the development of the world economy, cooperation across countries and ethnic groups is increasingly common, and command of multiple languages, such as English, is particularly important for cooperation and communication. However, because many people do not live in an English-speaking environment, they forget the English they have learned to varying degrees. Since many situations now require reading English documents, Chinese-English translation software plays a particularly important role.

[0003] Existing Chinese-English translation software generally translates word by word to obtain its results. In this way, the translation results o...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F40/211, G06F40/279, G06F40/58
CPC: G06F40/211, G06F40/279, G06F40/58
Inventor: 祁智恒
Owner: 路米科技江苏有限公司