A machine translation method and device based on feature stem extraction

A machine translation technology, applied in the field of machine translation, that addresses problems such as the low accuracy and low quality of cross-language text translation.

Active Publication Date: 2020-06-12
DONGHUA UNIV

AI Technical Summary

Problems solved by technology

[0007] The purpose of the present invention is to overcome the low quality and low accuracy of cross-lingual text translation in the prior art, and to provide a machine translation method and device based on characteristic sentence stem extraction that extracts characteristic sentence stems accurately, requires little processing, and translates cross-language texts with good quality and high accuracy.



Examples


Embodiment Construction

[0091] The present invention will be further described below in combination with specific embodiments. It should be understood that these examples are only used to illustrate the present invention and are not intended to limit the scope of the present invention. In addition, it should be understood that after reading the teachings of the present invention, those skilled in the art can make various changes or modifications to the present invention, and these equivalent forms also fall within the scope defined by the appended claims of the present application.

[0092] A machine translation method based on characteristic sentence stem extraction comprises the following specific steps:

[0093] (1) Establish a characteristic sentence stem database; the steps are shown in Figure 1:

[0094] 1.1) Obtain multi-word sequences in the language A corpus:

[0095] First, obtain the uncoded language A text corpus, and assign part-of-speech codes to the text; then linearly segment t...
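As a rough illustration of step 1.1, the sketch below collects multi-word sequences from a corpus that already carries part-of-speech codes. It assumes that "linear segmentation" means sliding a fixed-size window over each sentence (plain n-gram extraction); the source text is truncated here, so that reading is an assumption. The function name `multiword_sequences`, the length bounds, and the tag labels are all hypothetical.

```python
from collections import Counter
from typing import Iterable, List, Tuple

Token = Tuple[str, str]  # (word, part-of-speech code); the tag set is hypothetical


def multiword_sequences(
    sentences: Iterable[List[Token]],
    min_len: int = 2,
    max_len: int = 4,
) -> Counter:
    """Count every contiguous sequence of min_len..max_len tokens.

    Assumption: "linear segmentation" is read as sliding-window
    n-gram extraction; the patent's exact procedure is not shown
    in this excerpt.
    """
    counts: Counter = Counter()
    for sent in sentences:
        for n in range(min_len, max_len + 1):
            # Slide a window of n tokens across the sentence.
            for i in range(len(sent) - n + 1):
                counts[tuple(sent[i:i + n])] += 1
    return counts


corpus = [
    [("it", "PRP"), ("is", "VBZ"), ("clear", "JJ"), ("that", "IN")],
    [("it", "PRP"), ("is", "VBZ"), ("true", "JJ")],
]
counts = multiword_sequences(corpus)
```

Keeping the part-of-speech code next to each word lets a later step test whether a sequence's structure meets the sentence stem requirements without tagging the text again.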



Abstract

The invention relates to a machine translation method and device based on characteristic sentence stem extraction. Specifically, the method comprises the steps of: 1) obtaining multi-word sequences in a language A corpus, and identifying the sequences whose structures satisfy the sentence stem requirements; 2) determining characteristic sentence stems on the basis of internal adhesive force, external boundary independence and text distribution domains, and screening the characteristic sentence stems based on a MIN-MAX normalization algorithm and a local maximum duplication elimination method; 3) translating the characteristic sentence stems to obtain a characteristic sentence stem database; and 4) inputting a to-be-translated language A text, extracting sentence stems sentence by sentence, searching for sentence stem translations in the characteristic sentence stem database, translating the words and expressions other than the sentence stems, and combining those translations with the sentence stem translations according to the word order of a target language B, thereby obtaining the translation. The device comprises a characteristic sentence stem database unit, a language input unit, a sentence stem extraction unit, a sentence stem identification unit, a translation unit and a combination unit. The machine translation method and device provided by the invention offer high translation efficiency and short processing time, and have broad application prospects.
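The screening in step 2 can be sketched as follows. This is only an illustration under stated assumptions: the three measures (internal adhesive force, external boundary independence, text distribution) are taken as precomputed numbers, they are combined with equal weights after MIN-MAX normalization, and the "local maximum" filter keeps a candidate only if no one-token-shorter sub-sequence scores strictly higher and no one-token-longer super-sequence scores at least as high. The abstract does not give the weighting or the exact elimination rule, so both are assumptions, and all function names are hypothetical.

```python
from typing import Dict, Tuple

Seq = Tuple[str, ...]


def min_max(values: Dict[Seq, float]) -> Dict[Seq, float]:
    """Rescale values to [0, 1]; a constant input maps to 0.0."""
    lo, hi = min(values.values()), max(values.values())
    if hi == lo:
        return {k: 0.0 for k in values}
    return {k: (v - lo) / (hi - lo) for k, v in values.items()}


def combined_score(
    cohesion: Dict[Seq, float],
    independence: Dict[Seq, float],
    distribution: Dict[Seq, float],
) -> Dict[Seq, float]:
    """Average the three normalized measures (equal weights are an assumption)."""
    parts = [min_max(cohesion), min_max(independence), min_max(distribution)]
    return {k: sum(p[k] for p in parts) / len(parts) for k in cohesion}


def is_contiguous_sub(short: Seq, long: Seq) -> bool:
    """True if `short` occurs as a contiguous run inside `long`."""
    n = len(short)
    return any(long[i:i + n] == short for i in range(len(long) - n + 1))


def local_maxima(scores: Dict[Seq, float]) -> Dict[Seq, float]:
    """Keep candidates that are local maxima among overlapping neighbours."""
    kept: Dict[Seq, float] = {}
    for seq, s in scores.items():
        dominated = False
        for other, t in scores.items():
            if other == seq:
                continue
            shorter = len(other) == len(seq) - 1 and is_contiguous_sub(other, seq)
            longer = len(other) == len(seq) + 1 and is_contiguous_sub(seq, other)
            # A shorter neighbour must score strictly higher to win;
            # a longer neighbour wins on a tie.
            if (shorter and t > s) or (longer and t >= s):
                dominated = True
                break
        if not dominated:
            kept[seq] = s
    return kept


scores = {
    ("it", "is"): 0.4,
    ("it", "is", "clear"): 0.6,
    ("it", "is", "clear", "that"): 0.9,
    ("is", "clear", "that"): 0.5,
}
stems = local_maxima(scores)
```

With these made-up scores, only the longest candidate survives, because each shorter candidate is contained in a higher-scoring longer one. Nested duplicates of the same stem are removed this way before the database is built.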

Description

Technical field

[0001] The invention belongs to the field of machine translation and relates to a machine translation method and device based on characteristic sentence stem extraction, in particular to a machine translation method and device that extract characteristic sentence stems from a corpus.

Background technique

[0002] From early dictionary matching, to rule-based translation combining dictionaries with linguistic expert knowledge, to corpus-based statistical machine translation, machine translation technology has, with the improvement of computing power and the explosive growth of multilingual information, gradually stepped out of the ivory tower and begun to provide real-time, convenient translation services for ordinary users.

[0003] Corpus-based machine translation has become the main research direction in the field of machine translation. It is against this background that the corpus-driven translation equivalence research ...


Application Information

Patent Type & Authority: Patents (China)
IPC: IPC(8): G06F40/58
CPC: G06F40/58
Inventor: 李晶洁, 胡文杰
Owner: DONGHUA UNIV