Realization method and realization system for simultaneously identifying bilingual terms and word alignment

A realization method and word alignment technology, applied in natural language translation, special data processing applications, instruments, etc., that improves the performance of bilingual term recognition and word alignment and, in turn, the quality of machine translation output.

Inactive Publication Date: 2017-05-10
INST OF AUTOMATION CHINESE ACAD OF SCI
4 Cites, 5 Cited by

AI Technical Summary

Problems solved by technology

[0005] To solve the above-mentioned problems in the prior art, namely the performance problems of automatic term recognition and word alignment, and to improve the quality of the final machine translation output, the present invention provides an implementation method for simultaneously recognizing bilingual terms and performing word alignment.

Examples

Embodiment Construction

[0065] Preferred embodiments of the present invention are described below with reference to the accompanying drawings. Those skilled in the art should understand that these embodiments are only used to explain the technical principles of the present invention, and are not intended to limit the protection scope of the present invention.

[0066] As shown in Figure 2, the implementation method of the present invention for simultaneously recognizing bilingual terms and word alignment comprises:

[0067] Step 100: Perform word segmentation on a pair consisting of a source language sentence and a target language sentence to obtain a source language word group and a target language word group;

[0068] Step 200: Perform word alignment on the source language word group and the target language word group to obtain initial aligned words from the source language sentence to the target language sentence;

[0069] Step 300: Identify terms in the source language sentence and in the target language sentence separately to obtain an initi...
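
Steps 100 and 200 can be illustrated with a minimal sketch. It is not the patented implementation: the whitespace-based `segment` function, the `TOY_LEXICON` dictionary, the lexicon-lookup aligner, and the Chinese-English example pair are all assumptions made for illustration. A real system would use a proper word segmenter and a statistical aligner trained on parallel data.

```python
from typing import Dict, List, Set, Tuple

# Hypothetical toy lexicon (assumption): source word -> acceptable target words.
TOY_LEXICON: Dict[str, Set[str]] = {
    "机器": {"machine"},
    "翻译": {"translation"},
    "系统": {"system"},
}


def segment(sentence: str) -> List[str]:
    """Stand-in for Step 100: split on whitespace (assumes pre-segmented input)."""
    return sentence.split()


def initial_alignment(
    src: List[str], tgt: List[str], lexicon: Dict[str, Set[str]]
) -> List[Tuple[int, int]]:
    """Stand-in for Step 200: link (i, j) when the lexicon lists tgt[j] for src[i]."""
    links = []
    for i, s in enumerate(src):
        for j, t in enumerate(tgt):
            if t.lower() in lexicon.get(s, set()):
                links.append((i, j))
    return links


src_words = segment("机器 翻译 系统")
tgt_words = segment("machine translation system")
links = initial_alignment(src_words, tgt_words, TOY_LEXICON)
print(links)  # [(0, 0), (1, 1), (2, 2)]
```

Each link is a pair of token positions, so later stages can reason about spans of tokens without re-reading the sentences.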

Abstract

The invention relates to an implementation method and an implementation system for simultaneously identifying bilingual terms and word alignment. The method comprises: performing word segmentation on a pair consisting of a source language sentence and a target language sentence to obtain a source language word group and a target language word group; performing word alignment on the two word groups to obtain initial aligned words; identifying terms in the source language sentence and the target language sentence to obtain initial monolingual terms; performing term alignment by combining the initial aligned words with the initial monolingual terms to obtain initial aligned terms; taking the initial aligned terms as anchor points to obtain a primary bilingual term candidate list; performing bilingual term identification on the primary candidate list to obtain a secondary bilingual term candidate list; and performing a second round of bilingual term identification and word alignment on the secondary candidate list to obtain the final bilingual terms and final aligned words. The method improves the performance of automatic term identification and word alignment, and thereby the quality of the final machine-translated text.
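
The term alignment step, which combines word links with monolingual term spans, might look like the following sketch. The abstract does not specify the pairing criterion, so the rule used here (at least one link inside both spans and no link crossing the span boundary, as in standard phrase-pair extraction) is an assumption, as are the `Span` type and the `align_terms` function name.

```python
from typing import List, Tuple

Span = Tuple[int, int]  # half-open token span [start, end)


def align_terms(
    src_terms: List[Span], tgt_terms: List[Span], links: List[Tuple[int, int]]
) -> List[Tuple[Span, Span]]:
    """Pair a source term with a target term when some word link lies inside
    both spans and no word link connects one span to a word outside the other."""
    pairs = []
    for s in src_terms:
        for t in tgt_terms:
            inside = [
                (i, j) for i, j in links if s[0] <= i < s[1] and t[0] <= j < t[1]
            ]
            crossing = [
                (i, j)
                for i, j in links
                if (s[0] <= i < s[1]) != (t[0] <= j < t[1])
            ]
            if inside and not crossing:
                pairs.append((s, t))
    return pairs


# Hypothetical inputs: four word links, one source term, two target terms.
word_links = [(0, 0), (1, 1), (2, 3), (3, 2)]
source_terms = [(0, 2)]
target_terms = [(0, 2), (2, 4)]
aligned = align_terms(source_terms, target_terms, word_links)
print(aligned)  # [((0, 2), (0, 2))]
```

Pairs found this way could serve as the anchor points from which candidate bilingual terms are generated, though how the method expands anchors into candidates is not described in the abstract.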

Description

Technical field

[0001] The present invention relates to the technical field of natural language processing, and more specifically to an implementation method and implementation system for simultaneously recognizing bilingual terms and performing word alignment.

Background art

[0002] Machine translation is the use of computers to convert text from one language into another. The language being translated from is usually called the source language, and the language being translated into is called the target language. Word alignment is a core task of statistical machine translation. It discovers mutually translated language fragments in bilingual parallel corpora and is the main source of translation knowledge. In short, word alignment determines which word in the target language sentence each word in the source language sentence is translated into. As shown in Figure 1, a word can be translated into one o...
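
A word alignment of this kind is commonly represented as a set of (source index, target index) pairs. The sketch below shows that representation under stated assumptions: the example sentence pair, the link set, and the `targets_by_source` helper are all hypothetical and do not come from the patent.

```python
from collections import defaultdict
from typing import Dict, List, Set, Tuple


def targets_by_source(links: Set[Tuple[int, int]]) -> Dict[int, List[int]]:
    """Group alignment links by source position, keeping target positions sorted."""
    grouped = defaultdict(list)
    for i, j in sorted(links):
        grouped[i].append(j)
    return dict(grouped)


# Hypothetical example: one source word maps to two target words,
# and the target word "is" has no aligned source word.
source = ["机器翻译", "很", "有用"]
target = ["machine", "translation", "is", "very", "useful"]
alignment = {(0, 0), (0, 1), (1, 3), (2, 4)}

mapping = targets_by_source(alignment)
for i, js in mapping.items():
    print(source[i], "->", [target[j] for j in js])
```

Storing positions rather than word strings keeps the links unambiguous when the same word occurs more than once in a sentence.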

Application Information

IPC(8): G06F17/28
CPC: G06F40/44, G06F40/45, G06F2216/03
Inventor: 张家俊, 黄国平, 周玉, 宗成庆
Owner: INST OF AUTOMATION CHINESE ACAD OF SCI