Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Bilingual largest noun group separating-fusing translation method

A bilingual technology with maximum nouns, applied in special data processing applications, instruments, electronic digital data processing, etc., can solve the problems of MNP not being widely used and the large difference between Chinese and English languages, reduce the probability of achievement, and improve the translation accuracy rate , the effect of reducing the translation search space

Active Publication Date: 2014-07-23
BEIJING INSTITUTE OF TECHNOLOGYGY
View PDF0 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] Third, there are currently many researches on the recognition of Chinese-English MNP, but due to the relatively large language differences between Chinese and English, MNP has not been widely used in machine translation.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Bilingual largest noun group separating-fusing translation method
  • Bilingual largest noun group separating-fusing translation method
  • Bilingual largest noun group separating-fusing translation method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0029] The specific implementation manners of the present invention will be described in further detail below in conjunction with the accompanying drawings and examples.

[0030] A statistical machine translation model for bilingual maximum noun chunk separation translation, comprising the following steps:

[0031] Step 1. Identify the BMNC and its head word of the sentence to be translated. First, a machine learning method is used to train a tagging model, and the sentences to be translated are tagged with BMNC and their central words.

[0032] In bilingual sentence pairs, Chinese-English maximum noun phrases (MNP) may have incomplete translations, for example:

[0033] [Shanghai Pudong Development and Legal System Construction] synchronously [The development of Shanghai's Pudong] is in step with [the establishment of its legal system]. [Shanghai Pudong Development and Legal System Construction] is recognized as a largest noun phrase on the Chinese side, while it is split i...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a bilingual largest noun group separating-fusing translation method, and belongs to the technical field of natural language processing in computer science. The BMNCs separating-fusing translation concept is introduced to a statistic machine translation model based on a tree, and sentence translation is converted into translation of all bilingual largest noun groups in a sentence and translation of a sentence framework. According to the bilingual largest noun group separating-fusing translation method, translation difficulty is lowered, translation time is shortened, translation accuracy is improved, and the bilingual largest noun group separating-fusing translation method is particularly suitable for long-sentence translation in instant translation.

Description

technical field [0001] The invention relates to a bilingual maximum noun block separation-integration translation method, which is especially suitable for the translation of longer sentences, and belongs to the technical field of natural language processing (NLP) in computer science. Background technique [0002] A maximum noun phrase (MNP) is a noun phrase that is not contained by any other noun phrase. From the perspective of syntactic function, MNP generally appears in the position of the subject or object of the sentence. If you can identify all the MNPs in the sentence well, you can easily grasp the overall structural framework of the sentence. As an important applied basic research, the automatic recognition and analysis of MNP has important practical significance for many applied research in the field of natural language processing, including syntax analysis, information retrieval, information extraction, machine translation, etc. [0003] In the field of computer la...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06F17/28G06F17/27
Inventor 黄河燕史树敏李业刚
Owner BEIJING INSTITUTE OF TECHNOLOGYGY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products