Reverse Chinese-English transliteration method and device thereof

A reverse, Hanyu Pinyin technology, applied in special data processing applications, instruments, electrical digital data processing, etc., can solve the problems of wrong syllable selection, syllable limiting factors are not obvious, and achieve the effect of improving the accuracy.

Inactive Publication Date: 2009-12-02
INST OF AUTOMATION CHINESE ACAD OF SCI
View PDF0 Cites 22 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Second, in the statistical transliteration model, the selection of syllables is carried out according to the pronunciation, and the limiting factors between syllables are not obvious, so selection bias is prone to occur
For example, in English, the syllable "c" and the syllable "k" have similar pronunciation rules, and it is easy to make mistakes in syllable selection when reverse transliterating "Clinton / Clinton"

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Reverse Chinese-English transliteration method and device thereof
  • Reverse Chinese-English transliteration method and device thereof
  • Reverse Chinese-English transliteration method and device thereof

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0025] The method of the present invention will be described in further detail below in conjunction with the accompanying drawings and specific embodiments. It should be noted that the described embodiments are only intended to facilitate the understanding of the present invention, but do not have any limiting effect on it.

[0026] In order to solve the two difficult problems existing in Chinese-English reverse transliteration, the present invention uses network resources to verify the results of the statistical transliteration module 1 or directly extract target translations from web pages.

[0027] like figure 1 As shown in the flow chart of the Chinese-English reverse transliteration aided by network mining in the present invention, the premise of using the method for assisting Chinese-English reverse transliteration by network mining in the present invention is that an effective query can be constructed first, and secondly, the query can be mined to find Chinese-English b...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a reverse Chinese-English transliteration method and a device thereof. A Chinese transliteration name to be translated is converted into a pinyin sequence, and a statistics transliteration module is used for generating transliteration candidates; the transliteration candidates are revised into real English words by a revision module, and the revision module uses real English words collected from a great quantity of webpages to form a vocabulary; revised real English words are used to be inquiry to verify translation results, and webpage resources obtained with a search engine are used to rearrange the revised transliteration candidates; words which appear as a named entity role on the webpage are given high marks so as to filter common English words. The method can overcome the problems that a statistic model loses aphonic syllables or chooses wrong same pronouncing syllables in the process of transliteration and the like, and effectively improves precision rate of transliteration. Experiences prove that the precision rate of transliteration is improved by 17.55% in open beta.

Description

technical field [0001] The invention relates to the technical field of natural language processing, and relates to a method and a device for assisting Chinese-English reverse transliteration by means of network mining. Background technique [0002] Named entities include seven categories such as person names, place names, and institution names. Named entities convey important information in human language, and its recognition and translation are one of the key technologies in natural language processing research. In multilingual processing, the recognition and translation results of named entities directly affect the understanding of natural language. Transliteration refers to maintaining the approximation of pronunciation during translation from the source language to the target language. The vast majority of people's names are translated by transliteration, and transliteration is also an important part of place name translation and organization name translation. Therefo...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F17/28G06F17/30
Inventor 赵军杨帆邹波
Owner INST OF AUTOMATION CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products