Method and system for converting Chinese to Chinese pinyin containing polyphonic characters
A technology of Chinese pinyin and polyphonic characters, applied in the field of language translation, can solve the problems of low accuracy and low efficiency, and achieve the effect of improving accuracy and efficiency
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0050] Such as figure 1 Shown, the Chinese of the present embodiment containing polyphonic character converts the Chinese pinyin method, comprises the following steps:
[0051] Step 110, store polyphonic characters and their pinyin in the first lexicon, store phrases with polyphonic characters and their pinyin in the second lexicon, and store common Chinese characters and pinyin of non-polyphonic characters in the third lexicon.
[0052] Since polyphonic characters have multiple pronunciations, polyphonic characters and pinyin in the first lexicon have a one-to-many structure. And the pronunciation of polyphonic characters in phrases (comprising words, idioms, slang and commonly used expressions, etc.) is generally fixed, so polyphonic characters and pinyin in the second lexicon are one-to-one structures. And the first lexicon also includes the pinyin weights of the different pronunciations of each polyphonic character. The initial pinyin weights can be preset according to th...
Embodiment 2
[0087] In the present embodiment, the Chinese conversion Chinese pinyin system that contains polyphonic characters comprises: data storage unit 1, word segmentation unit 2, first judging unit 3, second judging unit 4, the 3rd judging unit 5, the first translation unit 6, the 3rd judging unit Two translation unit 7 and result output unit 8 . Wherein, the data storage unit includes a first lexicon, a second lexicon and a third lexicon, the first lexicon stores polyphonic characters and their pinyin, the second lexicon stores phrases with polyphonic characters and their pinyin, and the second lexicon stores polyphonic characters and their pinyin. What3thesaurus stores non-polyphonic characters and their pinyin.
[0088] When the user needs a translation, enter a string. The word segmentation unit splits the read character string into several word units, specifically, splitting based on the word segmentation method of word meaning. The first judging unit judges whether the numbe...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 

