Chinese-English bilingual speech recognition method based on phoneme confusion

A speech recognition, Chinese and English technology, applied in speech recognition, speech analysis, instruments, etc., can solve unreliable problems and achieve the effect of model scale reduction

Active Publication Date: 2009-06-03
INST OF ACOUSTICS CHINESE ACAD OF SCI +1
View PDF0 Cites 38 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In bilingual recognition, the main problem is that the speaker will intersperse the second language in the process of speaking the mother tongue, and the inserted second language has the pronunciation characteristics of the speaker's mother tongue (nonnative)
The main problem here is that the log-likelihood criterion is a clustering criterion based on the observation probability of the same speech feature vector sequence under different phoneme Gaussian models, but in fact only the observation probability under the Gaussian model determines whether the similarity between two phonemes is correct. reliable

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Chinese-English bilingual speech recognition method based on phoneme confusion
  • Chinese-English bilingual speech recognition method based on phoneme confusion
  • Chinese-English bilingual speech recognition method based on phoneme confusion

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] figure 1 It is the basic principle block diagram of the Chinese-English bilingual recognition system based on the two-pass phoneme clustering algorithm TCM. It describes the core components of the phoneme clustering algorithm. It is mainly composed of several parts: Chinese / English voice, Chinese / English model, mandatory Alignment, decoding, phoneme confusion matrix generation, and Chinese-English mixed model training. figure 2 It is a flow chart of the specific implementation of the Chinese-English bilingual recognition system based on the two-pass phoneme clustering algorithm TCM.

[0042] Combine below figure 1 as well as figure 2 The specific embodiment of the present invention is described in further detail:

[0043] The core technology of the Chinese-English bilingual recognition system based on the two-pass phoneme clustering algorithm TCM involved in the present invention lies in the two-pass phoneme clustering algorithm TCM (modules 1 to 11). TCM is a n...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a Chinese-English bilingual speech recognition method based on phoneme confusion. The method comprises the following steps: unifying a Chinese-English phoneme set by adopting a twice phoneme clustering method, obtaining a Chinese-English mixed acoustic model by retraining, correcting corresponding bilingual pronunciation dictionaries, and realizing Chinese-English bilingual recognition by a decoder based on the preceding steps. The Chinese recognition rate of a Chinese-English bilingual speech recognition system based on TCM phoneme confusion can be comparable to that of an independent Chinese speech recognition system; on the premise that English data with Chinese accent is unavailable and standard Chinese-English training data are used only, compared with the relatively independent English speech recognition system, recognition of English fragments with the Chinese accent is obviously improved; meanwhile, the Chinese-English bilingual speech recognition system based on the TCM phoneme confusion also has a better recognition performance than the existing common bilingual recognition system which performs phoneme clustering by virtue of a logarithm likelihood criterion, and has very high practicability.

Description

technical field [0001] The present invention relates to a bilingual speech recognition method, more specifically, the present invention relates to a Chinese-English bilingual recognition method based on a two-pass phone clustering method based on Confusion Matrix (TCM: Two-pass phone clustering method based on Confusion Matrix). Background technique [0002] With the globalization of information in modern society, bilingual and multilingual communication has become more and more common, which brings new challenges to speech recognition technology. In bilingual recognition, the main problem is that the speaker will intersperse the second language in the process of speaking the mother tongue, and the inserted second language has the pronunciation characteristics of the speaker's mother tongue (nonnative). How to achieve and improve the recognition of the second language with the pronunciation characteristics of the mother tongue under the premise of ensuring the speech recogni...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/06G10L15/00
Inventor 颜永红张晴晴潘接林
Owner INST OF ACOUSTICS CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products