Context-sensitive Chinese speech recognition modeling method

A technology of speech recognition and modeling methods, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as large memory, difficulty in loading embedded devices, and large model size

Active Publication Date: 2005-08-17
PANASONIC CORP
View PDF2 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The speech recognition system disclosed in Chinese Patent Publication CN1264468A adopts the context-dependent phoneme modeling method. Although the acoustic model established in this way has high precision, the volume of the model is relatively large, and it is difficult to directly load it into the embedded device. In memory, it is difficult to meet the actual application needs of embedded devices
[0007] The problem in the above-mentioned published patents is that the required memory is relatively large, which is not suitable for use in embedded devices

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Context-sensitive Chinese speech recognition modeling method
  • Context-sensitive Chinese speech recognition modeling method
  • Context-sensitive Chinese speech recognition modeling method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0019] Firstly, the basic principle of speech recognition will be described below.

[0020] Speech recognition includes two basic processes, namely the training process and the recognition process. The main task of the training process is to use a large number of speech training samples to establish an acoustic model to describe the knowledge of the acoustic layer. In a complex recognition system, it is also necessary to use a large amount of text corpus to train a language model to describe language-level knowledge. In the recognition process, the acoustic model and language model obtained in the training process are used to decode the speech sample to be tested and recognize it as text. The technical innovation described in this patent mainly focuses on the acoustic model training process in the training phase.

[0021] As a language, Chinese has its own unique language characteristics. Using these characteristics for acoustic model modeling can reduce the size of the mode...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

This invention relates to context-dependent Chinese phone identifying and modeling method, which applies initial consonant right-dependent and final sound left dependent modeling method including: a, creating a context-dependent basic modeling unit by relating the initial consonant with the adjacent right final sound and relating the final sound with its adjacent left initial consonant, b, utilizing the state clustering method to train the model parameters to get an initial HMM, c, utilizing the sub-space clustering method to compress the HMM to generate a final model.

Description

technical field [0001] The invention relates to a voice recognition modeling method, in particular to a context-dependent Chinese acoustic modeling method that can be applied to embedded devices. Background technique [0002] Speech recognition technology is a technology that allows machines to convert voice signals into corresponding text or commands through the process of recognition and understanding. The combination of speech recognition technology and speech synthesis technology can enable people to get rid of the keyboard, operate through voice commands, and communicate with machines by voice. In the past two decades, with the rapid development of computer technology, speech recognition technology has made remarkable progress, and it has begun to move from the laboratory to the market. It is expected that in the next 10 years, speech recognition technology will enter various fields such as industry, home appliances, communications, automotive electronics, medical care...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/14
Inventor 贾磊马龙
Owner PANASONIC CORP
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products