Context-sensitive Chinese speech recognition modeling method

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A technology of speech recognition and modeling methods, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as large memory, difficulty in loading embedded devices, and large model size

Active Publication Date: 2005-08-17

PANASONIC CORP

View PDF2 Cites 7 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0006] The speech recognition system disclosed in Chinese Patent Publication CN1264468A adopts the context-dependent phoneme modeling method. Although the acoustic model established in this way has high precision, the volume of the model is relatively large, and it is difficult to directly load it into the embedded device. In memory, it is difficult to meet the actual application needs of embedded devices

[0007] The problem in the above-mentioned published patents is that the required memory is relatively large, which is not suitable for use in embedded devices

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0019] Firstly, the basic principle of speech recognition will be described below.

[0020] Speech recognition includes two basic processes, namely the training process and the recognition process. The main task of the training process is to use a large number of speech training samples to establish an acoustic model to describe the knowledge of the acoustic layer. In a complex recognition system, it is also necessary to use a large amount of text corpus to train a language model to describe language-level knowledge. In the recognition process, the acoustic model and language model obtained in the training process are used to decode the speech sample to be tested and recognize it as text. The technical innovation described in this patent mainly focuses on the acoustic model training process in the training phase.

[0021] As a language, Chinese has its own unique language characteristics. Using these characteristics for acoustic model modeling can reduce the size of the mode...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

This invention relates to context-dependent Chinese phone identifying and modeling method, which applies initial consonant right-dependent and final sound left dependent modeling method including: a, creating a context-dependent basic modeling unit by relating the initial consonant with the adjacent right final sound and relating the final sound with its adjacent left initial consonant, b, utilizing the state clustering method to train the model parameters to get an initial HMM, c, utilizing the sub-space clustering method to compress the HMM to generate a final model.

Description

technical field [0001] The invention relates to a voice recognition modeling method, in particular to a context-dependent Chinese acoustic modeling method that can be applied to embedded devices. Background technique [0002] Speech recognition technology is a technology that allows machines to convert voice signals into corresponding text or commands through the process of recognition and understanding. The combination of speech recognition technology and speech synthesis technology can enable people to get rid of the keyboard, operate through voice commands, and communicate with machines by voice. In the past two decades, with the rapid development of computer technology, speech recognition technology has made remarkable progress, and it has begun to move from the laboratory to the market. It is expected that in the next 10 years, speech recognition technology will enter various fields such as industry, home appliances, communications, automotive electronics, medical care...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L15/14

Inventor贾磊马龙

OwnerPANASONIC CORP

Context-sensitive Chinese speech recognition modeling method

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology