A Robust Speech Recognition Method Based on Acoustic Model Array

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
An acoustic model and speech recognition technology, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as being easily covered by noise, adverse effects of model self-adaptation, and inability to provide effective effects for speech recognition, so as to achieve enhanced robustness, The effect of reducing influence and improving accuracy

Active Publication Date: 2017-11-24

HOHAI UNIV

View PDF3 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, the energy of the high-frequency part of the speech spectrum is small, and it is easily covered by noise in a noisy environment. Therefore, in a noisy test environment, the high-frequency part of the noisy speech spectrum is noise components, which not only cannot provide effective effects for speech recognition , and will adversely affect the model adaptation of the backend

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0018] Below in conjunction with specific embodiment, further illustrate the present invention, should be understood that these embodiments are only used to illustrate the present invention and are not intended to limit the scope of the present invention, after having read the present invention, those skilled in the art will understand various equivalent forms of the present invention All modifications fall within the scope defined by the appended claims of the present application.

[0019] Such as figure 1 As shown, the robust speech recognition method based on the acoustic model array includes the following steps:

[0020] 1. Training voice upper limit frequency setting:

[0021] Let the highest frequency of speech in the training speech library be f max , first convert it to the Mel frequency domain:

[0022]

[0023] Among them, F max Indicates the highest frequency in the Mel frequency domain. Then, according to F max Set the upper limit frequency of N speech spe...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a robust speech recognition method based on an acoustic model array, which includes a training phase and a testing phase. In the training phase, multiple upper frequency limits are set for the training voice according to the highest frequency of the voice, multiple sets of feature vectors are extracted, and model training is performed to obtain an array of acoustic models. In the test phase, firstly, based on a small amount of adaptive speech in the test environment, the upper limit frequency of the test speech is estimated; then the acoustic model matching the upper limit frequency of the test speech is selected from the acoustic model array, and its parameters are adjusted to obtain the test environment acoustics Finally, feature extraction is performed according to the upper limit frequency of the test speech to obtain the feature vector of the noisy test speech, and the acoustic decoding is performed with the test environment acoustic model to obtain the recognition result. The invention can improve the performance of the speech recognition system in the noise environment and improve the robustness of the system.

Description

technical field [0001] The invention belongs to the technical field of speech recognition, and specifically relates to extracting multiple sets of feature vectors in different frequency ranges according to a plurality of speech upper limit frequencies, constructing an array of acoustic models, and compensating the acoustic model matched with the upper limit frequency of the test speech to improve speech recognition. A model adaptation method for identifying system robustness. Background technique [0002] In the practical application of the speech recognition system, due to the influence of speech variability such as environmental noise, the pre-trained acoustic model often does not match the feature parameters extracted in the test environment, which will lead to a serious decline in the performance of the speech recognition system. Therefore, it is necessary to compensate the environment mismatch to improve the recognition performance of the speech recognition system. [...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityPatents(China)

IPC IPC(8): G10L15/14G10L15/20

Inventor吕勇

OwnerHOHAI UNIV

A Robust Speech Recognition Method Based on Acoustic Model Array

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology