Robust voice recognition method based on acoustic model array

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
An acoustic model and speech recognition technology, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as being easily covered by noise, adversely affected by model adaptation, and unable to provide effective speech recognition, achieving enhanced robustness, The effect of reducing impact and improving accuracy

Active Publication Date: 2015-03-04

HOHAI UNIV

View PDF3 Cites 24 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, the energy of the high-frequency part of the speech spectrum is small, and it is easily covered by noise in a noisy environment. Therefore, in a noisy test environment, the high-frequency part of the noisy speech spectrum is noise components, which not only cannot provide effective effects for speech recognition , and will adversely affect the model adaptation of the backend

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0018] Below in conjunction with specific embodiment, further illustrate the present invention, should be understood that these embodiments are only used to illustrate the present invention and are not intended to limit the scope of the present invention, after having read the present invention, those skilled in the art will understand various equivalent forms of the present invention All modifications fall within the scope defined by the appended claims of the present application.

[0019] like figure 1 As shown, the robust speech recognition method based on the acoustic model array includes the following steps:

[0020] 1. Training voice upper limit frequency setting:

[0021] Let the highest frequency of speech in the training speech library be f max , first convert it to the Mel frequency domain:

[0022] F max = 2595 log 10 ( 1 + ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a robust voice recognition method based on an acoustic model array. The robust voice recognition method comprises a training phase and a testing phase. At the training phase, a plurality of upper limiting frequencies are set for training voice according to the highest frequency of the voice, a plurality of groups of characteristic vectors are extracted and model training is performed to obtain the acoustic model array. At the testing phase, firstly, the upper limiting frequency of test voice is estimated according to a small quantity of self-adaptive voice in the testing environment; secondly, an acoustic model matched with the upper limiting frequency of the test voice is selected from the acoustic model array, and the parameters of the acoustic model are adjusted to obtain a testing environment acoustic model; finally, characteristic extraction is performed according to the upper limiting frequency of the test voice so as to obtain a characteristic vector of the noise-containing test voice, and acoustic decoding is performed on the characteristic vector by use of the testing environment acoustic model to obtain an identification result. The robust voice recognition method based on the acoustic model array is capable of improving the performance of a voice recognition system in a noise environment and improving the robustness of the system.

Description

technical field [0001] The invention belongs to the technical field of speech recognition, and specifically relates to extracting multiple sets of feature vectors in different frequency ranges according to a plurality of speech upper limit frequencies, constructing an array of acoustic models, and compensating the acoustic model matched with the upper limit frequency of the test speech to improve speech recognition. A model adaptation method for identifying system robustness. Background technique [0002] In the practical application of the speech recognition system, due to the influence of speech variability such as environmental noise, the pre-trained acoustic model often does not match the feature parameters extracted in the test environment, which will lead to a serious decline in the performance of the speech recognition system. Therefore, it is necessary to compensate the environment mismatch to improve the recognition performance of the speech recognition system. [...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L15/14G10L15/20

Inventor吕勇

OwnerHOHAI UNIV

Robust voice recognition method based on acoustic model array

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology