Robust voice recognition method based on acoustic model array

An acoustic model and speech recognition technology, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as being easily covered by noise, adversely affected by model adaptation, and unable to provide effective speech recognition, achieving enhanced robustness, The effect of reducing impact and improving accuracy

Active Publication Date: 2015-03-04
HOHAI UNIV
View PDF3 Cites 24 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the energy of the high-frequency part of the speech spectrum is small, and it is easily covered by noise in a noisy environment. Therefore, in a noisy test environment, the high-frequency part of the noisy speech spectrum is noise components, which not only cannot provide effective effects for speech recognition , and will adversely affect the model adaptation of the backend

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Robust voice recognition method based on acoustic model array
  • Robust voice recognition method based on acoustic model array
  • Robust voice recognition method based on acoustic model array

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] Below in conjunction with specific embodiment, further illustrate the present invention, should be understood that these embodiments are only used to illustrate the present invention and are not intended to limit the scope of the present invention, after having read the present invention, those skilled in the art will understand various equivalent forms of the present invention All modifications fall within the scope defined by the appended claims of the present application.

[0019] like figure 1 As shown, the robust speech recognition method based on the acoustic model array includes the following steps:

[0020] 1. Training voice upper limit frequency setting:

[0021] Let the highest frequency of speech in the training speech library be f max , first convert it to the Mel frequency domain:

[0022] F max = 2595 log 10 ( 1 + ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a robust voice recognition method based on an acoustic model array. The robust voice recognition method comprises a training phase and a testing phase. At the training phase, a plurality of upper limiting frequencies are set for training voice according to the highest frequency of the voice, a plurality of groups of characteristic vectors are extracted and model training is performed to obtain the acoustic model array. At the testing phase, firstly, the upper limiting frequency of test voice is estimated according to a small quantity of self-adaptive voice in the testing environment; secondly, an acoustic model matched with the upper limiting frequency of the test voice is selected from the acoustic model array, and the parameters of the acoustic model are adjusted to obtain a testing environment acoustic model; finally, characteristic extraction is performed according to the upper limiting frequency of the test voice so as to obtain a characteristic vector of the noise-containing test voice, and acoustic decoding is performed on the characteristic vector by use of the testing environment acoustic model to obtain an identification result. The robust voice recognition method based on the acoustic model array is capable of improving the performance of a voice recognition system in a noise environment and improving the robustness of the system.

Description

technical field [0001] The invention belongs to the technical field of speech recognition, and specifically relates to extracting multiple sets of feature vectors in different frequency ranges according to a plurality of speech upper limit frequencies, constructing an array of acoustic models, and compensating the acoustic model matched with the upper limit frequency of the test speech to improve speech recognition. A model adaptation method for identifying system robustness. Background technique [0002] In the practical application of the speech recognition system, due to the influence of speech variability such as environmental noise, the pre-trained acoustic model often does not match the feature parameters extracted in the test environment, which will lead to a serious decline in the performance of the speech recognition system. Therefore, it is necessary to compensate the environment mismatch to improve the recognition performance of the speech recognition system. [...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/14G10L15/20
Inventor 吕勇
Owner HOHAI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products