Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A Robust Speech Recognition Method Based on Acoustic Model Array

An acoustic model and speech recognition technology, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as being easily covered by noise, adverse effects of model self-adaptation, and inability to provide effective effects for speech recognition, so as to achieve enhanced robustness, The effect of reducing influence and improving accuracy

Active Publication Date: 2017-11-24
HOHAI UNIV
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the energy of the high-frequency part of the speech spectrum is small, and it is easily covered by noise in a noisy environment. Therefore, in a noisy test environment, the high-frequency part of the noisy speech spectrum is noise components, which not only cannot provide effective effects for speech recognition , and will adversely affect the model adaptation of the backend

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A Robust Speech Recognition Method Based on Acoustic Model Array
  • A Robust Speech Recognition Method Based on Acoustic Model Array
  • A Robust Speech Recognition Method Based on Acoustic Model Array

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] Below in conjunction with specific embodiment, further illustrate the present invention, should be understood that these embodiments are only used to illustrate the present invention and are not intended to limit the scope of the present invention, after having read the present invention, those skilled in the art will understand various equivalent forms of the present invention All modifications fall within the scope defined by the appended claims of the present application.

[0019] Such as figure 1 As shown, the robust speech recognition method based on the acoustic model array includes the following steps:

[0020] 1. Training voice upper limit frequency setting:

[0021] Let the highest frequency of speech in the training speech library be f max , first convert it to the Mel frequency domain:

[0022]

[0023] Among them, F max Indicates the highest frequency in the Mel frequency domain. Then, according to F max Set the upper limit frequency of N speech spe...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a robust speech recognition method based on an acoustic model array, which includes a training phase and a testing phase. In the training phase, multiple upper frequency limits are set for the training voice according to the highest frequency of the voice, multiple sets of feature vectors are extracted, and model training is performed to obtain an array of acoustic models. In the test phase, firstly, based on a small amount of adaptive speech in the test environment, the upper limit frequency of the test speech is estimated; then the acoustic model matching the upper limit frequency of the test speech is selected from the acoustic model array, and its parameters are adjusted to obtain the test environment acoustics Finally, feature extraction is performed according to the upper limit frequency of the test speech to obtain the feature vector of the noisy test speech, and the acoustic decoding is performed with the test environment acoustic model to obtain the recognition result. The invention can improve the performance of the speech recognition system in the noise environment and improve the robustness of the system.

Description

technical field [0001] The invention belongs to the technical field of speech recognition, and specifically relates to extracting multiple sets of feature vectors in different frequency ranges according to a plurality of speech upper limit frequencies, constructing an array of acoustic models, and compensating the acoustic model matched with the upper limit frequency of the test speech to improve speech recognition. A model adaptation method for identifying system robustness. Background technique [0002] In the practical application of the speech recognition system, due to the influence of speech variability such as environmental noise, the pre-trained acoustic model often does not match the feature parameters extracted in the test environment, which will lead to a serious decline in the performance of the speech recognition system. Therefore, it is necessary to compensate the environment mismatch to improve the recognition performance of the speech recognition system. [...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/14G10L15/20
Inventor 吕勇
Owner HOHAI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products