Speech recognition system and method using noise padding and normalization in dynamic time warping
A speech recognition and noise technology, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as loss of efficiency, inaccurate end points of words, and inability to adopt
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0054] see image 3 The system of the present invention is shown. The system includes a feature extractor 50 , feature buffer 52 , voice activity detector (VAD) 54 , template database 56 , two feature transformers 58A and 58B, a comparison unit 60 and a decision unit 62 . According to the preferred embodiment of the present invention, the comparison unit 62 is a noise-adapted dynamic time warping (DTW) unit, and the system also includes a template filler 64, a wide language symbolizer 66, a noise and peak energy The estimator 68, and a gain and gain-to-noise adapter 70 will be described in detail below.
[0055] In operation, feature extractor 50 extracts features such as auto-correction coefficients or filter bank energies for each frame of the input signal and provides them to voice activity detector 54 and feature buffer 52 . The buffer 52 stores the characteristics of each frame in frame order, keeping records of these frames for a predetermined length of time. The voic...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 