Automatic speech recognition channel normalization
A normalization, speech technology, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as uncertainty limitation, and achieve the effect of improving robustness, reducing the volume of speech, and reducing system delay
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0027] A processing system for automatic speech recognition channel normalization includes offline processing and online processing to generate normalization parameters. The system is configured to utilize observations about properties of communication channels. For example, the following observations about speakers and parts of the communication channel - including room, microphone and ambient noise - can be made:
[0028] • The long-term spectrum of a speaker can be characterized mainly by two parameters: the overall loudness and the spectral tilt which describes the overall slope of the spectrum. The spectral tilt is a direct result of the ratio of the time the glottis remains open versus closed during each pitch period. The spectral tilt is typically -12dB / octave, although the ratio varies slightly across speakers and their vocal effort (normal, shouting). In the cepstral domain, overall loudness is captured by 0-order cepstral coefficients, while spectral tilt is captur...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 