Method and system for detecting phonetic frequency spectrum wave crest and phonetic identification

A speech recognition and wave crest technology, applied in the field of information processing, can solve the problems of increasing feature dimension, decreasing recognition performance, and not increasing too much speech feature dimension.

Inactive Publication Date: 2009-06-24
KK TOSHIBA
View PDF0 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Some peaks in the noisy speech spectrum are caused by noise, and if the noise-induced peaks are mistaken for speech, it will lead to poor recognition performance
[0010] (2) The dimensionality of speech features cannot be increased too much
At present, most of the robust front-ends using spectral peak information combine the features of purely using spectral peak information with the traditional Mel scale cepstral coefficient, so the feature dimension will increase

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for detecting phonetic frequency spectrum wave crest and phonetic identification
  • Method and system for detecting phonetic frequency spectrum wave crest and phonetic identification
  • Method and system for detecting phonetic frequency spectrum wave crest and phonetic identification

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] Various preferred embodiments of the present invention will be described in detail below with reference to the accompanying drawings.

[0027] First, the method of detecting the spectral peak of the speech of the present invention is described. The main idea of ​​the method for detecting speech spectrum peaks of the present invention is to remove the noise peaks in the speech power spectrum by using the peak distance and the peak position constraints of adjacent frames, so as to detect reliable speech spectrum peaks.

[0028] figure 1 is a flowchart of a method for detecting spectral peaks of speech according to an embodiment of the present invention. Such as figure 1 As shown, firstly, at step 105, the speech power spectrum is enhanced by speech enhancement technology. For noisy speech signals, since the spectrum difference between noise and effective speech is not large in some cases, if the speech spectrum peak is detected directly, the detection result will not b...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method and a device for detecting voice frequency spectrum wave crest as well as a voice recognition method and a system thereof. The method for detecting voice frequency spectrum wave crest comprises the following steps: detecting voice frequency spectrum wave crest candidates from the power spectrum of the voice; eliminating noise wave crests in the voice frequency spectrum wave crest candidates according to the spaces between the wave crests and adjacent wave crest positions; and detecting the voice frequency spectrum wave crest. In the invention, during the detection of the voice frequency spectrum wave crest, the limitations of the spaces between the wave crests and adjacent wave crest positions are utilized to remove the noise wave crests. Furthermore, the acquired power value of the voice frequency spectrum wave crest replaces the entire power spectrum, and is used for extracting the mel cepstrum coefficient characteristics of voice, so as to increase the noise-proof robustness of voice recognition under the circumstances that voice characteristic dimension is not increased.

Description

technical field [0001] The present invention relates to information processing technology, in particular, to the detection of the spectral peak of speech and the speech recognition technology using the spectral peak information of speech. Background technique [0002] The goal of Automatic Speech Recognition (ASR) technology is to enable computers to recognize continuous speech spoken by people. Usually, the automatic speech recognition process includes two stages of template generation and matching recognition. In the template generation stage, a template for comparison is established according to the spectral characteristics of the sample speech; in the recognition stage, when the speaker's speech is input into the computer, the computer's automatic speech recognition system extracts the features of these speech and uses This is compared with the pre-stored voice template to find the most matching and closest voice sample, so as to know the meaning of the input voice, and...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/20G10L15/02G10L15/08
CPCG10L21/0208
Inventor 赵蕤鄢翔丁沛何磊郝杰
Owner KK TOSHIBA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products