Method and system for detecting phonetic frequency spectrum wave crest and phonetic identification

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A speech recognition and wave crest technology, applied in the field of information processing, can solve the problems of increasing feature dimension, decreasing recognition performance, and not increasing too much speech feature dimension.

Inactive Publication Date: 2009-06-24

KK TOSHIBA

View PDF0 Cites 14 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Some peaks in the noisy speech spectrum are caused by noise, and if the noise-induced peaks are mistaken for speech, it will lead to poor recognition performance

[0010] (2) The dimensionality of speech features cannot be increased too much

At present, most of the robust front-ends using spectral peak information combine the features of purely using spectral peak information with the traditional Mel scale cepstral coefficient, so the feature dimension will increase

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0026] Various preferred embodiments of the present invention will be described in detail below with reference to the accompanying drawings.

[0027] First, the method of detecting the spectral peak of the speech of the present invention is described. The main idea of the method for detecting speech spectrum peaks of the present invention is to remove the noise peaks in the speech power spectrum by using the peak distance and the peak position constraints of adjacent frames, so as to detect reliable speech spectrum peaks.

[0028] figure 1 is a flowchart of a method for detecting spectral peaks of speech according to an embodiment of the present invention. Such as figure 1 As shown, firstly, at step 105, the speech power spectrum is enhanced by speech enhancement technology. For noisy speech signals, since the spectrum difference between noise and effective speech is not large in some cases, if the speech spectrum peak is detected directly, the detection result will not b...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a method and a device for detecting voice frequency spectrum wave crest as well as a voice recognition method and a system thereof. The method for detecting voice frequency spectrum wave crest comprises the following steps: detecting voice frequency spectrum wave crest candidates from the power spectrum of the voice; eliminating noise wave crests in the voice frequency spectrum wave crest candidates according to the spaces between the wave crests and adjacent wave crest positions; and detecting the voice frequency spectrum wave crest. In the invention, during the detection of the voice frequency spectrum wave crest, the limitations of the spaces between the wave crests and adjacent wave crest positions are utilized to remove the noise wave crests. Furthermore, the acquired power value of the voice frequency spectrum wave crest replaces the entire power spectrum, and is used for extracting the mel cepstrum coefficient characteristics of voice, so as to increase the noise-proof robustness of voice recognition under the circumstances that voice characteristic dimension is not increased.

Description

technical field [0001] The present invention relates to information processing technology, in particular, to the detection of the spectral peak of speech and the speech recognition technology using the spectral peak information of speech. Background technique [0002] The goal of Automatic Speech Recognition (ASR) technology is to enable computers to recognize continuous speech spoken by people. Usually, the automatic speech recognition process includes two stages of template generation and matching recognition. In the template generation stage, a template for comparison is established according to the spectral characteristics of the sample speech; in the recognition stage, when the speaker's speech is input into the computer, the computer's automatic speech recognition system extracts the features of these speech and uses This is compared with the pre-stored voice template to find the most matching and closest voice sample, so as to know the meaning of the input voice, and...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L15/20G10L15/02G10L15/08

CPCG10L21/0208

Inventor赵蕤鄢翔丁沛何磊郝杰

OwnerKK TOSHIBA

Method and system for detecting phonetic frequency spectrum wave crest and phonetic identification

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology