System and method for an endpoint detection of speech for improved speech recognition in noisy environment

A speech recognition and endpoint detection technology, applied in the field of speech recognition, that addresses low or no immunity to background noise, significantly reduced speech recognition accuracy, and high false rejection rates, with the aim of improving speech recognition.

Inactive Publication Date: 2008-01-24
MINDSPEED TECH INC
Cites: 15; Cited by: 124

AI Technical Summary

Problems solved by technology

Detecting when actual speech activity contained in an input speech signal begins and ends is a basic problem for all ASR systems, and it is well-recognized that proper detection is crucial for good speech recognition accuracy.
Thus, when actual speech activity is mistakenly classified by the endpointer as silence or non-speech, or when non-speech activity is erroneously grouped with speech, speech recognition accuracy is significantly diminished.
Another disadvantage of the conventional energy-based endpoint detection algorithm, such as the one utilized by endpoint detection system 100, is that it has little or no immunity to background noise.
Such errors lead to high false rejection rates, and reflect negatively on the overall performance of the ASR system.
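
The noise sensitivity described above can be illustrated with a minimal sketch of a generic fixed-threshold, energy-only endpointer. This is not the patent's endpoint detection system 100; the function names, frame length, and threshold are hypothetical choices made only for illustration.

```python
def frame_energies(samples, frame_len):
    """Mean squared amplitude of each complete, non-overlapping frame."""
    n_frames = len(samples) // frame_len
    return [
        sum(x * x for x in samples[i * frame_len:(i + 1) * frame_len]) / frame_len
        for i in range(n_frames)
    ]


def energy_endpoints(samples, frame_len, threshold):
    """Return (first, last) indices of frames whose energy exceeds the
    threshold, or None if no frame does. Energy is the only cue used."""
    active = [
        i for i, e in enumerate(frame_energies(samples, frame_len))
        if e > threshold
    ]
    return (active[0], active[-1]) if active else None


# Synthetic input (assumed values): silence, a loud "speech" burst, silence.
clean = [0.0] * 20 + [0.5, -0.5] * 10 + [0.0] * 20
# Same signal, but the first frame is replaced by a short background-noise burst.
noisy = [0.2, -0.2] * 5 + clean[10:]

print(energy_endpoints(clean, 10, 0.01))  # (2, 3)
print(energy_endpoints(noisy, 10, 0.01))  # (0, 3): the noise burst is taken as the start of speech
```

Because energy is the only feature, any non-speech sound louder than the threshold shifts the detected endpoints, which is the kind of error the passage attributes to energy-based detection.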

Embodiment Construction

[0022] The present invention may be described herein in terms of functional block components and various processing steps. It should be appreciated that such functional blocks may be realized by any number of hardware components and/or software components configured to perform the specified functions. For example, the present invention may employ various integrated circuit components, e.g., memory elements, digital signal processing elements, logic elements, and the like, which may carry out a variety of functions under the control of one or more microprocessors or other control devices. Further, it should be noted that the present invention may employ any number of conventional techniques for speech recognition, data transmission, signaling, signal processing and conditioning, tone generation and detection, and the like. Such general techniques that may be known to those skilled in the art are not described in detail herein.

[0023] It should be appreciated that the particular implem...

Abstract

According to a disclosed embodiment, an endpointer determines the background energy of a first portion of a speech signal, and a cepstral computing module extracts one or more features of the first portion. The endpointer calculates an average distance of the first portion based on the features. Subsequently, an energy computing module measures the energy of a second portion of the speech signal, and the cepstral computing module extracts one or more features of the second portion. Based on the features of the second portion, the endpointer calculates a distance of the second portion. Thereafter, the endpointer contrasts the energy of the second portion with the background energy of the first portion, and compares the distance of the second portion with the distance of the first portion. The second portion of the speech signal is classified by the endpointer as speech or non-speech based on the contrast and the comparison.
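
The flow in the abstract can be sketched roughly as follows. This is a simplified illustration under stated assumptions, not the patented implementation: the abstract does not specify the features, the distance measure, or the decision rule, so the real cepstrum (computed with a naive DFT), the Euclidean distance to the mean background feature vector, the two multiplicative thresholds, and the requirement that both tests pass are all assumptions. The class and function names are hypothetical.

```python
import cmath
import math
import random


def frame_energy(frame):
    """Mean squared amplitude of one frame."""
    return sum(x * x for x in frame) / len(frame)


def cepstral_features(frame, n_coeffs=8):
    """Low-order real cepstrum: inverse DFT of the log magnitude spectrum.
    Coefficient 0 (overall level) is skipped so loudness alone is ignored."""
    n = len(frame)
    log_mag = []
    for k in range(n):
        bin_value = sum(
            x * cmath.exp(-2j * math.pi * k * t / n) for t, x in enumerate(frame)
        )
        log_mag.append(math.log(abs(bin_value) + 1e-12))
    return [
        sum(lm * math.cos(2 * math.pi * k * q / n) for k, lm in enumerate(log_mag)) / n
        for q in range(1, n_coeffs + 1)
    ]


def distance(a, b):
    """Euclidean distance between two feature vectors (assumed measure)."""
    return math.sqrt(sum((x - y) ** 2 for x, y in zip(a, b)))


class Endpointer:
    """Learns background statistics from a first portion of the signal,
    then classifies later frames as speech or non-speech."""

    def __init__(self, background_frames, energy_factor=4.0, distance_factor=1.5):
        feats = [cepstral_features(f) for f in background_frames]
        count = len(background_frames)
        self.background_energy = sum(frame_energy(f) for f in background_frames) / count
        self.mean_features = [sum(col) / count for col in zip(*feats)]
        self.average_distance = sum(distance(f, self.mean_features) for f in feats) / count
        self.energy_factor = energy_factor      # assumed threshold multiplier
        self.distance_factor = distance_factor  # assumed threshold multiplier

    def is_speech(self, frame):
        # Contrast the frame energy with the background energy.
        energy_ok = frame_energy(frame) > self.energy_factor * self.background_energy
        # Compare the frame's cepstral distance with the background average.
        d = distance(cepstral_features(frame), self.mean_features)
        distance_ok = d > self.distance_factor * self.average_distance
        return energy_ok and distance_ok  # assumed rule: both tests must pass


rng = random.Random(0)


def noise_frame(n=128, sigma=0.01):
    return [rng.gauss(0.0, sigma) for _ in range(n)]


background = [noise_frame() for _ in range(10)]  # "first portion": noise only
endpointer = Endpointer(background)

quiet = noise_frame()  # more background noise at the same level
# A loud, spectrally shaped pulse standing in for speech (synthetic).
pulse = [0.95 ** t + rng.gauss(0.0, 0.01) for t in range(128)]

print(endpointer.is_speech(quiet))
print(endpointer.is_speech(pulse))
```

Skipping cepstral coefficient 0 is a deliberate choice in this sketch: it makes the distance test depend on spectral shape rather than loudness, so it complements the energy test instead of duplicating it.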

Description

RELATED APPLICATIONS

[0001] The present application claims the benefit of U.S. provisional application Ser. No. 60/272,956, filed Mar. 2, 2001, which is hereby fully incorporated by reference in the present application.

BACKGROUND OF THE INVENTION

[0002] 1. Field of the Invention

[0003] The present invention relates generally to the field of speech recognition and, more particularly, speech recognition in noisy environments.

[0004] 2. Related Art

[0005] Automatic speech recognition (“ASR”) refers to the ability to convert speech signals into words, or put another way, the ability of a machine to recognize human voice. ASR systems are generally categorized into three types: speaker-independent ASR, speaker-dependent ASR and speaker-verification ASR. Speaker-independent ASR can recognize a group of words from any speaker and allow any speaker to use the available vocabularies after having been trained for a standard vocabulary. Speaker-dependent ASR, on the other hand, can identify a v...

Application Information

Patent Type & Authority: Applications (United States)
IPC: IPC(8): G10L15/00
CPC: G10L25/87
Inventor: BOU-GHAZALE, SAHAR E.; ASADI, AYAMAN O.; ASSALEH, KHALED
Owner: MINDSPEED TECH INC