Methods and apparatus for automatic speech recognition

A technology of automatic speech recognition and grammar, which is applied in speech recognition, speech analysis, instruments, etc., and can solve problems such as caller annoyance

Inactive Publication Date: 2006-06-07
NUANCE COMM INC
View PDF0 Cites 51 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Having to repeat their...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Methods and apparatus for automatic speech recognition
  • Methods and apparatus for automatic speech recognition
  • Methods and apparatus for automatic speech recognition

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] Overview of Automatic Speech Recognition

[0028] Such as figure 1 The illustrated automatic speech recognition (ASR) system includes an input device 100, such as a conventional microphone or a telephone handset, an audio front end (AFE) 101 that receives input from the input device, a speech recognition engine 102 that receives input from the AFE, and connected Speech-response application 103 to the speech recognition engine. The application 103 defines a set of logical steps to be executed as part of the interaction between the user and the ASR system. The application 103 typically recognizes what input the user requires through user prompts. User prompts can be text strings displayed on the screen or audio clips played to the user. Speech-response applications use the results of the speech recognition engine to perform actions based on the input.

[0029] As a brief illustration, the following description refers to a possible account balance inquiry application. ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

An automatic speech recognition (ASR) system that includes a speech-response application and a speech recognition engine. The ASR system generates user prompts to elicit certain spoken input, and when the spoken input is recognized, the voice-response application performs an action. The recognition engine compares sounds in the input audio signal to phonemes in the acoustic model to identify candidate matching phonemes. A recognition confidence score is calculated for each candidate matching phoneme, and the confidence score is used to aid in the recognition of one or more possible matching phoneme sequences that appear to match the word in the speech-response application's grammar. A confidence score for each phoneme is evaluated against predefined confidence score criteria (eg, a discrimination score below a "low confidence" threshold), and the results of the evaluation are used to influence the selection of subsequent user prompts. One such system uses confidence scores to select cues for object recognition training - encouraging input to be recognized as phonemes with low confidence recognition scores. Another system selects prompts to block input of sounds that are not easily recognizable.

Description

technical field [0001] The present invention provides methods and apparatus for automatic speech recognition. Background technique [0002] An automatic speech recognition (ASR) system takes an audio signal as input and compares the input signal to known sounds (phonemes) and sound sequences (trajectories), usually from an acoustic model (AM), to identify words that appear to match spoken sequences of sounds . After one or more words corresponding to the input audio signal are recognized, the text or other machine-readable form of the recognized matching words is returned by the ASR to an application such as an interactive voice response (IVR) phone application. A confidence score may be returned with each apparently matching word, based on how closely the incoming speech segment fits the average probability distribution associated with the phonemes in the ASR system's acoustic model. Multiple possible words and their respective confidence scores can be returned for select...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L15/28G10L15/00G10L15/08
CPCG10L2015/085G10L2015/025G10L15/08
Inventor B·J·皮克林T·D·波尔特尼B·T·斯塔尼福德M·惠特伯恩
Owner NUANCE COMM INC
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products