Deep belief network for large vocabulary continuous speech recognition

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A deep, video signal technology, used in speech recognition, speech analysis, biological neural network models, etc., can solve problems such as not making good use of hidden layers

Active Publication Date: 2012-04-11

MICROSOFT TECH LICENSING LLC

View PDF2 Cites 29 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

For example, training a feed-forward ANN using only backpropagation does not make good use of more than two hidden layers

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0019] Various techniques related to automatic speech recognition (ASR) systems will now be described with reference to the drawings, wherein like reference numerals refer to like elements throughout. Additionally, several functional block diagrams of various example systems are shown and described herein for purposes of explanation; however, it will be appreciated that functions described as being performed by particular system components may be performed by multiple components. Similarly, for example, a component may be configured to perform functions described as being performed by multiple components, and some steps in methods described herein may be omitted, reordered, or combined.

[0020] refer to figure 1 , illustrates an example system 100 that facilitates performing ASR. System 100 includes speech recognition system 102 that receives samples 104 . The samples may be words spoken (eg captured by utilizing a microphone) from an individual over a certain amount of tim...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

A method is disclosed herein that includes an act of causing a processor to receive a sample, wherein the sample is one of spoken utterance, an online handwriting sample, or a moving image sample. The method also comprises the act of causing the processor to decode the sample based at least in part upon an output of a combination of a deep structure and a context-dependent Hidden Markov Model (HMM), wherein the deep structure is configured to output a posterior probability of a context-dependent unit. The deep structure is a Deep Belief Network consisting of many layers of nonlinear units with connecting weights between layers trained by a pretraining step followed by a fine-tuning step.

Description

technical field [0001] The present invention relates to deep trust networks for large vocabulary continuous speech recognition. Background technique [0002] Speech recognition has been the subject of extensive research and commercial development. For example, speech recognition systems have been incorporated into mobile phones, desktop computers, automobiles, etc. to provide specific responses to speech input provided by users. For example, in a mobile phone equipped with voice recognition technology, a user can speak the name of a contact listed on the mobile phone, and the mobile phone can initiate a call to that contact. In addition, many companies are currently using speech recognition technology to assist customers in identifying company employees, identifying problems with products or services, and the like. [0003] However, even after decades of research, the performance of automatic speech recognition (ASR) systems in real-world usage scenarios is still far from ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/14G10L15/16

CPCG10L15/14G06N3/084G06N3/047G06N3/044G06N3/045

InventorL·邓D·俞G·E·达尔

OwnerMICROSOFT TECH LICENSING LLC

Deep belief network for large vocabulary continuous speech recognition

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology