Speech recognition method and device

A speech recognition method and device, applied in speech recognition, speech analysis, instruments, etc., that addresses the problems of abrupt changes in context information and limited recognition performance, and improves the speech recognition effect.

Active Publication Date: 2014-03-26
BEIJING BAIDU NETCOM SCI & TECH CO LTD

AI Technical Summary

Problems solved by technology

[0008] However, in practical applications, a pause in speech affects the acoustic pronunciation near it, and the longer the pause, the greater the effect. In addition, the context

Embodiment Construction

[0036] In order to make the objectives, technical solutions and advantages of the present invention clearer, the present invention will be described in detail below with reference to the accompanying drawings and specific embodiments.

[0037] Speech recognition depends on a trained decoding network. That is, speech recognition includes at least two processes: the first is training the decoding network, and the second is recognizing speech based on the decoding network. Recognizing the speech to be recognized involves a query of the acoustic model and a query of the language model. The acoustic model query searches the decoding network built on the acoustic model (the acoustic model used in the embodiments of the present invention includes HMM and sil models) to obtain the HMM state jump sequence of the speech to be recognized; the query of the...
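As an illustration of how an HMM state jump sequence can be obtained for an utterance, the following is a minimal Viterbi sketch. It is not the patent's decoder: the two states (`speech`, `sil`), the two observation symbols (`loud`, `quiet`), and all probabilities are hypothetical values chosen only to keep the example small.

```python
import math

def viterbi(obs, start, trans, emit):
    """Return the most likely state sequence for obs and its log score.

    All probabilities are assumed to be nonzero so math.log is defined.
    """
    states = list(start)
    # best[t][s]: best log score of any path that ends in state s at time t
    best = [{s: math.log(start[s]) + math.log(emit[s][obs[0]]) for s in states}]
    back = []  # back[t][s]: predecessor of s on that best path
    for o in obs[1:]:
        scores, pointers = {}, {}
        for s in states:
            prev = max(states, key=lambda p: best[-1][p] + math.log(trans[p][s]))
            scores[s] = best[-1][prev] + math.log(trans[prev][s]) + math.log(emit[s][o])
            pointers[s] = prev
        best.append(scores)
        back.append(pointers)
    # Trace back from the best final state to recover the state sequence
    last = max(states, key=lambda s: best[-1][s])
    path = [last]
    for pointers in reversed(back):
        path.append(pointers[path[-1]])
    path.reverse()
    return path, best[-1][last]

# Hypothetical toy model: one speech state and one silence state
start = {"speech": 0.8, "sil": 0.2}
trans = {"speech": {"speech": 0.7, "sil": 0.3},
         "sil": {"speech": 0.4, "sil": 0.6}}
emit = {"speech": {"loud": 0.9, "quiet": 0.1},
        "sil": {"loud": 0.2, "quiet": 0.8}}

path, log_score = viterbi(["loud", "loud", "quiet", "quiet", "loud"],
                          start, trans, emit)
print(path)
```

In a real decoding network the states would be context-dependent HMM states and the observations would be acoustic feature vectors scored by the acoustic model; the dynamic-programming recursion is the same.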

Abstract

The invention provides a speech recognition method and device. In the method, a context-dependent HMM (hidden Markov model) is adopted when the decoding network is trained, a sil (silence) model is added to a suffix in the decoding network, the acoustic contexts of the HMM states before and after the sil model are adjusted, and the HMM state jump sequence of the speech to be recognized is obtained through the decoding network. Furthermore, a jump to the head of the language model is added at the end of the language model in the decoding network, to simulate the influence of a pause between sentences on the context information of the language model. The method and device improve the speech recognition effect.
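The two structural changes named in the abstract can be pictured on a toy word-level network. The sketch below is only one possible reading, under stated assumptions: the network is a linear chain of word arcs, the sil model is represented as a self-loop at every word-end state, and the jump to the head of the language model is an empty-label arc from the final state back to the start state. The names `EPS`, `build_network`, `add_optional_sil`, `add_loopback`, and `accepts` are hypothetical, and the adjustment of acoustic contexts around sil is not modeled.

```python
from collections import defaultdict

EPS = "<eps>"  # hypothetical label for an arc that consumes no input
SIL = "sil"

def build_network(words):
    """Linear toy network: state i --word--> state i + 1."""
    arcs = defaultdict(list)
    for i, word in enumerate(words):
        arcs[i].append((word, i + 1))
    return arcs, 0, len(words)

def add_optional_sil(arcs, final):
    """Allow an optional pause after each word via a sil self-loop."""
    for state in range(1, final + 1):
        arcs[state].append((SIL, state))

def add_loopback(arcs, start, final):
    """Add a jump from the end of the network back to its head."""
    arcs[final].append((EPS, start))

def eps_closure(arcs, states):
    """All states reachable from `states` using only EPS arcs."""
    stack, seen = list(states), set(states)
    while stack:
        state = stack.pop()
        for label, dst in arcs.get(state, []):
            if label == EPS and dst not in seen:
                seen.add(dst)
                stack.append(dst)
    return seen

def accepts(arcs, start, final, labels):
    """True if the label sequence leads from start to the final state."""
    current = eps_closure(arcs, {start})
    for label in labels:
        nxt = {dst for s in current for l, dst in arcs.get(s, []) if l == label}
        current = eps_closure(arcs, nxt)
    return final in current

plain, p_start, p_final = build_network(["hello", "world"])
net, start, final = build_network(["hello", "world"])
add_optional_sil(net, final)
add_loopback(net, start, final)

pause_inside = ["hello", "sil", "world"]
two_sentences = ["hello", "world", "sil", "hello", "world"]
print(accepts(plain, p_start, p_final, pause_inside))
print(accepts(net, start, final, pause_inside))
print(accepts(net, start, final, two_sentences))
```

The unmodified chain rejects any sequence containing a pause, while the modified network accepts a pause after a word and a second sentence that restarts from the head of the network after a pause.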

Description

Technical Field

[0001] The present invention relates to the field of computer application technology, and in particular to a speech recognition method and device.

Background

[0002] Speech recognition technology allows machines to convert speech signals into corresponding text or commands through a process of recognition and understanding. With its maturity and continuous improvement, Hidden Markov Model (HMM) technology has become the mainstream method of speech recognition.

[0003] An HMM builds a statistical model of the time-series structure of the speech signal, which is regarded mathematically as a doubly stochastic process: one is the hidden stochastic process in which a Markov chain with a finite number of states models changes in the statistical characteristics of the speech signal; the other is the stochastic process of the observation sequence associated with each state of the Markov chain. The former is expressed through the latter, but the s...
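The doubly stochastic structure described in [0003] can be made concrete with the forward algorithm, which sums over every hidden state path to give the probability of an observation sequence. This is a minimal sketch with a hypothetical two-state, two-symbol model; the state names, symbols, and probabilities are assumptions for illustration only.

```python
from itertools import product

def forward_likelihood(obs, start, trans, emit):
    """P(obs) under the HMM, summed over all hidden state sequences."""
    # alpha[s]: probability of the observations so far, ending in state s
    alpha = {s: start[s] * emit[s][obs[0]] for s in start}
    for o in obs[1:]:
        alpha = {s: emit[s][o] * sum(alpha[p] * trans[p][s] for p in alpha)
                 for s in start}
    return sum(alpha.values())

# Hidden process: a Markov chain over two states (hypothetical values)
hmm_start = {"s1": 0.6, "s2": 0.4}
hmm_trans = {"s1": {"s1": 0.7, "s2": 0.3},
             "s2": {"s1": 0.2, "s2": 0.8}}
# Observable process: each state emits a symbol with its own distribution
hmm_emit = {"s1": {"x": 0.7, "y": 0.3},
            "s2": {"x": 0.1, "y": 0.9}}

p_x = forward_likelihood(["x"], hmm_start, hmm_trans, hmm_emit)

# Sanity check: likelihoods of all length-3 sequences should sum to 1
total = sum(forward_likelihood(list(seq), hmm_start, hmm_trans, hmm_emit)
            for seq in product("xy", repeat=3))
print(p_x, total)
```

Only the emitted symbols are visible to the recognizer; the state chain is inferred through them, which is the sense in which the hidden process is expressed through the observable one.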

Application Information

IPC(8): G10L15/14
Inventor: 钱胜
Owner: BEIJING BAIDU NETCOM SCI & TECH CO LTD