Voice decoding method and device

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A speech decoding and speech technology, applied in the field of speech decoding, can solve the problems of slow speed and low precision, and achieve the effect of improving speed and precision and reducing the possibility of wrong clipping.

Active Publication Date: 2012-10-17

BEIJING BAIDU NETCOM SCI & TECH CO LTD

View PDF3 Cites 3 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] The technical problem to be solved by the present invention is to provide a voice decoding method and device to solve the technical defects of slow speed and low precision in voice decoding in the prior art

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0023] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0024] Speech recognition usually includes the following parts: performing front-end processing on the speech, extracting the acoustic features of the speech, and decoding the speech feature stream formed after the feature extraction. The front-end processing of speech includes processing the original speech, partially eliminating the influence of noise and different speakers, so that the processed signal can better reflect the essential characteristics of speech. The most commonly used front-end processing includes endpoint detection and speech enhancement. In the stage of extracting the acoustic features of speech, the more commonly used acoustic features include linear prediction coefficient LPC, cepstral coefficient CEP and so on. There are ma...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a voice decoding method and device. The voice decoding method comprises the following steps of: A, obtaining a current voice characteristic frame from a voice characteristic flow to be decoded; B, utilizing the current voice characteristic frame to expand each current decoding path; C, utilizing a voice short-time stable characteristic to select more than one decoding path from each expanded decoding path to be used as the current decoding path; judging whether the voice characteristic flow to be decoded reaches to a final frame or not; if so, determining an optimal decoding path from the current each decoding path to be used as a result for decoding the voice characteristic flow to be decoded; and if not, taking a next frame of the voice characteristic flow to be decoded as the current voice characteristic frame, and returning back to the step B. With the adoption of the manner, the precision of voice decoding is improved.

Description

【Technical field】 [0001] The invention relates to speech recognition technology, in particular to a speech decoding method and device. 【Background technique】 [0002] Using the HMM (Hidden Markov Model, Hidden Markov Model) model for speech recognition has become a mainstream technology in speech recognition. HMM is a statistical model established for the time series structure of the speech signal. It regards the speech signal as a mathematical double stochastic process: one is to use a Markov chain with a finite number of states to simulate the hidden changes in the statistical characteristics of the speech signal. is a stochastic process containing , and the other is a random process of the sequence of observations associated with each state of the Markov chain. When using the HMM model for speech decoding, as the decoding process progresses, the number of decoding paths will increase geometrically. Therefore, in order to reduce the amount of calculation and speed up the ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L19/00

Inventor钱胜

OwnerBEIJING BAIDU NETCOM SCI & TECH CO LTD

Voice decoding method and device

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology