Voice identification method and apparatus

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A speech recognition and speech signal technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of easy confusion and poor recognition performance, and achieve the effect of reducing decoding paths, improving accuracy and improving decoding speed.

Active Publication Date: 2016-04-27

BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD

View PDF5 Cites 20 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, based on the state modeling method, in the process of speech recognition, when recognizing between two pronunciation units, it is easy to be confused and the recognition performance is poor.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0022] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present invention and should not be construed as limiting the present invention.

[0023] In the description of the present invention, it should be understood that the term "plurality" refers to two or more than two; sex.

[0024] The speech recognition method and device according to the embodiments of the present invention will be described below with reference to the accompanying drawings.

[0025] A speech recognition method, comprising the following steps: receiving a speech signal; decoding the speech signal according to a pre-established acoustic model, a language model and a decoding network, and dynamically ad...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a voice identification method and apparatus. The voice identification method comprises the following steps of receiving a voice signal; performing decoding on the voice signal according to a pre-established acoustic model, a linguistic model and a decoding network, and dynamically adding a blank unit in the decoding process to obtain an optimized decoding route with the added blank unit, wherein the acoustic model is obtained based on connectionist temporal classification training; the acoustic model comprises a basic pronouncing unit and the blank unit; the decoding network consists of multiple decoding routes formed by the basic pronouncing unit; and outputting the optimized decoding route to be used as the identification result of the voice signal. According to the voice identification method, the accuracy of voice identification can be improved, and the decoding speed in the identification process is improved.

Description

technical field [0001] The invention relates to the technical field of voice recognition, in particular to a voice recognition method and device. Background technique [0002] Most of the traditional speech recognition technologies are based on state modeling speech recognition models for speech recognition. For example, speech recognition is performed based on a Hidden Markov Model (Hidden Markov Model; hereinafter referred to as: HMM). HMM can be regarded as a double stochastic process in mathematics: one is to use the Markov chain with finite number of states to simulate the implicit random process of the statistical characteristics of the speech signal, and the other is to use the Markov chain with each A stochastic process of a state-associated sequence of observations. In this modeling method, a phoneme or a syllable is considered to be divided into multiple states without physical meaning, and then a discrete or continuous Gaussian model or a deep learning model is ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/06G10L19/008G10L15/26

CPCG10L15/06G10L15/063G10L15/26G10L19/008G10L2015/0631G10L15/08G10L15/14G10L15/183G10L15/02

Inventor钱胜潘复平

OwnerBAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD

Voice identification method and apparatus

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology