Voice identification method and voice identification system

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A speech recognition and speech technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of fusion decoder, different decoding space organization, and no way to decode space, etc., and achieve the effect of improving recognition accuracy.

Active Publication Date: 2013-09-25

BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD

View PDF6 Cites 37 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Due to the different organization of the decoding space of the two recognition methods, there is no way to directly integrate the two decoding spaces into one decoder.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0033] Embodiments of the invention will now be described in detail, examples of which are illustrated in the accompanying drawings, wherein like reference numerals refer to like parts throughout. The embodiments are described below in order to explain the present invention by referring to the figures. Also, descriptions of well-known functions and constructions will be omitted for clarity and conciseness.

[0034] figure 1 is a flowchart illustrating a speech recognition method according to an exemplary embodiment of the present invention.

[0035] refer to figure 1, in step S101, receive speech input and extract speech frame features. For example, for a 10-second speech, there will be 1000 frame features. Here, the methods for receiving speech input and extracting frame features can be implemented by various methods in the prior art, and will not be repeated here.

[0036] In step S102, speech decoding is performed on the input speech by using the decoding space to dete...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

Disclosed are a voice identification method and a voice identification system. The voice identification method comprises the steps of receiving voice input and extracting a voice frame characteristic; conducting voice decoding on input voice by utilizing decoding space to ensure a voice decoding result. The decoding space comprises multiple decoding paths constructed on the basis of syntax rules, the multiple decoding paths comprise three types of decoding paths, wherein one type of decoding path only comprises language type module nodes, another type of decoding path only comprises statistical language module nodes, the third type of decoding path comprises the language type module nodes and the statistical language module nodes, and a semantic parsing result is determined by recalling the nodes on the selected decoding paths. The voice decoding comprises the steps of enabling the input voice to traverse each decoding path in the decoding space, selecting a decoding path with the largest sum of a language layer score and an acoustic layer score, and determining the voice decoding result according to a triphone acoustic model of the nodes on the selected decoding path.

Description

technical field [0001] The present invention relates to speech recognition technology, more specifically, relates to a speech recognition method and a speech recognition system that realize the integration of sound recognition and semantic understanding by combining recognition based on statistical language model and recognition based on grammatical rules. Background technique [0002] With the development of information technology, speech recognition technology has entered people's life. In the existing common speech recognition technology, the commonly used recognition method is recognition based on statistical language model (Ngram), or recognition based on grammatical rules (grammer). The recognition based on the statistical language model is to combine all the speech layer information into an Ngram language model, and the recognition result is carried out on the decoding space composed of the Ngram model. The recognition based on grammatical rules organizes the languag...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/02G10L15/14

Inventor 贾磊万广鲁

Owner BAIDU ONLINE NETWORK TECH (BEIJIBG) CO LTD

Voice identification method and voice identification system

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology