Method and apparatus for constructing speech decoding network in digital speech recognition

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
A digital speech and speech decoding technology, applied in speech recognition, speech analysis, instruments, etc., can solve the problems of low recognition accuracy, limited recognition accuracy, slow recognition speed, etc., to improve the recognition accuracy and speed up the convergence speed. Effect

Active Publication Date: 2016-08-17

TENCENT TECH (SHENZHEN) CO LTD +1

View PDF10 Cites 54 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, the recognition object of this technology includes not only numbers, but also other language content, which makes the acoustic model and language model used in this technology too complex, the recognition speed is relatively slow, and it is easy to misrecognize numbers as other polyphonic characters , so that the recognition accuracy of digital speech is not high enough

Even if the recognition objects of the language model in this technology are limited to ten numbers from 0 to 9, the improvement of recognition accuracy is still limited

[0005] It can be seen that the speech decoding network built for digital speech recognition still has the problem of low recognition accuracy

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0027] Exemplary embodiments embodying the features and advantages of the present invention will be described in detail in the following description. It should be understood that the present invention can have various changes in different embodiments without departing from the scope of the present invention, and the descriptions and drawings therein are essentially used for illustration rather than limitation this invention.

[0028] As mentioned above, in digital speech recognition, existing speech decoding networks can be divided into two categories: one is isolated word recognition technology, and the other is general continuous speech recognition technology.

[0029] On the one hand, if figure 1 As shown, in the speech decoding network constructed based on the isolated word recognition technology, the starting position of the input digital voice is first judged by endpoint detection, and then the digital voice that confirms the starting position is divided into multiple v...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a method and an apparatus for constructing a speech decoding network in digital speech recognition. The method includes the following steps: acquiring training data obtained from digital speech recording, the training data including a plurality of speech sections; extracting acoustic features from the training data so as to obtain a feature sequence which corresponding with each speech section; based on the feature sequences and phonemes corresponding with digits in the training data, conducting progressive training beginning with a single phoneme acoustic model to obtain an acoustic model; acquiring a language model, constructing a speech decoding network through the language model and the acoustic model obtained in the training, the language model being obtained by modeling matching relationship of digits in the training data. According to the invention, the method and the apparatus can effectively increase recognition accuracy of digital speech.

Description

technical field [0001] The invention relates to the technical field of speech recognition, in particular to a method and device for constructing a speech decoding network in digital speech recognition. Background technique [0002] In digital speech recognition, the existing speech decoding networks can be divided into two categories: one is to use isolated word recognition technology to recognize numbers in speech; the other is to use general continuous speech recognition technology to process numbers in speech. identify. [0003] In digital speech recognition based on isolated word recognition technology, it is required to input digital speech with a clear interval between numbers. If it is continuous digital input, it may lead to unrecognized or incorrect recognition, which greatly reduces the recognition accuracy of digital speech. , digital speech recognition based on isolated word recognition technology has obvious limitations. [0004] Thus, general continuous speec...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L15/02G10L15/14

CPCG10L15/02G10L15/144G10L2015/0631G10L15/063G10L15/187G10L2015/025G10L15/04G10L15/14G10L15/142G10L25/24G10L25/90

Inventor吴富章钱柄桦李为李科吴永坚黄飞跃

OwnerTENCENT TECH (SHENZHEN) CO LTD

Method and apparatus for constructing speech decoding network in digital speech recognition

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology