
Method and apparatus for constructing speech decoding network in digital speech recognition

A digital speech decoding technology, applied in speech recognition, speech analysis, instruments, etc. It addresses the low recognition accuracy and slow recognition speed of existing approaches, improving recognition accuracy and speeding up convergence.

Active Publication Date: 2016-08-17
TENCENT TECH (SHENZHEN) CO LTD +1
Cites: 10; Cited by: 54

AI Technical Summary

Problems solved by technology

However, the recognition targets of this technology include not only numbers but also other language content. This makes the acoustic model and language model it uses too complex and its recognition relatively slow, and numbers are easily misrecognized as other polyphonic characters, so the recognition accuracy for digital speech is not high enough.
Even if the recognition targets of the language model in this technology are limited to the ten numbers 0 to 9, the improvement in recognition accuracy is still limited.
[0005] It can be seen that speech decoding networks built for digital speech recognition still suffer from low recognition accuracy.



Embodiment Construction

[0027] Exemplary embodiments embodying the features and advantages of the present invention are described in detail below. It should be understood that the present invention can vary across different embodiments without departing from its scope, and that the descriptions and drawings herein are intended to illustrate the present invention rather than to limit it.

[0028] As mentioned above, in digital speech recognition, existing speech decoding networks can be divided into two categories: one is isolated word recognition technology, and the other is general continuous speech recognition technology.

[0029] On the one hand, as shown in Figure 1, in a speech decoding network constructed based on isolated word recognition technology, the starting position of the input digital voice is first determined by endpoint detection, and then the digital voice whose starting position has been confirmed is divided into multiple v...
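The endpoint detection step mentioned in [0029] can be illustrated with a minimal sketch. The source does not say how endpoints are detected; the short-time energy threshold used here is a common textbook approach and is an assumption, as are the names `frame_energies` and `detect_endpoints` and all parameter values.

```python
def frame_energies(samples, frame_len, hop):
    """Short-time energy (sum of squared samples) of each overlapping frame."""
    energies = []
    for start in range(0, len(samples) - frame_len + 1, hop):
        frame = samples[start:start + frame_len]
        energies.append(sum(x * x for x in frame))
    return energies


def detect_endpoints(samples, frame_len=4, hop=2, threshold=0.5):
    """Return (start, end) sample indices spanning the first to the last
    frame whose energy exceeds the threshold, or None if no frame does.

    Assumption: a fixed energy threshold; this is illustrative only and
    is not the method described in the patent.
    """
    energies = frame_energies(samples, frame_len, hop)
    active = [i for i, e in enumerate(energies) if e > threshold]
    if not active:
        return None
    start = active[0] * hop
    end = active[-1] * hop + frame_len
    return start, end


if __name__ == "__main__":
    # Toy signal: silence, a burst of "speech", then silence again.
    signal = [0.0] * 6 + [1.0] * 8 + [0.0] * 6
    print(detect_endpoints(signal, threshold=3.0))
```

A lower threshold widens the detected region because frames that only partly overlap the burst also count as active; practical systems usually add smoothing and a minimum duration, which this sketch omits.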


Abstract

The invention discloses a method and an apparatus for constructing a speech decoding network in digital speech recognition. The method includes the following steps: acquiring training data obtained from digital speech recording, the training data including a plurality of speech segments; extracting acoustic features from the training data to obtain a feature sequence corresponding to each speech segment; based on the feature sequences and the phonemes corresponding to the digits in the training data, conducting progressive training, beginning with a monophone acoustic model, to obtain an acoustic model; and acquiring a language model and constructing a speech decoding network from the language model and the trained acoustic model, the language model being obtained by modeling the matching relationship of the digits in the training data. The method and the apparatus can effectively increase the recognition accuracy of digital speech.
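As one possible reading of "modeling the matching relationship of the digits in the training data", the sketch below estimates a bigram model over digit sequences. The abstract does not specify the model form; the bigram structure, the add-alpha smoothing, and the names `train_digit_bigram` and `log_prob` are assumptions made for illustration.

```python
import math
from collections import defaultdict

DIGITS = "0123456789"
START, END = "<s>", "</s>"


def train_digit_bigram(sequences, alpha=1.0):
    """Estimate P(next | previous) over digit strings with add-alpha smoothing.

    Assumption: a bigram model is used here only as a simple stand-in for
    the patent's language model, whose exact form is not given.
    """
    counts = defaultdict(lambda: defaultdict(float))
    for seq in sequences:
        tokens = [START] + list(seq) + [END]
        for prev, cur in zip(tokens, tokens[1:]):
            counts[prev][cur] += 1
    vocab = list(DIGITS) + [END]  # symbols that may follow a context
    model = {}
    for prev in [START] + list(DIGITS):
        total = sum(counts[prev].values()) + alpha * len(vocab)
        model[prev] = {cur: (counts[prev][cur] + alpha) / total for cur in vocab}
    return model


def log_prob(model, seq):
    """Log-probability of a digit string under the bigram model."""
    tokens = [START] + list(seq) + [END]
    return sum(math.log(model[p][c]) for p, c in zip(tokens, tokens[1:]))


if __name__ == "__main__":
    lm = train_digit_bigram(["123", "123", "124"])
    # Sequences resembling the training data score higher than unseen orders.
    print(log_prob(lm, "123") > log_prob(lm, "321"))
```

Restricting the vocabulary to the ten digits plus an end marker keeps the model small, which is in line with the document's point that a general-purpose language model is more complex than digit recognition requires.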

Description

Technical field

[0001] The invention relates to the technical field of speech recognition, in particular to a method and device for constructing a speech decoding network in digital speech recognition.

Background

[0002] In digital speech recognition, existing speech decoding networks can be divided into two categories: one uses isolated word recognition technology to recognize numbers in speech; the other uses general continuous speech recognition technology to recognize numbers in speech.

[0003] Digital speech recognition based on isolated word recognition technology requires the input digital speech to have clear intervals between numbers. Continuous digit input may go unrecognized or be recognized incorrectly, which greatly reduces the recognition accuracy of digital speech. Digital speech recognition based on isolated word recognition technology therefore has obvious limitations.

[0004] Thus, general continuous speec...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L15/02, G10L15/14
CPC: G10L15/02, G10L15/144, G10L2015/0631, G10L15/063, G10L15/187, G10L2015/025, G10L15/04, G10L15/14, G10L15/142, G10L25/24, G10L25/90
Inventor: 吴富章, 钱柄桦, 李为, 李科, 吴永坚, 黄飞跃
Owner: TENCENT TECH (SHENZHEN) CO LTD