Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speech decoding method and device and storage medium

A voice decoding and decoding network technology, applied in the fields of devices, storage media, and voice decoding methods, can solve the problems of large memory and difficult decoding of high-level language models, improve decoding speed, save computing resources and storage resources, The effect of ensuring decoding accuracy

Active Publication Date: 2019-08-23
TENCENT TECH (SHENZHEN) CO LTD
View PDF6 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, the memory of the high-level language model is relatively large, and the memory of the decoding network generated based on the high-level language model is much larger than that of the high-level language model, which requires the configuration of a large number of storage resources and computing resources. In scenarios with limited computing resources, it is difficult to achieve decoding. Therefore, there is an urgent need for a speech decoding method that takes into account both decoding speed and decoding accuracy.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech decoding method and device and storage medium
  • Speech decoding method and device and storage medium
  • Speech decoding method and device and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] In order to make the object, technical solution and advantages of the present invention clearer, the implementation manner of the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0027] First, symbols involved in the present invention will be described.

[0028] : represents the empty symbol;

[0029] Ilabel: represents the input symbol;

[0030] Olable: represents the output symbol;

[0031] : represents the start symbol;

[0032] State.A: Indicates the state of the token in the first decoding network corresponding to the low-level language model;

[0033] State.B: Indicates the state of the token in the second decoding network corresponding to the differential language model.

[0034] Next, important terms involved in the present invention are explained.

[0035] 1. WFST (Weighted Finaite-State Transducer, weighted finite state machine) is used for large-scale speech recognition, and its ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a speech decoding method and device and a storage medium, and belongs to the technical field of speech recognition. The speech decoding method includes the steps that a targettoken corresponding to a minimum decoding score is obtained from a first token list, wherein the first token list includes the first token obtained by decoding a previous audio frame, and the first token includes state pairs formed by decoding in different decoding networks and the decoding scores thereof; pruning parameters when decoding a current audio frame is determined according to the targettoken and an acoustic vector of the current audio frame; and according to the first token list, the pruning parameters and the acoustic vector, the current audio frame is decoded. According to the speech decoding method and device and the storage medium, a decoding network corresponding to a high-order language model does not need to be generated, and decoding is performed based on the decoding networks corresponding to a low-level language model and a differential language model, so that computing resources and storage resources are saved under the premise of ensuring decoding accuracy; andthe current audio frame is decoded according to decoding results of the previous audio frame, so that the decoding speed is increased.

Description

technical field [0001] The invention relates to the technical field of voice recognition, in particular to a voice decoding method, device and storage medium. Background technique [0002] Speech recognition technology, also known as ASR (Automatic Speech Recognition, automatic speech recognition), its goal is to convert the vocabulary content in human speech into computer-readable input, including keystrokes, binary codes or character sequences, etc., so as to realize human-machine interact. Speech recognition technology has a wide range of application scenarios in modern life, and can be applied to scenarios such as car navigation, smart home, voice dialing, and simultaneous interpretation. The decoder is the core of the speech recognition system, and the speech decoding process based on the decoder plays an important role in the entire speech recognition process, directly affecting the accuracy of the recognition results. [0003] At present, the speech decoding process...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/05G10L15/06G10L15/14G10L15/16G10L15/18G10L15/22G10L15/26G10L19/008
CPCG10L15/22G10L15/063G10L15/142G10L15/16G10L15/18G10L15/05G10L15/1822G10L15/26G10L19/008G10L2015/223G10L15/197G10L2015/085G10L15/183G10L15/187G10L15/32G10L15/083G10L2015/088
Inventor 黄羿衡简小征贺利强
Owner TENCENT TECH (SHENZHEN) CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products