Speech decoding method and device and storage medium

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A voice decoding and decoding network technology, applied in the fields of devices, storage media, and voice decoding methods, can solve the problems of large memory and difficult decoding of high-level language models, improve decoding speed, save computing resources and storage resources, The effect of ensuring decoding accuracy

Active Publication Date: 2019-08-23

TENCENT TECH (SHENZHEN) CO LTD

View PDF6 Cites 6 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] However, the memory of the high-level language model is relatively large, and the memory of the decoding network generated based on the high-level language model is much larger than that of the high-level language model, which requires the configuration of a large number of storage resources and computing resources. In scenarios with limited computing resources, it is difficult to achieve decoding. Therefore, there is an urgent need for a speech decoding method that takes into account both decoding speed and decoding accuracy.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0026] In order to make the object, technical solution and advantages of the present invention clearer, the implementation manner of the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0027] First, symbols involved in the present invention will be described.

[0028] : represents the empty symbol;

[0029] Ilabel: represents the input symbol;

[0030] Olable: represents the output symbol;

[0031] : represents the start symbol;

[0032] State.A: Indicates the state of the token in the first decoding network corresponding to the low-level language model;

[0033] State.B: Indicates the state of the token in the second decoding network corresponding to the differential language model.

[0034] Next, important terms involved in the present invention are explained.

[0035] 1. WFST (Weighted Finaite-State Transducer, weighted finite state machine) is used for large-scale speech recognition, and its ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a speech decoding method and device and a storage medium, and belongs to the technical field of speech recognition. The speech decoding method includes the steps that a targettoken corresponding to a minimum decoding score is obtained from a first token list, wherein the first token list includes the first token obtained by decoding a previous audio frame, and the first token includes state pairs formed by decoding in different decoding networks and the decoding scores thereof; pruning parameters when decoding a current audio frame is determined according to the targettoken and an acoustic vector of the current audio frame; and according to the first token list, the pruning parameters and the acoustic vector, the current audio frame is decoded. According to the speech decoding method and device and the storage medium, a decoding network corresponding to a high-order language model does not need to be generated, and decoding is performed based on the decoding networks corresponding to a low-level language model and a differential language model, so that computing resources and storage resources are saved under the premise of ensuring decoding accuracy; andthe current audio frame is decoded according to decoding results of the previous audio frame, so that the decoding speed is increased.

Description

technical field [0001] The invention relates to the technical field of voice recognition, in particular to a voice decoding method, device and storage medium. Background technique [0002] Speech recognition technology, also known as ASR (Automatic Speech Recognition, automatic speech recognition), its goal is to convert the vocabulary content in human speech into computer-readable input, including keystrokes, binary codes or character sequences, etc., so as to realize human-machine interact. Speech recognition technology has a wide range of application scenarios in modern life, and can be applied to scenarios such as car navigation, smart home, voice dialing, and simultaneous interpretation. The decoder is the core of the speech recognition system, and the speech decoding process based on the decoder plays an important role in the entire speech recognition process, directly affecting the accuracy of the recognition results. [0003] At present, the speech decoding process...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L15/05G10L15/06G10L15/14G10L15/16G10L15/18G10L15/22G10L15/26G10L19/008

CPCG10L15/22G10L15/063G10L15/142G10L15/16G10L15/18G10L15/05G10L15/1822G10L15/26G10L19/008G10L2015/223G10L15/197G10L2015/085G10L15/183G10L15/187G10L15/32G10L15/083G10L2015/088

Inventor黄羿衡简小征贺利强

OwnerTENCENT TECH (SHENZHEN) CO LTD

Speech decoding method and device and storage medium

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology