Unlock instant, AI-driven research and patent intelligence for your innovation.

Speech recognition model training method, speech recognition method and system

A speech recognition model and technology to be recognized, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as lack of performance and insufficient language modeling ability, and achieve the effect of improving accuracy and modeling ability

Active Publication Date: 2022-04-01
INST OF AUTOMATION CHINESE ACAD OF SCI
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Although it introduces a language predictor, its language modeling ability is insufficient. After research, it is found that the language predictor does not play a role similar to a language model in real reasoning, but more assumes the function of eliminating duplicate labels. There is room for further improvement in the ability to model dependencies between languages

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech recognition model training method, speech recognition method and system
  • Speech recognition model training method, speech recognition method and system
  • Speech recognition model training method, speech recognition method and system

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0043] The speech recognition model based on Transducer has been widely used at home and abroad. The model usually consists of three parts, namely an acoustic encoder, a language predictor and a joint network. The acoustic encoder is responsible for encoding the input acoustic features into an acoustic encoding state vector, the input of the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The embodiment of the invention discloses a training method of a speech recognition model, a speech recognition method and a system, and relates to the technical field of speech recognition. This embodiment includes: inputting audio training samples into an acoustic encoder, encoding and representing the audio training samples, and determining the acoustic coding state vector; inputting a preset vocabulary into a language predictor to determine a text prediction vector; Input the text mapping layer to obtain the text output probability distribution; calculate the first loss function according to the target text sequence corresponding to the audio training sample and the text output probability distribution; input the text prediction vector and the acoustic encoding state vector into the joint network to calculate the second loss function , perform iterative optimization according to the first loss function and the second loss function until the stopping condition is satisfied. This embodiment adjusts the training and prediction process of the speech recognition model, improves the modeling capability of the semantic recognition model, and thus improves the accuracy of the speech recognition model.

Description

technical field [0001] The present application relates to the technical field of speech recognition, in particular to a speech recognition model training method, speech recognition method and system. Background technique [0002] The speech recognition model based on Transducer has been widely used at home and abroad, and its typical feature is that it can directly adapt to streaming speech recognition tasks. Although it introduces a language predictor, its language modeling ability is insufficient. After research, it is found that the language predictor does not play a role similar to a language model in real reasoning, but more assumes the function of eliminating duplicate labels. There is room for further improvement in the ability to model dependencies between languages. Contents of the invention [0003] In order to solve the above technical problems or at least partly solve the above technical problems, embodiments of the present invention provide a speech recogniti...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/06G10L15/22G10L15/26G10L19/16G10L25/03G10L25/24
CPCG10L15/063G10L15/26G10L15/22G10L25/03G10L25/24G10L19/16G10L15/16G06N3/045G06N3/0464G06N3/09G06F40/284G10L15/02G10L15/197
Inventor 陶建华田正坤易江燕
Owner INST OF AUTOMATION CHINESE ACAD OF SCI