Voice recognition model training method and device, electronic equipment and storage medium

A speech recognition model training technology, applied in speech recognition, speech analysis, instruments, etc. It addresses a training process that is time-consuming and expensive, and it improves training efficiency and results while optimizing the neural network model.

Pending Publication Date: 2020-06-30

THE FOURTH PARADIGM BEIJING TECH CO LTD

8 Cites, 1 Cited by


AI Technical Summary

Problems solved by technology

[0004] However, the entire training process takes a long time, and the number of training rounds and steps needed to decide whether alignment labeling is complete is currently estimated from manual experience. As a result, the cost of tuning parameters during the training process, especially the parameters of alignment labeling, is very high.


Detailed Description of Embodiments

[0033] In order to more clearly understand the above objects, features and advantages of the present disclosure, the present disclosure will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the described embodiments are some of the embodiments of the present disclosure, but not all of the embodiments. The specific embodiments described here are only used to explain the present disclosure, but not to limit the present disclosure. All other embodiments obtained by persons of ordinary skill in the art based on the described embodiments of the present disclosure belong to the protection scope of the present disclosure.

[0034] It should be noted that, in this document, relational terms such as "first" and "second" are only used to distinguish one entity or operation from another entity or operation, and do not necessarily require or imply that any such actual relationship or order exists between these entities...


Abstract

The embodiment of the invention relates to a voice recognition model training method and device, electronic equipment and a storage medium, and is applied to a DNN-HMM voice recognition framework. The method includes the steps of: obtaining voice data; extracting features of the voice data; performing multi-branch alignment labeling on the voice data based on the features; selecting an alignment labeling result based on the multi-branch alignment labeling; performing total training on the neural network based on the selected alignment labeling result to obtain a neural network model; and obtaining a voice recognition model based on the neural network model and the language model. In the embodiment of the invention, multi-branch alignment labeling and selection of the alignment labeling result optimize the trained neural network model without manual intervention, and improve training efficiency and effect.
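The "multi-branch alignment labeling, then select one result" step in the abstract can be illustrated with a small sketch. Everything below is hypothetical and is not taken from the patent: the toy `threshold_aligner` stands in for a real alignment branch, `compactness_score` is an assumed selection criterion (the abstract does not say how a result is chosen), and the feature values are made up. It only shows the general shape of running several branches and picking one automatically instead of by manual judgment.

```python
from dataclasses import dataclass
from typing import Callable, Dict, List

Features = List[float]   # toy stand-in for per-frame acoustic features
Alignment = List[int]    # one hypothetical HMM-state id per frame


@dataclass
class BranchResult:
    name: str
    alignment: Alignment
    score: float


def threshold_aligner(threshold: float) -> Callable[[Features], Alignment]:
    """Toy aligner (assumption): label a frame 1 if above threshold, else 0."""
    def align(features: Features) -> Alignment:
        return [1 if x > threshold else 0 for x in features]
    return align


def compactness_score(features: Features, alignment: Alignment) -> float:
    """Toy criterion (assumption): negative within-state squared deviation."""
    total = 0.0
    for state in set(alignment):
        frames = [x for x, s in zip(features, alignment) if s == state]
        mean = sum(frames) / len(frames)
        total += sum((x - mean) ** 2 for x in frames)
    return -total


def run_branches(
    features: Features,
    branches: Dict[str, Callable[[Features], Alignment]],
    score_fn: Callable[[Features, Alignment], float],
) -> List[BranchResult]:
    """Run every alignment branch on the same features and score each result."""
    results = []
    for name, align_fn in branches.items():
        alignment = align_fn(features)
        results.append(BranchResult(name, alignment, score_fn(features, alignment)))
    return results


def select_alignment(results: List[BranchResult]) -> BranchResult:
    """Pick the highest-scoring branch; no manual inspection involved."""
    if not results:
        raise ValueError("no alignment branches were run")
    return max(results, key=lambda r: r.score)


features = [0.1, 0.2, 0.15, 0.9, 1.0, 0.95]
branches = {
    "threshold_0.5": threshold_aligner(0.5),
    "threshold_0.12": threshold_aligner(0.12),
}
best = select_alignment(run_branches(features, branches, compactness_score))
print(best.name, best.alignment)
```

In a real DNN-HMM setup each branch would presumably be a full alignment training run with its own configuration, and the selected alignment would then serve as the training target for the neural network; the sketch keeps only the select-the-best control flow.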

Description

Technical field

[0001] Embodiments of the present disclosure relate to the technical field of speech recognition, and in particular to a speech recognition model training method, device, electronic equipment, and storage medium.

Background

[0002] Speech recognition technology enables machines to convert speech signals into corresponding text or commands through a process of recognition and understanding. At present, training a speech recognition model under the DNN-HMM speech recognition framework includes three parts: feature extraction, alignment labeling, and neural network training. Alignment labeling is completed through feature transformation and alignment training, and determines the input and output; neural network training is based on that input and output, and the neural network model is obtained after the neural network training; the final model obtained by combining the neural network model and the language model is t...


Application Information


IPC(8): G10L15/06, G10L15/02, G10L15/14, G10L15/16, G10L15/18

CPC: G10L15/063, G10L15/02, G10L15/16, G10L15/144, G10L15/18, Y02T10/40

Inventor: 王靖淞, 涂威威

Owner: THE FOURTH PARADIGM BEIJING TECH CO LTD
