Training method and system for children's speech recognition model

A training technique for speech recognition models, applied in speech recognition, speech analysis, components of TV systems, and similar fields. It addresses the insufficient volume of children's voice data and the resulting poor performance of children's speech recognition models, and it achieves high recognition accuracy.

Active Publication Date: 2021-12-14

AISPEECH CO LTD

7 Cites, 0 Cited by


AI Technical Summary

Problems solved by technology

[0009] The invention aims to solve at least the problem in the prior art that children's speech recognition models perform poorly because the amount of children's voice data is insufficient.


Embodiment Construction

[0032] To make the purpose, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the accompanying drawings. The described embodiments are some, but not all, of the embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art based on these embodiments, without creative effort, fall within the protection scope of the present invention.

[0033] Figure 1 shows a flow chart of a training method for a children's speech recognition model provided by an embodiment of the present invention. The method includes the following steps:

[0034] S11: Obtain training data, which includes children's voice training data, hard labels corresponding to the children's voice training da...


Abstract

An embodiment of the present invention provides a training method for a children's speech recognition model. The method includes: obtaining training data; obtaining an unconditional generative adversarial network (GAN) through baseline acoustic model training; inputting random noise data into the unconditional GAN to obtain noise-enhanced acoustic features; inputting the noise-enhanced acoustic features into the baseline acoustic model to obtain a posterior-probability soft label for each frame of the noise-enhanced acoustic features; and training the children's voice-enhanced acoustic recognition model on sample training data that includes at least the noise-enhanced acoustic features with their soft labels and the children's voice training data with its hard labels. An embodiment of the present invention also provides a training system for a children's speech recognition model. When children's voice data is limited, the embodiments change the pronunciation characteristics of the children's voices to generate diversified children's voices and improve the recognition accuracy of the children's speech recognition model.
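The data flow in the abstract can be sketched in a few lines of NumPy. This is only an illustration under stated assumptions, not the patented implementation: the generator and both acoustic models are random linear stand-ins rather than trained networks, the dimensions (`N_FEAT`, `N_CLASSES`, `N_NOISE`), the helper names, and the equal weighting of the hard-label and soft-label losses are all hypothetical choices that do not come from the patent text.

```python
import numpy as np

rng = np.random.default_rng(0)

N_FEAT = 40      # hypothetical acoustic feature dimension per frame
N_CLASSES = 10   # hypothetical number of acoustic output classes
N_NOISE = 16     # hypothetical size of the random noise vector


def softmax(logits):
    """Row-wise softmax with max-subtraction for numerical stability."""
    shifted = logits - logits.max(axis=1, keepdims=True)
    exp = np.exp(shifted)
    return exp / exp.sum(axis=1, keepdims=True)


def toy_generator(noise, weights):
    """Stand-in for a trained unconditional GAN generator: noise -> features."""
    return np.tanh(noise @ weights)


def acoustic_posteriors(features, weights):
    """Stand-in for an acoustic model: per-frame class posteriors."""
    return softmax(features @ weights)


def cross_entropy(posteriors, targets):
    """Mean cross-entropy; targets may be one-hot (hard) or soft rows."""
    return float(-(targets * np.log(posteriors + 1e-12)).sum(axis=1).mean())


# Random placeholder weights stand in for trained models (assumption).
gen_w = rng.normal(size=(N_NOISE, N_FEAT))
baseline_w = rng.normal(size=(N_FEAT, N_CLASSES))
student_w = rng.normal(size=(N_FEAT, N_CLASSES)) * 0.1

# Real children's speech frames with hard labels (random placeholders here).
real_feats = rng.normal(size=(32, N_FEAT))
hard_labels = np.eye(N_CLASSES)[rng.integers(0, N_CLASSES, size=32)]

# Generated frames: random noise -> generator -> baseline model -> soft labels.
gen_feats = toy_generator(rng.normal(size=(32, N_NOISE)), gen_w)
soft_labels = acoustic_posteriors(gen_feats, baseline_w)

# Combined objective over both kinds of sample (equal weighting is an assumption).
loss = (cross_entropy(acoustic_posteriors(real_feats, student_w), hard_labels)
        + cross_entropy(acoustic_posteriors(gen_feats, student_w), soft_labels))
print(f"combined loss: {loss:.3f}")
```

The point of the sketch is the labeling asymmetry: real frames carry one-hot hard labels, while generated frames have no transcript, so the baseline model's per-frame posteriors serve as their targets.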

Description

Technical field

[0001] The invention relates to the field of speech recognition, in particular to a training method and system for a children's speech recognition model.

Background

[0002] With the development of intelligent voice technology, many voice-interaction products are available not only for adult users but also for children, such as intelligent story machines and intelligent robots. However, because children's voices differ from adult voices, existing speech recognition systems perform poorly on children's speech.

[0003] Two approaches are commonly used to address this problem. The first is noise addition: during data preprocessing, the FaNT tool adds one of 115 noise types, chosen at random, at a signal-to-noise ratio of 20 dB to each sentence in the clean children's audio, increasing the amount of children's voice data. The second is random feature mapping: random feature mapping learns an a...
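The noise-addition approach in [0003] comes down to scaling a noise signal so that the clean-to-noise power ratio hits a target SNR before summing. The sketch below shows only that arithmetic in NumPy. It is not FaNT and does not reproduce its interface or filtering options; `mix_at_snr`, the synthetic sine "utterance", and the random noise bank are hypothetical placeholders.

```python
import numpy as np


def mix_at_snr(clean, noise, snr_db):
    """Scale `noise` so the clean/noise power ratio equals snr_db, then add it."""
    noise = np.resize(noise, clean.shape)   # repeat or trim noise to match length
    clean_power = np.mean(clean ** 2)
    noise_power = np.mean(noise ** 2)
    target_noise_power = clean_power / (10.0 ** (snr_db / 10.0))
    scale = np.sqrt(target_noise_power / noise_power)
    return clean + scale * noise


rng = np.random.default_rng(0)
t = np.arange(16000) / 16000.0
clean = np.sin(2 * np.pi * 220.0 * t)                      # placeholder utterance
noise_bank = [rng.normal(size=8000) for _ in range(115)]   # placeholder noise types
noise = noise_bank[rng.integers(len(noise_bank))]          # pick one at random
noisy = mix_at_snr(clean, noise, snr_db=20.0)

# Measure the SNR actually achieved by the mix.
residual = noisy - clean
achieved = 10.0 * np.log10(np.mean(clean ** 2) / np.mean(residual ** 2))
print(f"achieved SNR: {achieved:.2f} dB")
```

Because each clean sentence yields a new noisy copy, this kind of augmentation multiplies the data volume without changing the underlying speaker characteristics.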


Application Information


Patent Type & Authority: Patents (China)

IPC(8): G10L15/02, G10L15/06, G10L15/26, G10L15/20

CPC: G10L15/063, G10L15/02, G10L15/26, H04N23/71

Inventor: 钱彦旻, 吴松泽, 俞凯, 盛佩瑶, 杨卓林, 李晨达

Owner: AISPEECH CO LTD
