Child speech recognition model training corpus screening method

A technique for speech recognition models and their training corpora, applied in speech recognition, speech synthesis, and speech analysis. It addresses problems such as the inconsistent quality of synthesized speech and the unclear or nonstandard pronunciation found in children's speech.

Active Publication Date: 2021-04-09

AISPEECH CO LTD



AI Technical Summary

Problems solved by technology

Using synthetic speech generated by TTS systems trained on child speech data is problematic, because child speech often involves substandard or unclear articulation.

As a result, the quality of the synthesized speech is inconsistent.


Embodiment Construction

[0021] To make the purpose, technical solutions, and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments are described clearly and completely below with reference to the accompanying drawings. Obviously, the described embodiments are some, but not all, of the embodiments of the present invention. All other embodiments obtained by persons of ordinary skill in the art from these embodiments without creative effort fall within the protection scope of the present invention.

[0022] It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other.

[0023] The invention may be described in the general context of computer-executable instructions, such as program modules, being executed by a computer. Generally, progr...

Abstract

The invention discloses a method for screening the training corpus of a child speech recognition model. The method comprises the following steps: inputting a phoneme sequence and a child reference audio into a TTS synthesizer to obtain a plurality of synthesized audios; acquiring reference feature information of the child reference audio and a plurality of pieces of synthesized feature information of the plurality of synthesized audios; and screening the plurality of synthesized audios according to the reference feature information and the plurality of pieces of synthesized feature information. In this method, the TTS synthesizer generates the synthesized audio, and the synthesized audio is screened against the child reference audio used to generate it, so that only high-quality synthesized audio is retained and the corpus for training the child speech recognition model is expanded. The invention thereby addresses the difficulty of collecting a child corpus, ensures the quality of the corpus, and allows a child speech recognition model with good performance to be trained.
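
The abstract does not state which features are extracted or how they are compared. The screening step can be sketched as follows, under the assumption that each audio clip has already been reduced to a fixed-length feature vector (for example, a speaker embedding) and that a clip is kept when its cosine similarity to the reference vector reaches a threshold. The function names, the similarity measure, and the 0.8 threshold are illustrative assumptions, not details taken from the patent.

```python
import math
from typing import List, Sequence


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine of the angle between two equal-length feature vectors."""
    if len(a) != len(b):
        raise ValueError("feature vectors must have the same length")
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = math.sqrt(sum(x * x for x in a))
    norm_b = math.sqrt(sum(y * y for y in b))
    if norm_a == 0.0 or norm_b == 0.0:
        return 0.0  # treat an all-zero vector as dissimilar to everything
    return dot / (norm_a * norm_b)


def screen_synthesized(
    reference_features: Sequence[float],
    synthesized_features: Sequence[Sequence[float]],
    threshold: float = 0.8,  # assumed cut-off, not specified in the source
) -> List[int]:
    """Return the indices of synthesized clips that resemble the reference."""
    kept = []
    for index, features in enumerate(synthesized_features):
        if cosine_similarity(reference_features, features) >= threshold:
            kept.append(index)
    return kept


if __name__ == "__main__":
    reference = [1.0, 0.0, 1.0]
    candidates = [
        [0.9, 0.1, 1.1],     # close to the reference
        [-1.0, 0.5, -1.0],   # points the opposite way
        [1.0, 0.0, 1.0],     # identical to the reference
    ]
    print(screen_synthesized(reference, candidates))  # prints [0, 2]
```

In this sketch the retained indices would select the synthesized clips to add to the training corpus. A real system would replace the toy vectors with features computed from the audio.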

Description

Technical field

[0001] The invention relates to the technical field of speech recognition, in particular to a method for screening the training corpus of a children's speech recognition model, an electronic device, and a storage medium.

Background

[0002] The performance of automatic speech recognition (ASR) systems has improved significantly since the introduction of deep neural networks. With large amounts of training data and advanced model structures, ASR models can now achieve human-level performance. However, to the best of our knowledge, children's speech recognition remains a challenging task despite many efforts.

[0003] One challenge of children's speech recognition is the lack of data, as child corpora are difficult to collect. In addition, children show inherently high variability in physical and vocal characteristics and in expression. To overcome these difficulties, vocal tract length normalization has been proposed to reduce the acoustic variability between speak...

Application Information

Patent Type & Authority: Applications (China)

IPC(8): G10L13/02, G10L25/24, G10L15/14, G10L15/16

CPC: G10L13/02, G10L25/24, G10L15/14, G10L15/16

Inventor: 钱彦旻, 王巍, 周之恺, 卢怡宙, 王鸿基, 杜晨鹏

Owner: AISPEECH CO LTD
