Voice noise method and system for data enhancement

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A data and speech technology, applied in the field of data-enhanced speech noise addition method and system, can solve the problems of high cost of artificial noise addition, limited noise types and quantities, poor robustness of speech recognition model and poor generalization ability, etc., to achieve The effect of reducing time and calculation and improving robustness

Active Publication Date: 2019-09-06

AISPEECH CO LTD

View PDF5 Cites 19 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] In order to at least solve the problem in the prior art that due to the limited types and quantities of artificially added noise, the trained speech recognition model has poor robustness and generalization ability, and at the same time, the cost of artificially added noise is relatively high.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0026] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0027] Such as figure 1 Shown is a flow chart of a speech noise adding method for data enhancement provided by an embodiment of the present invention, including the following steps:

[0028] S11: Using the speaker vector of the noise-free frequency as the condition of the conditional variational self-encoding model, input the speaker vector of th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The embodiment of the invention provides a voice noise adding method for data enhancement. The voice noise adding method includes the steps that speaker vectors of noise-free voice frequency and voicefrequency with noise are input into a conditional variational self-coding model, the vector mean and a variance vector outputted by a model encoder are sampled in a random gaussian distribution mode,and a noise implicit vector is obtained; the noise implicit vector and the noise-free voice frequency are input into the model, and simulation noise voice frequency is output by the model decoder; the model is based on the training conditions of the simulation noise voice frequency and the voice frequency with the noise, obtained varied noise implicit vectors are modeled, and the noise implicit variable space is obtained; and the noise implicit variable space is randomly sampled to be used as a noise adding implicit vector, the noise adding implicit vector and the voice frequency with the noise are input into the model decoder, and new voice frequency with the noise for data enhancement is obtained. The embodiment of the invention further provides a voice noise adding system for data enhancement. According to the voice noise adding method and system for data enhancement, the speaker vectors are modeled, through extraction of implicit space features, more varied noise data can be generated, and robustness of a voice recognition model is improved.

Description

technical field [0001] The invention relates to the field of voice recognition, in particular to a voice noise adding method and system for data enhancement. Background technique [0002] Over time, speech recognition technology has made great improvements, but when speech recognition is applied to environments with complex noise, it still has a certain impact on its recognition performance. In order to make the speech recognition model better applicable to various noise environments and improve the robustness and generalization ability of the speech recognition model to noise, it is usually trained with more noisy frequencies, because this way It is simple and effective, but it is difficult to obtain the noisy frequency suitable for training. For this reason, artificial noise is usually added to clean speech. For example, artificially collecting noise and then mixing the noise with clean speech yields more noisy frequencies suitable for training. [0003] In the process o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G10L15/06G10L15/07G10L15/20

CPCG10L15/063G10L15/07G10L15/20G10L2015/0631

Inventor俞凯钱彦旻吴章昊王帅

OwnerAISPEECH CO LTD

Voice noise method and system for data enhancement

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology