Voice noise method and system for data enhancement

A data and speech technology, applied in the field of data-enhanced speech noise addition method and system, can solve the problems of high cost of artificial noise addition, limited noise types and quantities, poor robustness of speech recognition model and poor generalization ability, etc., to achieve The effect of reducing time and calculation and improving robustness

Active Publication Date: 2019-09-06
AISPEECH CO LTD
View PDF5 Cites 19 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In order to at least solve the problem in the prior art that due to the limited types and quantities of artificially added noise, the trained speech recognition model has poor robustness and generalization ability, and at the same time, the cost of artificially added noise is relatively high.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Voice noise method and system for data enhancement
  • Voice noise method and system for data enhancement
  • Voice noise method and system for data enhancement

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0026] In order to make the purpose, technical solutions and advantages of the embodiments of the present invention clearer, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0027] Such as figure 1 Shown is a flow chart of a speech noise adding method for data enhancement provided by an embodiment of the present invention, including the following steps:

[0028] S11: Using the speaker vector of the noise-free frequency as the condition of the conditional variational self-encoding model, input the speaker vector of th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The embodiment of the invention provides a voice noise adding method for data enhancement. The voice noise adding method includes the steps that speaker vectors of noise-free voice frequency and voicefrequency with noise are input into a conditional variational self-coding model, the vector mean and a variance vector outputted by a model encoder are sampled in a random gaussian distribution mode,and a noise implicit vector is obtained; the noise implicit vector and the noise-free voice frequency are input into the model, and simulation noise voice frequency is output by the model decoder; the model is based on the training conditions of the simulation noise voice frequency and the voice frequency with the noise, obtained varied noise implicit vectors are modeled, and the noise implicit variable space is obtained; and the noise implicit variable space is randomly sampled to be used as a noise adding implicit vector, the noise adding implicit vector and the voice frequency with the noise are input into the model decoder, and new voice frequency with the noise for data enhancement is obtained. The embodiment of the invention further provides a voice noise adding system for data enhancement. According to the voice noise adding method and system for data enhancement, the speaker vectors are modeled, through extraction of implicit space features, more varied noise data can be generated, and robustness of a voice recognition model is improved.

Description

technical field [0001] The invention relates to the field of voice recognition, in particular to a voice noise adding method and system for data enhancement. Background technique [0002] Over time, speech recognition technology has made great improvements, but when speech recognition is applied to environments with complex noise, it still has a certain impact on its recognition performance. In order to make the speech recognition model better applicable to various noise environments and improve the robustness and generalization ability of the speech recognition model to noise, it is usually trained with more noisy frequencies, because this way It is simple and effective, but it is difficult to obtain the noisy frequency suitable for training. For this reason, artificial noise is usually added to clean speech. For example, artificially collecting noise and then mixing the noise with clean speech yields more noisy frequencies suitable for training. [0003] In the process o...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L15/06G10L15/07G10L15/20
CPCG10L15/063G10L15/07G10L15/20G10L2015/0631
Inventor 俞凯钱彦旻吴章昊王帅
Owner AISPEECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products