GAN-based speech confrontation sample generation method

A technology against samples and samples, applied in speech analysis, neural learning methods, biological neural network models, etc., can solve the problems of target speech recognition network recognition errors, no guarantee of generated speech quality, and inability to guarantee the speech quality of confrontation samples, etc., to achieve Improve efficiency and ensure voice quality

Active Publication Date: 2020-07-31
NINGBO UNIV
View PDF10 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, Alantot only considers that the adversarial samples can successfully attack the target network, and does not guarantee the quality of the generated speech; the method adopted by Carlini must first convert the speech into MFCC, then modify the MFCC features through the gradient information returned by the speech recognition network, and finally convert the MFCC features Reconstructed into a speech signal, although this can make the target speech recognition network recognize errors, but it cannot guarantee the speech quality of the confrontation sample

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • GAN-based speech confrontation sample generation method
  • GAN-based speech confrontation sample generation method
  • GAN-based speech confrontation sample generation method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0034] Embodiments of the present invention are described in detail below, examples of which are shown in the drawings, wherein the same or similar reference numerals designate the same or similar elements or elements having the same or similar functions throughout. The embodiments described below by referring to the figures are exemplary only for explaining the present invention and should not be construed as limiting the present invention.

[0035] Such as Figures 1 to 3 Shown is a schematic structural diagram of a preferred embodiment of the present invention. The present invention adopts a GAN-based voice confrontation sample generation method, utilizes the game idea of ​​generative confrontation network, and designs a reasonable loss function to train a disturbance generator, through which the disturbance generator can quickly construct a voice with better quality and a successful attack Speech adversarial samples with high rate. Such as figure 1 Shown is a diagram of...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a GAN-based speech confrontation sample generation method, which is characterized by comprising the steps of preprocessing an original speech data sample x, inputting the preprocessed original speech data sample x into a generator G to obtain an adversarial disturbance G(x), and using a formula (1) to construct an adversarial sample, the formula (1) being xadv = x + G(x),inputting the adversarial sample xadv into a discriminator D, and inputting the adversarial sample xadv into a target network f after the adversarial sample xadv passes through a Mel-frequency cepstrum coefficient MFCC feature extractor, calculating the loss lf of the target network, the adversarial loss lGAN of the discriminator, the hinge loss lhinge, the mean square error loss l2 and the loss lD of the discriminator, thereby obtaining a loss function l when the generator G is trained, S4, updating parameters of a generator and a discriminator through gradient back propagation of the loss function l obtained in the S4, obtaining an optimal generator through a formula (10), loading an original sample x into the optimal generator obtained in the S5 through the formula (10), and constructing to obtain a corresponding adversarial sample. Thus, the minimum disturbance can be effectively generated, and the speech quality can be ensured.

Description

technical field [0001] The present invention relates to the field of speech technology, in particular to a GAN-based speech adversarial sample generation method. Background technique [0002] An adversarial example refers to a sample that is purposefully added with subtle perturbations. Its main purpose is to cause the performance of the deep neural network to fail, and even induce the deep learning network to make the judgment specified by the attacker. The method of constructing adversarial samples is actually a process of seeking the optimal perturbation. At present, the more common adversarial sample generation methods are divided into methods based on optimization against perturbation and methods based on perturbation. [0003] Using an optimization algorithm to find an adversarial perturbation is usually to set a target optimization function that satisfies the adversarial sample conditions, and find the optimal perturbation that satisfies the constraints; the method ba...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L25/24G10L25/30G06N3/04G06N3/08
CPCG10L25/24G10L25/30G06N3/08G06N3/045Y02T10/40
Inventor 王让定王冬华董理严迪群
Owner NINGBO UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products