Speech adversarial sample generation method

A technology for adversarial samples and speech, applied in speech analysis, speech recognition, neural learning methods, etc., can solve the problem that adversarial samples are easy to be identified by humans, and achieve the effect of increasing generalization ability and accelerating convergence.

Active Publication Date: 2019-10-25
XI AN JIAOTONG UNIV
View PDF7 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0010] The present invention studies the DeepSpeech speech recognition system based on the cyclic neural network model structure, and proposes a targeted speech adversarial samp...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech adversarial sample generation method
  • Speech adversarial sample generation method
  • Speech adversarial sample generation method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] The present invention will be described in detail below in conjunction with the accompanying drawings and specific embodiments, where the schematic embodiments and descriptions of the present invention are used to explain the present invention, but not to limit the present invention.

[0042] The overall implementation process of this algorithm is as follows: figure 1 shown, including the following steps:

[0043] 1) Read the input voice data, and perform preprocessing operation on the input voice data, and extract the voice characteristic value of the input voice data. The input voice data format is .wav, the sampling frequency is 16khz, and the numerical precision is 16-bit signed number, that is, the voice data value is [-2 15 ,2 15 -1], the reading method adopts the scipy.io.wavfile module in the scipy library, which is expressed in the form of an array in python, and the speech feature value of the input speech data is extracted using the mfcc algorithm, which is...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a speech adversarial sample generation method, which comprises the steps of reading input speech data, performing preprocessing operation on the input speech data, and extracting a speech feature value of the input speech data; loading a deep neural network model and parameters of the DeepSpeech speech recognition system; meanwhile, the extracted speech characteristic valueis input into a DeepSpeech speech system; calculating probability distribution of a recognition result of each frame of input speech data, initializing an error value and an error threshold value according to CTC Loss between the result and a given target value, and performing clamping operation on the error value and the generated adversarial sample; constructing a loss function of a speech adversarial sample generation algorithm, iterating for multiple times, and updating an error value; and if the identification result of the generated adversarial sample is a given target value, reducing an error threshold to update a threshold error, continuing iteration until iteration is finished, and outputting a result. The similarity between the adversarial sample generated by the algorithm and the original sample is higher.

Description

technical field [0001] The invention belongs to an algorithm for generating an adversarial sample in the field of deep learning security, in particular to a method for generating a voice adversarial sample. Background technique [0002] In recent years, with the development of deep neural networks, deep learning has been gradually applied to various fields, especially in computer vision, speech recognition, natural language processing, etc., which have reached or even surpassed human capabilities. At the same time, the security issues brought about by deep learning are getting more and more attention. Among them, the generation method of adversarial examples has gradually become a hot issue in the field of deep learning technology security. Adversarial samples refer to the samples that the deep neural network model will make wrong judgments after adding small perturbations that are difficult to distinguish by human senses on the original data that the deep neural network mo...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G10L15/16G10L15/06G10L15/02G10L25/24G06N3/04G06N3/08
CPCG10L15/16G10L15/063G10L15/02G10L25/24G06N3/08G06N3/045
Inventor 张国和匡泽杰朱聚卿梁峰
Owner XI AN JIAOTONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products