Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Confronting audio generation method and system for white-box scene

An audio and scene technology, applied in the field of adversarial sample generation, can solve the problems of poor attack effect and long time.

Active Publication Date: 2019-04-09
ZHEJIANG UNIV
View PDF5 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the existing white-box confrontation audio generation methods are relatively rudimentary and time-consuming, and the attack effect is poor.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Confronting audio generation method and system for white-box scene
  • Confronting audio generation method and system for white-box scene
  • Confronting audio generation method and system for white-box scene

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0052] The present invention will be further described in detail below with reference to the accompanying drawings and embodiments. It should be noted that the following embodiments are intended to facilitate the understanding of the present invention, but do not limit it in any way.

[0053] like figure 1 As shown in Fig. 1, a normal voice is still sounded as a normal voice by a malicious user after a small perturbation is carefully added, but it will actually be recognized as a malicious command by the automatic voice recognition system.

[0054] In an embodiment provided by the present invention, the confrontational audio generation system includes five modules: an audio data preprocessing module, an audio feature extraction module, an audio recognition module, a particle swarm optimization module, and a gradient deception optimization module. Its overall structure is as figure 2 As shown, the specific modules and the functions of each module are as follows:

[0055] 1. ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the technical field of confronting sample generation, in particular to a confronting audio generation method and system for a white-box scene. The method can generate a high quality confronting audio efficiently. The method comprises the following steps: selecting a target attacking model and a source audio and setting an attacking target; pre-processing the source audio;extracting an MFCC characteristic of the source audio; recognizing the source audio by the target attacking model according to the MFCC characteristic to obtain a recognizing result, calculating a CTCloss function between the recognizing result and the attacking target and optimizing the function by means of particle swarm optimization, searching for the optimum noise, and adding the optimum noise into the source audio to obtain an intermediate audio and recognizing the noise by using the target attaching model; if the recognizing result is the same as the attacking target, the intermediate audio is the confronting audio; if the recognizing result is different from the attacking target, executing the next step; and searching for the optimum noise of the intermediate audio by using a gradient descent algorithm till the recognizing result is the same as the attacking target and finely adjusting the optimum noise and adding the intermediate audio to obtain the confronting audio.

Description

technical field [0001] The present invention relates to the technical field of adversarial sample generation, in particular to a method and system for adversarial audio generation for white-box scenarios. Background technique [0002] With the development of machine learning and artificial intelligence, machine learning models have become ubiquitous and have become the core technology in many artificial intelligence devices, such as speech recognition models in voice assistants (for example, Apple Siri, GoogleNow, and Amazon Echo), Speaker recognition models in smart voice locks, sound event classification models in acoustic surveillance systems and classification of pornographic videos. Despite machine learning's impressive performance, recent research has shown that the neural networks in machine learning models can be easily fooled by attackers, who can force the models to produce erroneous results or even targeted outputs. This attack method, called adversarial example ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L15/22G10L25/24G10L25/45G10L21/0208G10L15/06
CPCG10L15/06G10L15/22G10L21/0208G10L25/24G10L25/45
Inventor 纪守领杜天宇李进锋陈建海
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products