Confronting audio generation method and system for white-box scene

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
An audio and scene technology, applied in the field of adversarial sample generation, can solve the problems of poor attack effect and long time.

Active Publication Date: 2019-04-09

ZHEJIANG UNIV

View PDF5 Cites 13 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, the existing white-box confrontation audio generation methods are relatively rudimentary and time-consuming, and the attack effect is poor.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0052] The present invention will be further described in detail below with reference to the accompanying drawings and embodiments. It should be noted that the following embodiments are intended to facilitate the understanding of the present invention, but do not limit it in any way.

[0053] like figure 1 As shown in Fig. 1, a normal voice is still sounded as a normal voice by a malicious user after a small perturbation is carefully added, but it will actually be recognized as a malicious command by the automatic voice recognition system.

[0054] In an embodiment provided by the present invention, the confrontational audio generation system includes five modules: an audio data preprocessing module, an audio feature extraction module, an audio recognition module, a particle swarm optimization module, and a gradient deception optimization module. Its overall structure is as figure 2 As shown, the specific modules and the functions of each module are as follows:

[0055] 1. ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to the technical field of confronting sample generation, in particular to a confronting audio generation method and system for a white-box scene. The method can generate a high quality confronting audio efficiently. The method comprises the following steps: selecting a target attacking model and a source audio and setting an attacking target; pre-processing the source audio;extracting an MFCC characteristic of the source audio; recognizing the source audio by the target attacking model according to the MFCC characteristic to obtain a recognizing result, calculating a CTCloss function between the recognizing result and the attacking target and optimizing the function by means of particle swarm optimization, searching for the optimum noise, and adding the optimum noise into the source audio to obtain an intermediate audio and recognizing the noise by using the target attaching model; if the recognizing result is the same as the attacking target, the intermediate audio is the confronting audio; if the recognizing result is different from the attacking target, executing the next step; and searching for the optimum noise of the intermediate audio by using a gradient descent algorithm till the recognizing result is the same as the attacking target and finely adjusting the optimum noise and adding the intermediate audio to obtain the confronting audio.

Description

technical field [0001] The present invention relates to the technical field of adversarial sample generation, in particular to a method and system for adversarial audio generation for white-box scenarios. Background technique [0002] With the development of machine learning and artificial intelligence, machine learning models have become ubiquitous and have become the core technology in many artificial intelligence devices, such as speech recognition models in voice assistants (for example, Apple Siri, GoogleNow, and Amazon Echo), Speaker recognition models in smart voice locks, sound event classification models in acoustic surveillance systems and classification of pornographic videos. Despite machine learning's impressive performance, recent research has shown that the neural networks in machine learning models can be easily fooled by attackers, who can force the models to produce erroneous results or even targeted outputs. This attack method, called adversarial example ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G10L15/22G10L25/24G10L25/45G10L21/0208G10L15/06

CPCG10L15/06G10L15/22G10L21/0208G10L25/24G10L25/45

Inventor纪守领杜天宇李进锋陈建海

OwnerZHEJIANG UNIV

Confronting audio generation method and system for white-box scene

What is AI technical title? AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document. An audio and scene technology, applied in the field of adversarial sample generation, can solve the problems of poor attack effect and long time.

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
An audio and scene technology, applied in the field of adversarial sample generation, can solve the problems of poor attack effect and long time.

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology