Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Method and system for adversarial audio generation for white-box scenarios

An audio and scene technology, applied in the field of adversarial sample generation, can solve the problems of poor attack effect and long time consumption, and achieve the effect of improving efficiency and enhancing robustness

Active Publication Date: 2022-03-25
ZHEJIANG UNIV
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the existing white-box confrontation audio generation methods are relatively rudimentary and time-consuming, and the attack effect is poor.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method and system for adversarial audio generation for white-box scenarios
  • Method and system for adversarial audio generation for white-box scenarios
  • Method and system for adversarial audio generation for white-box scenarios

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0052] The present invention will be described in further detail below with reference to the accompanying drawings and examples, and the embodiments are intended to facilitate the understanding of the present invention, but will not be defined.

[0053] Such as figure 1 As shown, a normal voice is carefully added with minimal disturbance by malicious users, which is still normal voice, but it is actually identified as malicious commands by the automatic voice recognition system.

[0054] In one embodiment of the invention, the anti-audio generation system includes five modules: audio data pretreatment module, audio feature extraction module, audio identification module, particle swarm optimization module, and gradient spoofing optimization module. Its overall architecture figure 2 As shown, the specific modules and each module function is as follows:

[0055] 1, audio data pretreatment module

[0056] This module is mainly preprocessing the input audio data. In general, the audio ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present invention relates to the technical field of adversarial sample generation, and in particular to a method and system for adversarial audio generation for white-box scenarios. The method can efficiently generate high-quality adversarial audio, including: selecting a target attack model and source audio and setting an attack Target; preprocess the source audio; extract the MFCC features of the source audio; the target attack model identifies the source audio according to the MFCC features, obtains the recognition result, calculates the CTC loss function between the recognition result and the attack target, and uses the particle swarm optimization algorithm Optimization, looking for the best noise, adding the best noise to the source audio to get the intermediate audio and using the target attack model to identify; if the recognition result is the same as the attack target, the intermediate audio is the confrontation audio; if the recognition result is different from the attack target, then Execute the next step; use the gradient descent algorithm to find the best noise in the middle audio until the recognition result is the same as the attack target, fine-tune the best noise and add the middle audio to get the confrontation audio.

Description

Technical field [0001] The present invention relates to the field of antagonist production, and in particular, to a counter-antiferous generation method and system for white box scenes. Background technique [0002] With the development of machine learning and artificial intelligence, the machine learning model becomes ubiquitous and has become a core technology in many artificial intelligence devices, such as speech recognition models in a voice assistant (for example, Apple Siri, Googlenow, and Amazon Echo). Sound Event Classification Model and Video Classification of Sound Event Classification Models and Video in Intelligent Significance Locks. Although machine learning is excellent, the most recent research shows that the neural network in the machine learning model is easily fooled by an attacker, and they can force the model to generate the result of errors even produce targeted output. This attack method that is called a case-assault is proved to have a high attack success...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L15/22G10L25/24G10L25/45G10L21/0208G10L15/06
CPCG10L15/06G10L15/22G10L21/0208G10L25/24G10L25/45
Inventor 纪守领杜天宇李进锋陈建海
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products