Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Regitive audio sample generation method based on three-population parallel genetic algorithm

An audio sample and genetic algorithm technology, applied in the field of speech recognition, can solve the problems of constrained speech adversarial attack algorithm practicability, large computing resources and time overhead, weak research on adversarial samples, etc., to improve the global search ability, reduce the amount of calculation and The effect of strong time and global search ability

Pending Publication Date: 2022-04-19
SHANDONG UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

But on the one hand, compared with the generation of adversarial examples in the field of computer vision, it is more difficult to design adversarial examples in the field of speech; on the other hand, the current research on adversarial examples in the field of speech is relatively weak
At present, the adversarial attack algorithms in the voice field are all based on the optimized C&W attack algorithm design. This method often requires huge computing resources and time overhead, which seriously restricts the practicability of the current voice adversarial attack algorithms.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Regitive audio sample generation method based on three-population parallel genetic algorithm

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] The present invention is described in detail below in conjunction with the accompanying drawings and embodiments:

[0034] as Figure 1 As shown, the present invention is based on adversarial audio sample generation method based on three populations of parallel genetic algorithms, comprising the following steps:

[0035] A: Initialize each original speech file in the speech dataset to a raw audio sample in the form of a binary string; then proceed to Step B;

[0036] In the present invention, the speech recognition model employs tensorflow official speech_commands. Using this speech recognition model, 10 groups of speech files that have been classified by labels are trained for recognition, and the labels of each group of speech files are corresponding to an English word, such as go or stop, etc., and each group of speech files includes speech files that are pronounced the English word by different speakers. In the present embodiment, the number of each set of speech files ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an adversarial audio sample generation method based on a three-population parallel genetic algorithm. The adversarial audio sample generation method comprises the following steps: A, obtaining an original audio sample; b, obtaining a corresponding input sample, a main population and two auxiliary populations; c, the fitness score of each individual is calculated; d, sorting all individuals in the main population and the auxiliary population according to a sequence of fitness scores from low to high; e, using a speech recognition model to carry out classification recognition on all the sorted individuals in the main population in sequence; and F, performing intersection, variation and individual updating on the main population and the auxiliary population by using a three-population parallel genetic algorithm, and then returning to the step D. According to the method, the optimal solution meeting the requirement can be obtained through multiple iterations, the problem of unknowability of the target network and errors caused by Mel-frequency cepstrum coefficient conversion is fully solved, and the method has the advantages of being high in convergence speed, high in global search capability and high in convergence efficiency.

Description

Technical field [0001] The present invention relates to the field of speech recognition, in particular to an adversarial audio sample generation method based on three populations of parallel genetic algorithms. Background [0002] With the success of deep learning models in speech recognition applications, automatic speech recognition control systems such as Amazon Alexa, Google Voice Assistant, Apple Siri, Microsoft Cortana and iFLYTEK have been widely used in human-computer interaction, and have been successful in mobile devices, smart homes and other fields, especially in autonomous driving, voiceprint authentication and other high-level security application scenarios to achieve key applications. [0003] And recent studies have shown that neural networks are vulnerable to adversarial attacks. There is also a problem in the field of speech recognition, where an attacker adds a slight perturbation to the audio, causing the neural network to input a completely different value, ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G10L17/02G10L17/04G10L15/02G10L15/06G10L25/03G06K9/62G06N3/12
CPCG10L17/02G10L17/04G10L15/02G10L15/063G10L25/03G06N3/126G10L2015/0635G10L2015/0636G10L25/24G06F18/24G06F18/214
Inventor 徐东亮翟文升马骁刘志伟徐舜杨承林
Owner SHANDONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products