
Universal targeted speech adversarial sample generation method, system, medium, and device

A technology concerning adversarial samples and speech, applied in speech analysis, speech recognition, instruments, etc. It addresses the problems that existing methods ignore the attenuation and distortion of speech during propagation and are not practical enough, with the effect of improving robustness and producing universal targeted speech perturbations that work in the physical world.

Pending Publication Date: 2022-07-01
XI AN JIAOTONG UNIV
Cites: 0; cited by: 1

AI Technical Summary

Problems solved by technology

Introducing a speech perturbation adds audible noise, and sound attenuates (among other effects) as it propagates, so a generated speech perturbation cannot be used directly in the physical world.
To address the noise problem, existing methods generate a specific perturbation for each piece of raw data, which is not very practical. Some methods can generate universal perturbations, but they do not consider the distortion that speech undergoes while propagating in the real world, and they generate only untargeted perturbations, which are not practical enough and are easily detected.
To achieve physical speech perturbations, existing methods only add random noise during training and do not account for the attenuation, distortion, and reflection that occur as speech propagates through the air, so their robustness is weak.
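
The gap described above (random noise only, with no attenuation or reflection) can be illustrated with a minimal Python sketch. It assumes, purely for illustration, that over-the-air propagation is approximated by convolving the perturbation with an impulse response and then adding noise; the source does not state how the invention models propagation. The names `convolve`, `simulate_propagation`, and `toy_ir`, and all numeric values, are hypothetical.

```python
import random


def convolve(signal, impulse_response):
    """Full discrete convolution of two sequences of floats."""
    out = [0.0] * (len(signal) + len(impulse_response) - 1)
    for i, s in enumerate(signal):
        for j, h in enumerate(impulse_response):
            out[i + j] += s * h
    return out


def simulate_propagation(perturbation, impulse_response, noise_std=0.0, rng=None):
    """Hypothetical over-the-air channel: convolve, then add Gaussian noise.

    The impulse response carries attenuation (taps smaller than 1) and
    reflections (delayed taps); the additive noise stands in for ambient sound.
    """
    rng = rng or random.Random(0)
    received = convolve(perturbation, impulse_response)
    return [x + rng.gauss(0.0, noise_std) for x in received]


# Toy impulse response (assumed values, not measured): an attenuated direct
# path (0.6) plus one weaker reflection arriving two samples later (0.2).
toy_ir = [0.6, 0.0, 0.2]
delta = [1.0, 0.0, 0.0, 0.0]

# With noise disabled, the output shows the attenuated pulse and its echo.
received = simulate_propagation(delta, toy_ir, noise_std=0.0)
```

Training a perturbation against many such simulated channels, rather than against additive noise alone, is one plausible way to make it survive playback; whether the invention does this in the same way is not confirmed by the text available here.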


Examples


Detailed Description of the Embodiments

[0054] The technical solutions in the embodiments of the present invention will be described clearly and completely below with reference to the accompanying drawings. Obviously, the described embodiments are some, but not all, of the embodiments of the present invention. All other embodiments obtained by those of ordinary skill in the art on the basis of these embodiments without creative effort shall fall within the protection scope of the present invention.

[0055] In the description of the present invention, it is to be understood that the terms "comprising" and "including" indicate the presence of the described features, integers, steps, operations, elements, and/or components, but do not exclude the presence or addition of one or more other features, integers, steps, operations, elements, components, and/or collections thereof.

[0056] It should also be understood that the terminology used in t...


Abstract

The invention discloses a method, system, medium, and device for generating universal targeted speech adversarial samples. The method designs a targeted optimization loss function to make the speech perturbation universal: it minimizes the confidence that a sample is classified as its original correct category and maximizes the confidence that it is classified as the target category, improving the accuracy of classification into the target category. The decibel difference between the speech perturbation and the original speech is also introduced into the loss function to limit the size of the perturbation, and the lp norm of the perturbation is constrained to a ball of specified radius. The perturbation is masked using everyday noise and psychoacoustic principles, and the influence of sound propagation through the air is taken into account when the perturbation is generated, so that the resulting universal perturbation remains applicable in the physical world. Adding the universal perturbation to any original speech command yields a universal targeted speech adversarial sample, which a speech command classifier based on a convolutional neural network misrecognizes as the specified target category. The method is significant for research on the robustness of deep neural networks.
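
The loss terms named in the abstract can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the invention's actual formulation: the confidence term is assumed to be a difference of log-probabilities, the decibel level is assumed to be the peak level 20·log10(max|x|), the lp ball is taken with p = ∞, and the weight `alpha` and all function names are hypothetical.

```python
import math


def softmax(logits):
    """Numerically stable softmax over a list of class scores."""
    m = max(logits)
    exps = [math.exp(z - m) for z in logits]
    total = sum(exps)
    return [e / total for e in exps]


def db_level(signal):
    """Peak level in decibels (assumed definition): 20 * log10(max |x|)."""
    peak = max(abs(x) for x in signal)
    return 20.0 * math.log10(max(peak, 1e-12))


def targeted_loss(logits, original_class, target_class, perturbation, speech,
                  alpha=0.01):
    """Lower is better for the attacker.

    Confidence term: pushes down the original class and pulls up the target.
    Loudness term: decibel difference between perturbation and speech.
    """
    probs = softmax(logits)
    confidence = math.log(probs[original_class]) - math.log(probs[target_class])
    loudness = db_level(perturbation) - db_level(speech)
    return confidence + alpha * loudness


def project_linf(perturbation, epsilon):
    """Clip each sample so the perturbation stays inside the l-infinity ball."""
    return [max(-epsilon, min(epsilon, d)) for d in perturbation]


speech = [0.5, -1.0, 0.25]
delta = project_linf([0.5, -0.3, 0.05], epsilon=0.1)

# Classifier scores favouring the original class (index 0) versus the
# target class (index 2); the loss drops once the target class dominates.
loss_before = targeted_loss([5.0, 0.0, 0.0], 0, 2, delta, speech)
loss_after = targeted_loss([0.0, 0.0, 5.0], 0, 2, delta, speech)
```

In a universal attack the same `delta` would be optimized over many speech commands at once, with the projection applied after each update so the norm constraint always holds.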

Description

Technical field

[0001] The invention belongs to the field of security technology based on deep learning, and in particular relates to a method, system, medium, and device for generating universal targeted speech adversarial samples.

Background

[0002] In recent years, with the continuous improvement of the robustness of deep neural networks, many applications based on deep learning have emerged one after another, involving images, speech, text, and other fields. However, recent studies have found that applications based on deep neural networks are prone to misidentifying adversarial sample data. An adversarial sample is original data to which a tiny perturbation, imperceptible to human senses, has been added so that the model identifies (classifies) it incorrectly.

[0003] Specifically, according to whether the model structure of the network is known in advance, it is divided into white-box adversarial samples and b...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L15/22, G10L15/20, G10L15/06
CPC: G10L15/22, G10L15/20, G10L15/063, G10L2015/223
Inventors: 王宝旺, 丁菡, 赵衰, 翟临威, 王鸽, 惠维, 赵鲲, 赵季中
Owner: XI AN JIAOTONG UNIV