Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Speech confrontation sample generation method and device, electronic equipment and storage medium

A technology against samples and speech, applied in speech analysis, neural learning methods, biological neural network models, etc. The effect of detection ability

Active Publication Date: 2022-05-24
INST OF AUTOMATION CHINESE ACAD OF SCI
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

When generating speech adversarial samples, the speech synthesis model usually selects only one speech acoustic feature for acoustic model modeling, and uses a vocoder to reconstruct the parameters into a speech waveform. When the acoustic parameters used by the detection model are inconsistent, because the parameters of the detection features used to generate speech are quite different from the real speech, it is very easy to be detected by the speech generation detection model, and it is impossible to deceive the speech generation detection system
[0003] In addition, the existing technology mainly generates speech adversarial samples by adding random perturbation to the error threshold and clamping the error, which belongs to the passive addition of adversarial samples. Although the speech generation detection model can be deceived to a certain extent, the added noise is easy to cause The sense of hearing of the generated speech is reduced, and it is easy to be recognized and detected from the subjective perspective of humans, and this method does not start from the speech generation detection mechanism, and the generation of adversarial samples is too limited, and can only effectively deceive part of the given speech generation detection model

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Speech confrontation sample generation method and device, electronic equipment and storage medium
  • Speech confrontation sample generation method and device, electronic equipment and storage medium
  • Speech confrontation sample generation method and device, electronic equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0051] In order to make the purposes, technical solutions and advantages of the embodiments of the present disclosure clearer, the technical solutions in the embodiments of the present disclosure will be described clearly and completely below with reference to the accompanying drawings in the embodiments of the present disclosure. Obviously, the described embodiments These are some, but not all, embodiments of the present disclosure. Based on the embodiments in the present disclosure, all other embodiments obtained by those of ordinary skill in the art without creative work fall within the protection scope of the present disclosure.

[0052] see figure 1, an embodiment of the present disclosure provides a method for generating a speech confrontation sample, including the following steps:

[0053] S1, receive target text, and extract text feature sequences from the target text;

[0054] In practical applications, for the target text of the adversarial sample to be generated, ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present disclosure relates to a method and device for generating speech confrontation samples, electronic equipment and a storage medium. The method includes: receiving a target text, and extracting a text feature sequence from the target text; inputting the text feature sequence into a pre-trained The acoustic model of the multidimensional acoustic parameter sequence is obtained; the multidimensional acoustic parameter sequence is input into the pre-trained vocoder model to generate a time-domain sample sequence of speech as an adversarial sample corresponding to the target text, and the output of the acoustic model is a multidimensional acoustic parameter sequence, so that the generated speech content can guarantee high similarity (matching) under the description of various acoustic feature dimensions. Detection ability, more effectively deceive the speech generation detection model.

Description

technical field [0001] The present disclosure relates to the field of speech technology, and in particular, to a method and apparatus for generating a speech confrontation sample, an electronic device, and a storage medium. Background technique [0002] At present, in order to capture more discriminative information, the speech generation detection model uses a variety of acoustic features for speech signal processing, and the acoustic features for speech generation detection are directly fed into the model or used as a basis for discrimination. When generating speech adversarial samples, the speech synthesis model usually selects only one type of speech acoustic feature to model the acoustic model, and uses a vocoder to reconstruct the parameters into speech waveforms. When the acoustic parameters used by the detection model are inconsistent, because the parameters of the detection features used to generate speech are quite different from the real speech, they are easily de...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G10L19/16G10L25/24G10L25/30G06N3/08
Inventor 傅睿博陶建华易江燕
Owner INST OF AUTOMATION CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products