Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Phoneme-level voiceprint recognition confrontation sample construction system and method based on neural network generative model

It is a technology of confronting samples and generating models, which is applied in biological neural network models, neural architectures, speech analysis, etc. It can solve the problems of lack of feasibility, non-transferability, and degradation of attack performance, and achieve low portability, high versatility, The effect of good performance

Pending Publication Date: 2022-02-25
ZHEJIANG UNIV
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Existing adversarial sample attacks in the voiceprint domain are divided into white-box attacks and black-box attacks: white-box attacks assume that the attacker knows the internal details of the target model, but the attack performance on unknown models is significantly reduced, so its practical significance is greatly limited. Large limitation; black-box attack assumes that the attacker has no prior knowledge about the target model, but requires a large number of query accesses to the target model to estimate the gradient, which is not feasible in practical attack scenarios
In addition, existing attack methods generate one-off adversarial samples for a single definite speech through complex iterative optimization algorithms, which have the disadvantages of deterministic input, time-consuming, and non-transferable

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Phoneme-level voiceprint recognition confrontation sample construction system and method based on neural network generative model
  • Phoneme-level voiceprint recognition confrontation sample construction system and method based on neural network generative model
  • Phoneme-level voiceprint recognition confrontation sample construction system and method based on neural network generative model

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047] The technical solutions of the present invention will be further described below by way of specific embodiments:

[0048] The present invention proposes a phonograph-based sound pattern based on a neural network generating model to identify a counter sample structure system and method. figure 1 The system frame diagram of the present invention is shown, and the entire process is divided into offline training phases and online attack phases, including 5 parts, ie pyro-recognizers, disturbance generators, a hearing sensor, an alternative classifier, and a system optimizer. Offline training phase, the phoneme identifier is decomposed by mandatory alignment techniques into a voice cord sequence, identifying all phonemes and locates its time period in speech; the disturbance generator defines all of the phonemes used, and passed A multi-layer neural network automatically generates a fixed dimension of each phoneme, and according to the phoneme and its location information obtain...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a phoneme-level voiceprint recognition confrontation sample construction system and method based on a neural network generative model. The system comprises a phoneme recognizer, a disturbance generator, a hearing suppressor, an alternative classifier, and a system optimizer, the suppressed disturbance generated by the hearing suppressor and the aligned voice generated by the phoneme recognizer are superposed to generate an adversarial sample, the adversarial sample is classified by the substitution classifier, scores are sent to the system optimizer, gradients are reversely propagated to the disturbance generator for iterative updating, and a trained phoneme disturbance generator is obtained. According to the method, the phoneme information in the voice is creatively fused to carry out disturbance construction, the phonemes in the voice are recognized and positioned by using the phoneme recognizer, and the fine-grained general adversarial disturbance is generated at the phoneme level, so that the adversarial disturbance generated in one step can be reused for any voice text input, finally, text-independent and input-independent general adversarial sample generation is realized, and the generation efficiency of the adversarial sample is greatly improved.

Description

Technical field [0001] The present invention relates to a sound pattern identification and a counterproductive sample, and Background technique [0002] As the most natural and direct communication method of human being, speech has always placed an important role in human-computer interaction and individual identification areas, and gradually become a popular biometric technology. At the same time, it is benefited from the rapid development of deep learning theory and technology, and the sound pattern recognition is applied to a variety of mature products (such as voice assistants, sound locks, etc.) to provide intelligent services for people's work. Studies have shown that 2020 global speech biological identification markets exceed 1.1 million US dollars and is expected to reach $ 3.9 million in 2026, which fully shows the broad development prospects of voice technology. However, after the bright foreground of the sound identification, the deep study of the shadow that is suscep...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L17/18G10L17/04G06N3/04
CPCG10L17/18G10L17/04G06N3/045
Inventor 卢立巴钟杰任奎其他发明人请求不公开姓名
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products