A speech recognition model training method against adversarial sample attack

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A speech recognition model and adversarial sample technology, applied in speech recognition, speech analysis, instruments, etc., can solve problems such as the inability of the human ear to recognize disturbances, and neural network adversarial attacks.

Active Publication Date: 2022-04-12

SHANDONG UNIV

View PDF8 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0003] While recent studies have shown that neural networks are vulnerable to adversarial attacks

This problem also exists in the field of speech recognition. The attacker adds a slight disturbance to the audio, which will cause the neural network to input completely different values, but the human ear cannot recognize the slight disturbance.

As the voice confrontation attacks that have appeared in recent years have become more and more aggressive and more and more types, the security issues in the field of voice recognition have become prominent, and it has also laid certain hidden dangers for the large-scale commercialization of voice recognition technology.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0046] Below in conjunction with accompanying drawing and embodiment the present invention is described in detail:

[0047] like figure 1 As shown, the speech recognition model training method against adversarial sample attack of the present invention comprises the following steps:

[0048] A: Select a data set consisting of N audio files, and record the Chinese character corresponding to the audio file as y;

[0049] In the present invention, the Chinese speech recognition framework and the Chinese speech data set Free ST-Chinese-Mandarin-Corpus are adopted. The Free ST-Chinese-Mandarin-Corpus dataset consists of N audio files. Each audio file is a sentence read by a reader. Each sentence contains about ten Chinese characters. The corresponding audio files Chinese characters are recorded as y.

[0050] B: Sampling the voice files in the data set selected in step A to obtain the collection tensor;

[0051] When preprocessing the voice file, first cut off the invalid part o...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a speech recognition model training method against adversarial sample attack, comprising the following steps: A: selecting a data set composed of speech files, B: sampling the speech files to obtain the collection tensor; C: collecting the collection tensor Extract the feature tensor; D: Input the feature tensor, the Chinese characters corresponding to the voice file, and the training parameters into the convolutional neural network model, and use the CTC function to map the output tensor to a different output at each time step in combination with the acoustic model and the language model. The probability of Chinese characters, and then select the maximum value of the probability to transcribe the corresponding Chinese characters as the actual output value; finally calculate the deviation value between the target value and the actual output value and record it as loss; E: Use the backpropagation method to update the convolutional neural network model The weight parameters of , update the training parameters of the input convolutional neural network model while updating the weight parameters. The invention can reduce the amount of training calculation and training time, and improve the ability of resisting malicious attacks.

Description

technical field [0001] The invention relates to the field of speech recognition, in particular to a speech recognition model training method against adversarial sample attacks. Background technique [0002] With the development of deep learning and neural network research, the application scenarios of automatic identification are becoming more and more extensive. In the field of speech, deep learning is reshaping the way we interact with computers, such as the personal assistants widely used in smartphones (Apple's Siri, Google's Assistant). These systems run speech recognition models to recognize and execute user commands. In fact, the research on automatic speech recognition is earlier than the appearance of computer, and the technology of speech synthesis and recognition can be traced back to the original vocoder. It was not until the new century that the research on artificial intelligence networks emerged in the field of speech recognition. Most artificial intelligenc...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G10L15/06G10L15/16G10L15/26G10L25/24

CPCG10L15/063G10L15/16G10L25/24

Inventor 徐东亮翟文升刘志伟

Owner SHANDONG UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

A speech recognition model training method against adversarial sample attack

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology