Audio tampering recognition algorithm based on improved neural network

A neural-network-based recognition algorithm applied to the field of audio tampering. It addresses the scarcity of research on audio tampering recognition, improves the recognition rate and the robustness of the model, and has good application prospects.

Active Publication Date: 2020-02-28
NANJING INST OF TECH

AI Technical Summary

Problems solved by technology

First, the features used for audio tampering recognition have not been studied sufficiently. Second, existing audio tampering recognition models are all traditional signal-processing models; machine learning and deep learning are rarely used for the analysis.

Examples

Embodiment Construction

[0042] The present invention will be further described below in conjunction with the accompanying drawings.

[0043] As shown in Figures 1 to 3, the audio tampering recognition model based on the improved neural network of the present invention comprises the following steps.

[0044] In step A, the Mel spectrogram and frame-level features are extracted from each audio clip; these are the inputs to Model 1 and Model 2, respectively.
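The frame-level branch of step A can be illustrated with a minimal sketch. The text available here does not say which frame-level features the patent uses, so short-time energy and zero-crossing rate are stand-ins chosen for illustration; the function names, the 16 kHz sample rate, and the 25 ms window with 10 ms hop are likewise assumptions.

```python
import math

def frame_signal(samples, frame_len, hop):
    """Split a list of samples into overlapping frames (a trailing partial frame is dropped)."""
    return [samples[i:i + frame_len]
            for i in range(0, len(samples) - frame_len + 1, hop)]

def frame_features(frame):
    """Return (short-time energy, zero-crossing rate) for one frame.

    These two features are illustrative assumptions, not the patent's feature set.
    """
    energy = sum(x * x for x in frame) / len(frame)
    crossings = sum(1 for a, b in zip(frame, frame[1:]) if (a >= 0) != (b >= 0))
    return energy, crossings / (len(frame) - 1)

# Hypothetical input: 0.5 s of a 440 Hz tone sampled at 16 kHz.
sr = 16000
signal = [math.sin(2 * math.pi * 440 * n / sr) for n in range(sr // 2)]

frames = frame_signal(signal, frame_len=400, hop=160)  # 25 ms window, 10 ms hop
features = [frame_features(f) for f in frames]         # one feature tuple per frame
```

Each clip yields one feature tuple per frame, so the sequence length grows with the clip duration, which is the kind of input a recurrent model such as an LSTM can consume.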

[0045] Model 1 takes the Mel spectrogram as input because the Mel spectrogram of speech carries a large amount of information about the characteristics of the utterance; it combines properties of the spectrogram and the time-domain waveform, showing how the speech spectrum changes over time. Because each utterance has a different length, the size of the extracted spectrogram varies with the length of the speech, so all of the information in the speech is preserved.
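To make the dependence of spectrogram size on clip length concrete, here is a small sketch. The Mel-scale formula is one common convention (HTK style), and the sample rate, window, and hop values are assumed for illustration; none of these parameters come from the patent.

```python
import math

def hz_to_mel(f_hz):
    """Convert frequency in Hz to mels (HTK-style formula, an assumed convention)."""
    return 2595.0 * math.log10(1.0 + f_hz / 700.0)

def num_frames(duration_s, sr=16000, frame_len=400, hop=160):
    """Number of full analysis frames, i.e. spectrogram columns, for a clip."""
    n = int(round(duration_s * sr))
    return 0 if n < frame_len else 1 + (n - frame_len) // hop

# The number of Mel bands is fixed, but the number of columns grows with duration.
for duration in (1.0, 2.5, 4.0):
    print(duration, "s ->", num_frames(duration), "frames")
```

Under these assumptions the frequency axis has a fixed height while the time axis varies from clip to clip, which is why a fixed-input CNN cannot consume the spectrogram directly.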

[0046] In addition, in the second model, the ...

Abstract

The invention discloses an audio tampering recognition algorithm based on an improved neural network. The algorithm combines a CNN structure, which pools a spectrogram of any size into a fixed-length representation, with an LSTM structure that has an attention mechanism. The Mel spectrogram and frame-level features of the signal are introduced into the voice tampering recognition algorithm, integrating the spectral and timing information of the audio signal. An improved pooling layer is added to the CNN structure so that the CNN can accept spectrograms of any size, which solves the problem of variable audio length. The attention mechanism learns the relative weights of the high-level features, so that high-quality audio features are finally obtained. Decision fusion based on data fusion theory is then applied, improving the recognition rate and the robustness of the model. The method recognizes audio tampering effectively and overcomes the relatively low recognition rate of traditional audio tampering recognition.
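The abstract does not describe how the improved pooling layer works. One well-known way to obtain a fixed-length output from a variable-size input is pyramid-style max pooling over the time axis, sketched below as an assumption rather than as the patent's method. The function name and the `levels` parameter are hypothetical.

```python
import math

def pyramid_max_pool(spec, levels=(1, 2, 4)):
    """Pool a (bands x frames) spectrogram into a fixed-length vector.

    For each level with b bins, the time axis is split into b near-equal
    segments and the maximum of each band within each segment is kept.
    The output length is bands * sum(levels), whatever the frame count.
    """
    n_frames = len(spec[0])
    pooled = []
    for bins in levels:
        for i in range(bins):
            start = (i * n_frames) // bins
            # Guarantee a non-empty segment even when frames < bins.
            end = max(start + 1, math.ceil((i + 1) * n_frames / bins))
            for band in spec:
                pooled.append(max(band[start:end]))
    return pooled

# Two hypothetical clips with 2 bands each but different frame counts.
short_clip = [[1, 2, 3], [4, 5, 6]]
long_clip = [[float(t) for t in range(50)], [float(-t) for t in range(50)]]
print(len(pyramid_max_pool(short_clip)), len(pyramid_max_pool(long_clip)))
```

Both clips map to vectors of the same length, so a fully connected layer placed after the pooling step sees a fixed input size regardless of audio duration.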

Description

Technical field

[0001] The invention belongs to the field of audio tampering, and in particular relates to an audio tampering recognition algorithm based on an improved neural network.

Background technique

[0002] The increasing maturity of digital audio editing technology has undermined the authenticity and integrity of digital audio. When falsified audio is used as evidence in court, it can greatly affect the judgment of the case. Determining whether audio has been tampered with is therefore an urgent problem for the relevant judicial departments.

[0003] In 2005, C. Grigoras found that recordings made with mains-powered equipment contain a power grid frequency component, and extracted the power grid frequency characteristics from the audio under test to match and compare them with the data in the power supply department's power grid frequency characteristic database. With a high degree of similarity, it is proposed for the first t...

Application Information

IPC(8): G10L17/04, G10L17/18, G10L17/14, G10L25/24
CPC: G10L17/04, G10L17/18, G10L17/14, G10L25/24
Inventor: 包永强, 梁瑞宇, 唐闺臣, 王青云, 冯月芹, 朱悦
Owner: NANJING INST OF TECH