Sound event detection method based on full convolutional network

An event detection, convolutional network technology, applied in neural learning methods, biological neural network models, speech analysis, etc., can solve the problems of high algorithm time complexity, long training time, long network training time, etc., to achieve time complexity The effect of low, improved accuracy, and reduced training time

Active Publication Date: 2020-11-24
XIDIAN UNIV
View PDF4 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, because the RNN network cannot process data in parallel, the time complexity of the algorithm is too high, and the network training time is too long
[0007] Although the above-mentioned existing methods can complete the multi-audio event detection task to a certain extent, there are still the following problems: 1) CRNN network captures time series information through LSTM and other RNN networks, but the maximum dependency length that LSTM can effectively capture is Only between 30 and 80, and it is a single scale information
This makes it difficult for RNN-like networks to complete localization and classification at the same time, and the accuracy is low when performing audio event detection tasks with different durations
2) The RNN network is difficult to process in parallel, and the algorithm time complexity is high, resulting in too long network training time

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sound event detection method based on full convolutional network
  • Sound event detection method based on full convolutional network
  • Sound event detection method based on full convolutional network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] The specific embodiments and effects of the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0024] refer to figure 1 , the multi-audio event detection method of this example comprises the following steps:

[0025] Step 1, preprocessing the audio stream signal to obtain a data set.

[0026] In order to realize the effective time-frequency feature extraction of the original audio stream signal, this example uses the Mel cepstrum, a feature extraction method commonly used in the audio field.

[0027] like figure 2 As shown, the specific implementation of this step is as follows:

[0028] 1.1) The original audio stream signal is divided into frames, the length of each frame is 40ms, and the time overlap rate between frames is 50%;

[0029] 1.2) First perform Fourier transform on each frame of audio segment obtained, and then stack the Fourier transform results of each frame along the time dimension to obtain ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a sound event detection method based on a full convolutional neural network, and mainly solves the problems of low multi-audio event detection precision and high time complexity of an existing network. According to the implementation scheme, the method comprises the following steps of 1) performing Mel cepstrum feature extraction on an audio stream to obtain time-frequencyfeature graphs of the audio stream, and forming a training data set by using the time-frequency feature graphs; 2) establishing a full convolution multi-audio event detection network composed of a frequency convolution network, a time convolution network and a decoding convolution network from top to bottom; 3) training the full convolution multi-audio event detection network by using the data set; and 4) inputting the audio stream to be detected into the trained full convolution multi-audio event detection network for multi-audio event detection to obtain the category of the audio event and the existing starting and ending time. Simulation results show that compared with an existing network 3D-CRNN with the highest precision, the precision of the method is improved by 2%, the operation speed is improved by about 5 times, and the method can be used for safety monitoring.

Description

technical field [0001] The invention belongs to the technical field of event detection, in particular to a multi-audio event detection method, which can be used for security monitoring. Background technique [0002] Audio event detection refers to locating sound events of interest in an audio stream and classifying them correctly. Audio information is more practical than image information in the fields of disaster relief, gunshot monitoring, etc. In addition, audio information can also assist video information to complete video surveillance, search tasks, etc., so it is of great practical significance to realize audio event detection. [0003] Sound event detection on real audio streams obtained in real life has always been a very challenging task, because first of all, in reality, different events are highly concurrency, and different sound events often occur at the same time, which requires audio events The detection system can identify multiple audio events from the alia...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L25/51G10L25/18G10L25/24G10L25/30G06K9/62G06N3/04G06N3/08
CPCG10L25/51G10L25/24G10L25/30G10L25/18G06N3/08G06N3/048G06N3/045G06F18/24Y02A90/10
Inventor 赵光辉张雨萌王迎斌石光明
Owner XIDIAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products