Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Sound event detection method based on hole convolution recurrent neural network

A technology of cyclic neural network and convolutional neural network, which is applied in speech analysis, speech recognition, instruments, etc., can solve the problems of over-fitting network generalization ability and decline, so as to avoid over-fitting problems, high detection accuracy, The effect of improving generalization ability

Active Publication Date: 2020-08-28
SOUTH CHINA UNIV OF TECH
View PDF4 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003]The existing mainstream sound event detection method based on convolutional neural network has the following shortcomings: In order to increase the receptive field and capture the longer context information of the input audio features, it is necessary to Increasing the number of convolutional layers of the network makes the scale of network parameters very large, which is easy to cause over-fitting problems (the generalization ability of the network decreases)

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Sound event detection method based on hole convolution recurrent neural network
  • Sound event detection method based on hole convolution recurrent neural network
  • Sound event detection method based on hole convolution recurrent neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0046] figure 1 It is a flowchart of an embodiment of a sound event detection method based on a hole convolutional cyclic neural network, and the sound event detection method includes the following steps:

[0047] S1. Extracting logarithmic mel spectrum features: pre-emphasizing, framing, and windowing the audio samples, and then extracting the logarithmic mel spectrum of each audio frame;

[0048] In this embodiment, extracting logarithmic mel spectrum features in step S1 specifically includes the following steps:

[0049] S1.1, pre-emphasis: read audio samples, use digital filters for pre-emphasis, where the transfer function of the digital filter is H(z)=1-αz -1 , where α is the filter coefficient and its value is: 0.9≤α≤1;

[0050] S1.2, framing and windowing: divide the read audio samples into frames, the frame length is 0.02s, the frame shift is 0.01s, and the signal of each frame is x′ t (n), the window function is the Hamming window ω(n), and each frame signal x′ t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a sound event detection method based on a cavity convolution recurrent neural network. The method comprises the following steps: extracting logarithm Mel spectrum characteristics of each sample, constructing a cavity convolution recurrent neural network, wherein the cavity convolution recurrent neural network comprises a convolution neural network, a bidirectional long-short-term memory neural network and a Sigmoid output layer, using logarithm Mel spectrum features extracted from a training sample as input to train the cavity convolution recurrent neural network, and identifying a sound event in the test sample by adopting the trained cavity convolution recurrent neural network to obtain a sound event detection result. According to the method, cavity convolution isintroduced into a convolutional neural network, and the convolutional neural network and a recurrent neural network are optimized and combined to obtain a hole convolution recurrent neural network. Compared with a traditional convolutional neural network, the void convolutional recurrent neural network has a larger receptive field under the condition that the sizes of the network parameter sets are the same, contextual information of audio samples can be more effectively utilized, and a better sound event detection result is obtained.

Description

technical field [0001] The invention relates to the technical field of audio signal processing and pattern recognition, in particular to a sound event detection method based on a hole convolutional cyclic neural network. Background technique [0002] The goal of Sound Event Detection (SED) is to accurately identify various target sound events in audio recordings. Acoustic event detection can be applied in many fields related to machine listening, such as traffic monitoring, smart meeting rooms, autonomous driving assistance, and multimedia analysis. Classifiers for sound event detection include deep and shallow models. Depth models mainly include convolutional cyclic neural networks, cyclic neural networks, and convolutional neural networks. Shallow models mainly include random regression forests, support vector machines, hidden Markov models, and Gaussian mixture models. [0003] The existing mainstream sound event detection method based on convolutional neural network h...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G10L25/30G10L15/06G10L15/16
CPCG10L25/30G10L15/063G10L15/16
Inventor 李艳雄刘名乐王武城江钟杰陈昊
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products