High-recall-rate weak-annotation sound event detection method

A technology of event detection and recall rate, applied in neural learning methods, computer components, instruments, etc.

Active Publication Date: 2020-12-04
TSINGHUA UNIV
View PDF3 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0009] In order to overcome the shortcomings of the above-mentioned prior art, the purpose of the present invention is to provide a high-recall rate weakly labeled sound event detection method, aiming at the situation of unbalanced samples, so that the final performance of the model is better on the F2 score that pays more attention to the recall rate performance, get more accurate sound event detection results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • High-recall-rate weak-annotation sound event detection method
  • High-recall-rate weak-annotation sound event detection method
  • High-recall-rate weak-annotation sound event detection method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some, not all, embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without creative efforts fall within the protection scope of the present invention.

[0038] Such as figure 1 As shown, the present invention proposes a weakly labeled sound event detection method for unbalanced samples, using the logarithmic mel spectrum of the audio as the audio feature, and can also perform some enhancement processing on the feature. Audio features are encoded by a multi-layer CNN, and the encoded advanced features are input into an attention pooling layer. The attention mechanism is mainly to enhance the characteristics of the e...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a high-recall-rate weak-annotation sound event detection method, and the method comprises the steps: setting a neural network and training data corresponding to deep learning;initializing a loss function as cross entropy loss, and adding a plurality of groups of dice losses with different weights, wherein the higher the positive sample proportion is, the larger the required weight is; training, testing and observing experimental results of only using cross entropy loss and increasing a plurality of groups of dice loss with different weights; adjusting a weight hyper-parameter in the loss, and re-performing a plurality of groups of dice loss weight values; carrying out the loop iteration to find out the best effect to complete training, and obtaining a final loss function; applying the final loss function to a neural network detection model, applying the obtained model to a sound event detection system, and obtaining packet-level prediction and frame-level prediction of a sound event through a neural network classifier. According to the method, the problem of non-uniform sample distribution caused by one-to-many classification generally adopted in sound event detection can be solved, and the F2 score paying more attention to the recall rate is effectively improved.

Description

technical field [0001] The invention belongs to the technical field of sound event detection, and in particular relates to a sound event detection method with high recall rate and weak annotation. Background technique [0002] The purpose of sound event detection (Sound event detection, SED) is to identify the sound events that occur in an audio clip, and detect the start and end times of the events. Since the 20th century, with the development of digital signal processing technology, it has become possible to use machines to realize operations such as speech recognition and music processing. With the passage of time, speech recognition technology has become more and more mature, and people have studied more auditory information more extensively. More and more applications, such as environmental sound perception and multimedia information retrieval, have put forward higher requirements for sound event detection technology. demand. Different from tasks such as audio classif...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06N3/04G06N3/08
CPCG06N3/084G06N3/045G06F18/2414G06F18/214
Inventor 李青轩杨毅孙甲松
Owner TSINGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products