Method for generating sound encoder for sound event detection

A sound coding and event detection technology, applied in instruments, speech analysis, computer components, etc., can solve the problems of insufficient real audio training data of strong labels and unsatisfactory detection results, and achieve the effect of reducing dependence and improving robustness

Active Publication Date: 2021-08-03
WUHAN UNIV
View PDF3 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The technical problem to be solved by the present invention is that the generation of the existing sound encoder for sound event detection requires a large amount of real audio training data with strong labels, and the lack of real audio training data with strong labels will lead to unsatisfactory detection results

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for generating sound encoder for sound event detection
  • Method for generating sound encoder for sound event detection
  • Method for generating sound encoder for sound event detection

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0064] In order to enable those skilled in the art to better understand the solutions of the present invention, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Obviously, the described embodiments are only It is a part of embodiments of the present invention, but not all embodiments. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0065] The inventors have found through research that sound carries a large amount of information about physical events in the daily environment. Through sound, the environment can be perceived, such as streets, offices, etc., and a single sound source can also be identified, such as car engine sound, footsteps, etc. The method of automatically e...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a method for generating a sound encoder for sound event detection, which comprises the following steps of: performing distortion processing on unlabeled pre-training audio signals in a first training set to obtain distortion training signals; inputting the distortion training signal into an initial sound encoder to obtain a first feature vector; determining a second feature vector based on the pre-training audio signal and the perceptron set; modifying parameters of the initial sound encoder based on the first feature vector and the second feature vector to obtain a candidate sound encoder; and training the candidate sound encoder through the fine tuning audio signal with the label in the second training set to obtain a target sound encoder. According to the invention, the initial sound encoder is pre-trained through the pre-training audio signal without the label to obtain the candidate sound encoder, and then the candidate sound encoder is fine-tuned through the fine-tuning audio signal with the label, so that the dependence on a strong label sample in the training process is reduced, and the robustness of the sound encoder is improved through distortion processing.

Description

technical field [0001] The present application relates to the field of sound event detection, in particular to a method for generating a sound coder for sound event detection. Background technique [0002] Sound carries a large amount of information about physical time in the daily environment. It can perceive the environment through sound, such as streets, offices, etc., and can also identify individual sound sources, such as car engine sound, footsteps, etc. The method of automatically extracting sound event information has great application potential in urban security, for example, using sound event information to identify activities in the environment, using sound event information to alert sensitive events, and constructing urban sound systems based on sound event information within the city. Spectrum map, search surveillance video based on sound event information, etc. [0003] The sound event can be determined through the sound event detection task (Sound Event Detec...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G10L19/00G10L19/16G10L25/51G06K9/62G06K9/46G06K9/00
CPCG10L19/16G10L25/51G06V10/40G06F2218/02G06F2218/08G06F2218/12G06F18/2431
Inventor 任延珍刘武洋何佳庆王丽娜
Owner WUHAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products