Hybrid sound event detection method based on factor decomposition of supervised variational encoder

A sound event detection technology based on an encoder, applied in fields such as instruments, voice analysis, and voice recognition. It addresses the problem of low detection accuracy and achieves the effect of improving detection accuracy.

Active Publication Date: 2019-07-30
JIANGSU UNIV

AI Technical Summary

Problems solved by technology

[0004] The present invention provides a factor decomposition method so that the decomposed features are not disturbed by factors irrelevant to the detection task, and each decomposed feature corresponds to only one specific sound event. This addresses the low accuracy of multi-category sound event detection in real environments and improves detection accuracy.


Examples

Embodiment Construction

[0021] The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts are within the protection scope of the present invention.

[0022] Referring to Figure 1, which shows the specific flow of the factor-decomposition-based sound event detection method of an embodiment of the present invention, the method includes the following steps:

[0023] Step 1: Receive the speech signal and preprocess it: mainly, divide the speech signal into frames of a fixed frame length, with overlap between adjacent frames.
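The framing in Step 1 can be illustrated with a minimal sketch. The function name `frame_signal` and the parameters `frame_length` and `hop_length` are assumptions made for illustration; the source does not state the frame length or the amount of overlap, so the values below are arbitrary.

```python
def frame_signal(samples, frame_length, hop_length):
    """Split a 1-D sequence into fixed-length frames.

    Adjacent frames share frame_length - hop_length samples.
    Trailing samples that do not fill a whole frame are dropped.
    """
    if not 0 < hop_length <= frame_length:
        raise ValueError("hop_length must be in (0, frame_length]")
    samples = list(samples)
    # Zero-pad very short inputs so that at least one frame is produced.
    if len(samples) < frame_length:
        samples = samples + [0.0] * (frame_length - len(samples))
    n_frames = 1 + (len(samples) - frame_length) // hop_length
    return [
        samples[i * hop_length : i * hop_length + frame_length]
        for i in range(n_frames)
    ]


# Illustrative values only: 10 samples, frames of 4 samples, 50% overlap.
frames = frame_signal(range(10), frame_length=4, hop_length=2)
for frame in frames:
    print(frame)
```

With a hop smaller than the frame length, each frame repeats the tail of the previous one, which is the overlap between adjacent frames that the step describes.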

[0024] Step 2: Extract features from the preprocessed speech signal.

[0025] Extracting the preprocessed speech sign...

Abstract

The invention discloses a hybrid sound event detection method based on factor decomposition of a supervised variational encoder. The method includes the following steps: a speech signal is received and preprocessed; features of the preprocessed speech signal are extracted; a supervised variational autoencoder is used to extract the latent attribute space of the sound events; a factor decomposition method is used to decompose the factors that make up a hybrid sound, and a feature representation related to each specific sound event is then learned; and a corresponding sound event detector is used to detect whether each specific sound event occurs. The factor decomposition learning approach addresses the low detection accuracy that arises when the hybrid sound contains relatively many sound event categories, effectively improves the accuracy of sound event detection in real scenes, and can also be used for speaker recognition and other tasks.
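The idea of one latent factor and one detector per sound event can be sketched as follows. This is not the patented network: the abstract does not give the architecture, so the encoder outputs, the event names, the detector weights, and all function names here are hypothetical. The sketch only shows two standard variational autoencoder building blocks (the reparameterized sample and the KL term against a standard normal prior) followed by a logistic detector applied to each event's latent factor.

```python
import math
import random


def reparameterize(mu, log_var, rng):
    """Sample z = mu + sigma * eps with eps ~ N(0, 1), element-wise."""
    return [m + math.exp(0.5 * lv) * rng.gauss(0.0, 1.0)
            for m, lv in zip(mu, log_var)]


def kl_to_standard_normal(mu, log_var):
    """KL(N(mu, diag(exp(log_var))) || N(0, I)) for a diagonal Gaussian."""
    return 0.5 * sum(math.exp(lv) + m * m - 1.0 - lv
                     for m, lv in zip(mu, log_var))


def sigmoid(x):
    return 1.0 / (1.0 + math.exp(-x))


def detect_event(z, weights, bias):
    """Hypothetical detector: a logistic unit on one event's latent factor."""
    return sigmoid(sum(w * zi for w, zi in zip(weights, z)) + bias)


rng = random.Random(0)

# Hypothetical encoder outputs: one (mu, log_var) pair per event class.
latents = {
    "event_a": {"mu": [0.0, 0.0], "log_var": [0.0, 0.0]},
    "event_b": {"mu": [1.0, -1.0], "log_var": [-2.0, -2.0]},
}
# Hypothetical detector parameters, shared here only to keep the sketch short.
detector = {"weights": [0.5, -0.5], "bias": 0.0}

probs = {}
for name, q in latents.items():
    z = reparameterize(q["mu"], q["log_var"], rng)
    probs[name] = detect_event(z, detector["weights"], detector["bias"])
    print(name, "KL:", kl_to_standard_normal(q["mu"], q["log_var"]),
          "p(event):", probs[name])
```

In a trained model the per-event means and log-variances would come from the encoder and the detectors would be learned jointly with it; the fixed numbers above only make the sketch runnable.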

Description

Technical field

[0001] The invention relates to the fields of speech signal processing, pattern recognition and the like, and in particular to a sound event detection method involving a variational autoencoder and a factor decomposition method.

Background art

[0002] Multi-category sound event detection refers to detecting whether each event occurs in audio in which multiple sounds are mixed. Compared with traditional few-category sound event detection, it has wider applicability in real settings, and has broad application prospects and practical significance in medical scene monitoring, traffic scene sound event detection and other fields.

[0003] Traditional multi-category sound event detection methods mainly adopt the ideas of speech recognition and template matching, for example, using Gaussian mixture models and hidden Markov models with Mel-frequency cepstral coefficients as features, or using non-negative matrix factorization to Each kind of event i...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G10L25/78, G10L15/02, G10L15/06, G10L25/24, G10L25/30, G10L25/51
CPC: G10L15/02, G10L15/063, G10L25/24, G10L25/30, G10L25/51, G10L25/78, G10L2015/025
Inventor: 毛启容, 高利剑, 陈静静, 黄多林, 张飞飞, 杨小汕, 秦谦
Owner: JIANGSU UNIV