Audio scene recognition method based on integrated learning

An integrated learning and scene recognition technology, applied in the field of audio scene recognition based on integrated learning, can solve problems such as limited performance and achieve good classification performance

Active Publication Date: 2019-07-23
TIANJIN UNIV
View PDF4 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In a multi-source environment, the performance of such a system is very limited

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Audio scene recognition method based on integrated learning
  • Audio scene recognition method based on integrated learning
  • Audio scene recognition method based on integrated learning

Examples

Experimental program
Comparison scheme
Effect test

specific example

[0082] 1) The monophonic audio signal, the left and right channel audio signals, and the central side channel audio signals are respectively used as three groups of training sets;

[0083] The acquisition of the monophonic audio signal and the central side channel audio signal is:

[0084] Generate a mono audio signal from left and right channel audio signals: Wherein, Mono represents a monaural audio signal, L represents a left channel audio signal, and R represents a right channel audio signal;

[0085] The central side channel audio signal is generated from the left and right channel audio signals: Mid=L+R, Sid=L−R, wherein Mid represents the central channel audio signal, and Sid represents the side channel audio signal.

[0086] 2) Perform audio feature extraction on the three sets of training sets, respectively, and use them to train three classifier networks, such as Figure 2a with Figure 2b As shown, among them, Figure 2a for training on mono audio signals, Fi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses an audio scene identification method based on integrated learning. The method comprises the following steps: taking a single-channel audio signal, a left-channel audio signal,a right-channel audio signal and a central side-channel audio signal as three groups of training sets respectively; carrying out audio feature extraction on the three groups of training sets respectively for training three classifier networks respectively; taking the audio features of the training set as the input of a classifier network, training the classifier network, and identifying an audio scene in the existing test set according to the output result of the classifier network; and performing integrated learning on the three classifier networks, and identifying the audio scene according to the output after the integrated learning. Compared with the accuracy of a single classifier network, the accuracy of the method is averagely improved by 9.3%. The problem that the learning ability and the generalization ability of a single classifier network are insufficient is well solved, and comprehensive modeling can be carried out on complex audios in the whole data set. A high-performanceaudio scene recognition system can be obtained.

Description

technical field [0001] The invention relates to an audio scene recognition method. In particular, it relates to an audio scene recognition method based on ensemble learning for ensemble learning of multiple audio scene recognition sub-models. Background technique [0002] At present, the following methods are usually used for audio scene recognition. [0003] 1. Audio scene recognition description [0004] The data of audio scene recognition is directly collected in the real environment, so there must be overlapping sounds. Humans live in a complex audio environment and are very good at following specific sound sources while ignoring or simply acknowledging other sound sources. For example, we can have a conversation against a busy background consisting of other people talking or music. The performance of automatic classification for audio scene recognition has been greatly limited in this task. Acoustic mixture signals contain multiple simultaneous sound events, and th...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06F18/214G06F18/24
Inventor 张涛刘赣俊
Owner TIANJIN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products