Audio scene recognition method based on integrated learning

An ensemble learning and scene recognition technology, applied to audio scene recognition, that addresses the limited performance of a single classifier and achieves good classification performance.

Active Publication Date: 2019-07-23

TIANJIN UNIV

Problems solved by technology

In a multi-source environment, where several sounds overlap, the performance of automatic audio scene classification is very limited.

Specific example

[0082] 1) The mono audio signal, the left and right channel audio signals, and the mid-side channel audio signals are used as three separate groups of training sets;

[0083] The mono audio signal and the mid-side channel audio signals are obtained as follows:

[0084] The mono audio signal is generated from the left and right channel audio signals, where Mono denotes the mono audio signal, L denotes the left channel audio signal, and R denotes the right channel audio signal;

[0085] The mid-side channel audio signals are generated from the left and right channel audio signals: Mid = L + R, Sid = L − R, where Mid denotes the mid (central) channel audio signal and Sid denotes the side channel audio signal.
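The channel derivation in step 1) can be sketched as follows. This is a minimal illustration, not the patent's implementation: the Mid and Sid formulas come from paragraph [0085], but the mono formula is not reproduced in this text, so the sketch assumes the common average Mono = (L + R) / 2. The function name `derive_channel_groups` and the dictionary keys are hypothetical, and plain Python lists stand in for real audio buffers.

```python
from typing import Dict, List, Sequence


def derive_channel_groups(
    left: Sequence[float], right: Sequence[float]
) -> Dict[str, List[List[float]]]:
    """Build the three input groups (mono, left/right, mid/side) from one stereo clip."""
    if len(left) != len(right):
        raise ValueError("left and right channels must have the same length")

    # ASSUMPTION: the source omits the mono formula; a plain average is used here.
    mono = [(l + r) / 2.0 for l, r in zip(left, right)]
    # From paragraph [0085]: Mid = L + R, Sid = L - R.
    mid = [l + r for l, r in zip(left, right)]
    side = [l - r for l, r in zip(left, right)]

    return {
        "mono": [mono],
        "left_right": [list(left), list(right)],
        "mid_side": [mid, side],
    }


# Toy three-sample stereo clip (hypothetical values).
groups = derive_channel_groups([0.5, -0.25, 1.0], [0.5, 0.25, 0.0])
print(groups["mono"])      # one channel
print(groups["mid_side"])  # two channels: mid, then side
```

Each group would then go through its own feature extraction and train its own classifier network, as step 2) describes.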

[0086] 2) Perform audio feature extraction on each of the three training sets and use the extracted features to train three classifier networks, as shown in Figure 2a and Figure 2b, where Figure 2a is for training on mono audio signals, Fi...

Abstract

The invention discloses an audio scene recognition method based on ensemble learning. The method comprises the following steps: taking the mono audio signal, the left and right channel audio signals, and the mid-side channel audio signals as three groups of training sets; extracting audio features from each of the three training sets to train three classifier networks, one per set; taking the audio features of each training set as the input of its classifier network, training the network, and identifying the audio scenes in the existing test set from the network's output; and performing ensemble learning over the three classifier networks and identifying the audio scene from the combined output. Compared with a single classifier network, the method improves accuracy by 9.3% on average. It addresses the insufficient learning and generalization ability of a single classifier network and allows comprehensive modeling of the complex audio in the whole data set, yielding a high-performance audio scene recognition system.
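The abstract does not state how the outputs of the three classifier networks are combined. As a rough sketch only, the example below assumes soft voting: the per-class probabilities from the three networks are averaged and the highest-scoring class is chosen. This is one common ensemble rule and may differ from the patent's actual scheme. The function name `ensemble_predict`, the scene labels, and the probability values are all hypothetical.

```python
from typing import Sequence


def ensemble_predict(
    prob_sets: Sequence[Sequence[float]], labels: Sequence[str]
) -> str:
    """Average class probabilities across classifiers and return the top label."""
    if not prob_sets:
        raise ValueError("at least one classifier output is required")
    n_classes = len(labels)
    for probs in prob_sets:
        if len(probs) != n_classes:
            raise ValueError("each output must have one probability per label")

    # ASSUMPTION: equal-weight soft voting; the source does not specify the rule.
    averaged = [
        sum(probs[i] for probs in prob_sets) / len(prob_sets)
        for i in range(n_classes)
    ]
    best = max(range(n_classes), key=averaged.__getitem__)
    return labels[best]


# Hypothetical outputs of the mono, left/right, and mid/side classifiers.
labels = ["park", "metro", "office"]
outputs = [
    [0.5, 0.25, 0.25],     # mono network favors "park"
    [0.25, 0.5, 0.25],     # left/right network favors "metro"
    [0.125, 0.75, 0.125],  # mid/side network favors "metro"
]
scene = ensemble_predict(outputs, labels)
print(scene)
```

In this toy case the mono network alone would pick a different class than the combined decision, which illustrates how the ensemble can correct a single network's error.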

Description

Technical field

[0001] The invention relates to an audio scene recognition method, and in particular to an audio scene recognition method based on ensemble learning that combines multiple audio scene recognition sub-models.

Background

[0002] At present, the following methods are usually used for audio scene recognition.

[0003] 1. Audio scene recognition description

[0004] The data for audio scene recognition is collected directly in real environments, so overlapping sounds are unavoidable. Humans live in a complex audio environment and are very good at following specific sound sources while ignoring or merely registering others. For example, we can hold a conversation against a busy background of other people talking or music. Automatic classification for audio scene recognition remains very limited at this task. Acoustic mixture signals contain multiple simultaneous sound events, and th...

Application Information

Patent Type & Authority: Applications (China)

IPC(8): G06K9/62

CPC: G06F18/214, G06F18/24

Inventors: 张涛, 刘赣俊

Owner: TIANJIN UNIV
