Acoustic scene identification method based on data enhancement

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
An identification method and scene technology, applied in the field of audio signal processing and deep learning, can solve the problems of high cost of manual labeling, high cost of manual labeling data, insufficient labeled training data, etc.

Active Publication Date: 2019-07-05

SOUTH CHINA UNIV OF TECH

View PDF15 Cites 11 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

At present, manual labeling is mainly used to increase the diversity of labeled data, but the cost of manual labeling is very high

In addition, in different classification tasks, data samples need to be relabeled, which is not universal, making the cost of manual labeling even higher

In order to overcome the problems of high cost of manually labeled data and insufficient labeled training data, data enhancement methods are urgently needed to increase the diversity of limited labeled audio data, thereby improving the adaptability and generalization ability of complex classifiers

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment

[0065] This embodiment discloses a specific implementation process of a sound scene recognition method based on data enhancement, such as figure 1 As shown, the specific steps of the sound scene recognition method are as follows:

[0066] S1. Audio sample preparation: use recording equipment to collect audio samples in different acoustic scenes, and manually mark them, and then divide the above audio samples into training sets and test sets.

[0067] In this embodiment, this step specifically includes the following steps:

[0068] S1.1. Use recording equipment to collect audio data: place recording equipment in different scenes, and record audio samples of corresponding scenes. The sampling frequency is 16kHz, and the number of quantization bits is 16bit.

[0069] S1.2. Divide the dataset: randomly divide the labeled audio samples into disjoint training sets and test sets, where the training set accounts for about 80% and the test set accounts for about 20%.

[0070] S2. Pre...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses an acoustic scene identification method based on data enhancement. The method comprises the following steps: firstly, collecting and marking audio samples of different sound scenes; then preprocessing is carried out, and pre-emphasis, framing and windowing processing are carried out on the audio samples; data enhancement is then performed, extracting a harmonic source and an impact source of each audio sample to obtain more sufficient audio samples, extracting logarithmic Mel filter bank characteristics from the audio samples and the harmonic sources and the impact sources of the audio samples, stacking the three characteristics into a three-channel high-dimensional characteristic, and constructing more abundant training samples by adopting a hybrid enhancement technology; and finally, inputting the three-channel high-dimensional features into an Xception network for judgment, and identifying the sound scene corresponding to each audio sample. According to the data enhancement method, the generalization capability of the Xception network classifier can be effectively improved, and the training process of the network is stabilized. When the acoustic scene isidentified, the method can obtain a better identification effect.

Description

technical field [0001] The present invention relates to the technical fields of audio signal processing and deep learning, in particular to a sound scene recognition method based on data enhancement. Background technique [0002] Audio signals contain rich information and have the advantages of being non-contact and natural. A sound scene is a high-level representation of an audio signal at the semantic level. The task of acoustic scene recognition is to associate semantic labels with audio streams to identify categories of sound-producing environments. This technology enables smart devices to perceive the surrounding environment based on sound, so as to make appropriate decisions. At present, there is a massive increase in audio data. Due to the time-consuming and labor-intensive manual labeling of data, there are very few audio samples with accurate labels. Unlabeled audio samples cannot be directly used to train a classifier. How to construct more diverse training dat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06K9/62G10L21/0208G10L25/03G10L25/27G10L25/45

CPCG10L21/0208G10L25/03G10L25/27G10L25/45G06F18/24G06F18/214

Inventor李艳雄张聿晗王武城刘名乐

OwnerSOUTH CHINA UNIV OF TECH

Acoustic scene identification method based on data enhancement

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology