Acoustic scene identification method based on data enhancement
Patent Information
- Authority / Receiving Office
- CN · China
- Current Assignee / Owner
- SOUTH CHINA UNIV OF TECH
- Publication Date
- 2019-07-05
Description
technical field
[0001] The present invention relates to the technical fields of audio signal processing and deep learning, and in particular to an acoustic scene recognition method based on data enhancement.
Background technique
[0002] Audio signals carry rich information and can be captured naturally and without physical contact. An acoustic scene is a high-level, semantic representation of an audio signal. Acoustic scene recognition associates a semantic label with an audio stream in order to identify the category of the environment in which the sound was produced. This technology enables smart devices to perceive their surroundings through sound and make appropriate decisions. The volume of audio data is currently growing rapidly. Because manual labeling is time-consuming and labor-intensive, very few audio samples have accurate labels, and unlabeled audio samples cannot be used directly to train a classifier. How to construct more diverse training dat...