Sound scene recognizing method based on label amplification and multi-spectrum fusion
A scene recognition and spectrogram technology, applied in the field of scene recognition, can solve the problem of not considering clustering and extracting super-category labels, and achieve the effects of fast training convergence, improved performance, and system robustness
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0031] Such as figure 1 As shown, in this embodiment, a sound scene recognition method based on tag amplification and multi-spectrum fusion includes the following steps:
[0032] Step S1: the data set used in this embodiment includes the Development file set and the Evaluation file set of DCASE2017 sound scene recognition; 90% of the Development file set is used as the training part Tr, and the remaining 10% is used as the verification part V1, and the Evaluation file set is used as the verification part V1 As a test part Te. The audio files in each file set are 10 seconds long. Without loss of generality, this embodiment only uses two spectrogram formats to describe the implementation steps: one is the STFT spectrogram, and the other is the CQT spectrogram.
[0033] Step S2: Take out the audio files one by one from Tr, and obtain the STFT time-frequency characteristic value after operations such as framing, windowing, and short-time Fourier transform, and organize the time-...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


