The invention discloses an audio classification method based on a
convolution neural network and a
random forest. The method comprises the following steps: S1, carrying out
spectral analysis includingsegmenting, framing, windowing and
Fourier transform on an original audio
data set to obtain a frequency
spectrogram corresponding to an original audio file; S2, training a
convolution neural networkfeature extractor by taking the obtained frequency
spectrogram as an input; S3, removing a softmax layer of the
convolution neural network and extracting high-level features of the frequency
spectrogram; S4, training a
random forest classifier by utilizing the extracted high-level features of the frequency spectrogram; S5, based on the extracted high-level features of the convolution neural network, classifying audios by utilizing the trained
random forest. According to the audio classification method disclosed by the invention,
feature extraction is performed based on the convolution neuralnetwork, so that the tedious process of manual construction of extraction features is avoided; meanwhile, for solving the problem of insufficient generalization ability caused by using the softmax asthe convolution
neural network classifier, the softmax layer of the convolution neural network is replaced with the random forest which is used as a final classifier, so that higher accuracy and
recall rate are realized in the testing process.