The invention discloses an audio classification method based on a 
convolution neural network and a 
random forest. The method comprises the following steps: S1, carrying out 
spectral analysis includingsegmenting, framing, windowing and 
Fourier transform on an original audio 
data set to obtain a frequency 
spectrogram corresponding to an original audio file; S2, training a 
convolution neural networkfeature extractor by taking the obtained frequency 
spectrogram as an input; S3, removing a softmax layer of the 
convolution neural network and extracting high-level features of the frequency 
spectrogram; S4, training a 
random forest classifier by utilizing the extracted high-level features of the frequency spectrogram; S5, based on the extracted high-level features of the convolution neural network, classifying audios by utilizing the trained 
random forest. According to the audio classification method disclosed by the invention, 
feature extraction is performed based on the convolution neuralnetwork, so that the tedious process of manual construction of extraction features is avoided; meanwhile, for solving the problem of insufficient generalization ability caused by using the softmax asthe convolution 
neural network classifier, the softmax layer of the convolution neural network is replaced with the random forest which is used as a final classifier, so that higher accuracy and 
recall rate are realized in the testing process.