The invention discloses a method for labeling an audio by using a deep learning model. The method comprises the following steps: A, acquiring the audio and performing voice preprocessing on the acquired audio; b, inputting the audio data subjected to voice preprocessing into a deep learning model for voice recognition and voice annotation, and labeling the audio according to the voice annotation,wherein the deep learning model comprises a deep neural network and a long-short-term memory unit; and C, performing manual proofreading on the label output by the deep learning model. According to the method disclosed by the invention, the tedious work of manual listening, manual labeling and manual proofreading is converted into the work of only needing manual proofreading, and other operationsare automatically carried out by the system model, so that the manpower and time cost can be greatly saved, and the effectiveness is guaranteed.