The present invention relates to a method and device for detecting baby crying in a real scene, and a technical solution of a readable medium, comprising: collecting audio data including baby crying in a real scene, labeling and pre-processing the audio data as a data set , to obtain the network input data; input the network input data to the deep neural network including the feature extraction network, the human voice detection network and the cry detection network, and carry out the training of feature extraction, human voice detection and cry detection respectively, and obtain the human The first loss function and the second loss function corresponding to the sound detection network and the cry detection network; the third loss function is obtained by training the deep neural network as a whole, and the baby cry detection model is obtained; The audio data collected in the scene is detected, and the baby crying detection result of the real scene is obtained. The beneficial effect of the present invention is that the baby's cry can be detected more accurately in a relatively short time.