The invention relates to the technical field of noise source positioning and provides a noise source positioning method, device and system based on sound and images. The method comprises the steps of: obtaining sound signals collected at different positions; determining, according to the sound intensity of the sound signals, whether a loud-noise position exists; if a loud-noise position exists, controlling a camera to capture an image of the loud-noise position; and applying an artificial intelligence recognition algorithm to the image to determine whether a gathering of people is present in the image and, if so, determining that the position of the gathering is the position of the noise source. By combining sound and images, the method, device and system first identify a loud-noise position from the sound signals collected by noise sensors, then capture and recognize an image of that position with the camera to confirm whether a noise source is actually present, thereby improving the accuracy of noise source determination and positioning.
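The claimed steps can be illustrated with a minimal Python sketch. It is not the implementation of the invention: the abstract does not specify how loudness is judged, how the camera is driven, or which recognition algorithm is used. The sketch therefore assumes a simple decibel threshold (`threshold_db`), represents sensor positions as hypothetical `(x, y)` tuples, and takes the camera control and the recognition step as caller-supplied callbacks (`capture_image`, `detect_gathering`), all of which are illustrative names only.

```python
from typing import Any, Callable, Dict, Optional, Tuple

# Hypothetical representation: (x, y) coordinates of a noise sensor.
Position = Tuple[float, float]


def find_loud_position(levels_db: Dict[Position, float],
                       threshold_db: float) -> Optional[Position]:
    """Return the sensor position with the highest sound level if that level
    exceeds threshold_db; otherwise return None.

    The threshold rule is an assumption; the abstract only says the decision
    is made "according to the sound intensity".
    """
    if not levels_db:
        return None
    position, level = max(levels_db.items(), key=lambda item: item[1])
    return position if level > threshold_db else None


def locate_noise_source(
    levels_db: Dict[Position, float],
    threshold_db: float,
    capture_image: Callable[[Position], Any],
    detect_gathering: Callable[[Any], Optional[Position]],
) -> Optional[Position]:
    """Sound-then-image pipeline following the steps in the abstract.

    capture_image:    assumed callback that points the camera at a position
                      and returns an image object.
    detect_gathering: assumed callback wrapping the recognition algorithm; it
                      returns the position of a gathering of people, or None.
    """
    loud_position = find_loud_position(levels_db, threshold_db)
    if loud_position is None:
        return None  # no loud-noise position, so the camera is not triggered

    image = capture_image(loud_position)

    # A gathering found in the image is taken as the noise source position;
    # None means the image did not confirm a noise source.
    return detect_gathering(image)
```

Because the camera and the recognizer are passed in as callbacks, the control flow can be exercised with stubs before any real hardware or model is attached, and the image step runs only when the sound step has already flagged a position.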