The invention discloses a video interaction control method and device, and belongs to the technical field of monitoring. The device comprises a PTZ camera (1), a first direction sound collection unit (2), a second direction sound collection unit (3), a processing unit (4), and a communication unit (5). The processing unit (4) is used for recognizing the obtained sound information, calculating the position information of a sound source according to first sound information and second sound information when the obtained sound information belongs to preset sound sample data, and controlling the PTZ camera (1) to turn to the sound source according to the calculated position information. Through the recognition of the sound generated by the sound source, the device controls the PTZ camera (1) to turn to the sound source when the sound generated by the sound source belongs to the preset sound sample data, and carries out the interaction of the obtained video information of the sound source and a preset video interaction object, thereby enabling the emergency of the sound source to be timely fed back to the video interaction object, and enabling the sound source to be rescued or cared timely.