The invention relates to a region-of-interest video encoding and transmitting method which realize multi-channel combination, and belongs to the technical field of the video encoding and transmission. The method adopts the following steps: step one, space down sampling is performed to the panoramic video with high resolution collected by a panoramic camera, and then encoding is performed after video with low resolution is obtained; step two, region-of-interest detection is performed to video with high definition collected by a visible-light camera, and self-adaptive switching is performed to the two channels of video, namely, downsized video and down-sampled video, according to the area and the position of the region-of-interest detection; step three, an an infrared thermal imager is used for performing detection and tracking to interest targets, encoding infrared region-of-interest video with primary low resolution, and adjusting quantization parameter to realize code rate control; step four, the priority of the three channels of video is set, protecting channel encoding for non-uniform channel is performed according to the priority, and the code is multiplexed into one channel of code stream to be sent to the channel for transmission, and code rate distribution of the channel bandwidth is performed according to the priority. The method guarantees precise detection and high quality encoding to the region of interest when ensuring global detection on the complete scene.