The invention discloses a method for extracting a video region of interest (ROI) based on visual perception characteristics and encoding information, and relates to the field of video encoding. The method comprises the following steps: (1) extracting luminance information of the current encoding macroblock from the original video stream; (2) identifying a spatial-domain visual saliency region from the inter-frame prediction mode type of the current encoding macroblock; (3) using the mean horizontal motion vector and the mean vertical motion vector of the previous encoding macroblock as dual dynamic thresholds, and identifying a temporal-domain visual saliency region by comparing the horizontal and vertical motion vectors of the current encoding macroblock with these thresholds; and (4) defining a video interest priority by combining the spatial-domain and temporal-domain identification results, thereby automatically extracting the region of interest of the video. The method can provide an important encoding basis for ROI-based video encoding techniques.
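
Steps (2) to (4) might be sketched as follows. This is only an illustration under several assumptions that the abstract does not state: the set of prediction modes treated as spatially salient (`SALIENT_MODES`), the use of motion vector magnitudes averaged over all previously encoded macroblocks, the rule that a macroblock is temporally salient when either component exceeds its threshold, and the 0 to 2 priority scale are all hypothetical. Step (1) is omitted, and the `Macroblock` record stands in for data that an encoder would already have parsed.

```python
from dataclasses import dataclass
from typing import Sequence, Tuple

# Assumption: small-partition modes mark spatially detailed (salient) content.
# The abstract does not say which prediction modes count as salient.
SALIENT_MODES = frozenset({"8x8", "8x4", "4x8", "4x4"})


@dataclass(frozen=True)
class Macroblock:
    """Hypothetical record of the encoding information for one macroblock."""
    mode: str     # inter-frame prediction mode type
    mv_x: float   # horizontal motion vector component
    mv_y: float   # vertical motion vector component


def dual_thresholds(previous: Sequence[Macroblock]) -> Tuple[float, float]:
    """Mean horizontal and vertical motion vector magnitudes of earlier macroblocks."""
    if not previous:
        return 0.0, 0.0
    n = len(previous)
    mean_x = sum(abs(mb.mv_x) for mb in previous) / n
    mean_y = sum(abs(mb.mv_y) for mb in previous) / n
    return mean_x, mean_y


def is_spatially_salient(mb: Macroblock) -> bool:
    """Step (2): decide spatial saliency from the prediction mode type."""
    return mb.mode in SALIENT_MODES


def is_temporally_salient(mb: Macroblock, thresholds: Tuple[float, float]) -> bool:
    """Step (3): compare both motion vector components with the dual thresholds.

    Assumption: exceeding either threshold is enough to mark the block salient.
    """
    t_x, t_y = thresholds
    return abs(mb.mv_x) > t_x or abs(mb.mv_y) > t_y


def interest_priority(mb: Macroblock, previous: Sequence[Macroblock]) -> int:
    """Step (4): combine both results into a priority.

    Assumption: 2 = salient in both domains, 1 = salient in one, 0 = background.
    """
    spatial = is_spatially_salient(mb)
    temporal = is_temporally_salient(mb, dual_thresholds(previous))
    return int(spatial) + int(temporal)


if __name__ == "__main__":
    history = [Macroblock("16x16", 1.0, 0.0), Macroblock("16x16", 3.0, 2.0)]
    current = Macroblock("4x4", 5.0, 0.0)
    print(interest_priority(current, history))
```

Because the thresholds are recomputed from the macroblocks encoded so far, they adapt to the overall motion level of the sequence rather than relying on a fixed constant, which appears to be the intent of the "dynamic" thresholds in step (3).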