Perceptual high-definition video coding method based on salient target detection and saliency guidance

A target detection and high-definition video technology, applied in the video field, can solve the problem that the multi-scale features of the CNN model are not fully utilized and integrated.

Active Publication Date: 2020-07-17
深圳市北辰星途科技有限公司
View PDF7 Cites 17 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] In addition, the existing state-of-the-art Salient Object Detection Networks (salient object detection network) are based on pre-trained convolutional neural networks (CNNs) on massive datasets, and they are not very good at multi-scale features in CNN models. It is fully utilized and integrated, and the prediction results of Sali

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Perceptual high-definition video coding method based on salient target detection and saliency guidance
  • Perceptual high-definition video coding method based on salient target detection and saliency guidance
  • Perceptual high-definition video coding method based on salient target detection and saliency guidance

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0089] Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.

[0090] In order to facilitate a more accurate understanding of the technical solution of the present invention, the conventional terms in the field used in the present invention are explained:

[0091] channels: channel;

[0092] shuffle: shuffle;

[0093] shufflenet: shuffle network;

[0094] group convolution: group convolution;

[0095] ground truth: In machine learning, the term "ground truth" refers to the classification ac...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a perceptual high-definition video coding method based on salient target detection and saliency guidance. The method comprises the following steps: constructing a salient target detection model of a multi-scale pyramid shuffling network; carrying out salient region prediction on video data through the salient target detection model of the multi-scale pyramid shuffling network; and guiding an HEVC video compression standard by utilizing a prediction result, and performing video coding through an adaptive quantization parameter and a significance-based coding unit partitioning strategy. The significant target detection model of the multi-scale pyramid shuffling network is stronger in generalization, and can output a prediction result image of significant target segmentation with higher accuracy; the HEVC video compression standard is guided on the basis of the prediction result image, the video image is divided into a salient region and a non-salient region, dynamic optimization is carried out on rate distortion optimization and quantization parameter selection, finally, a video coding result with better indexes is obtained, the video code stream is smaller, and the image quality is better.

Description

technical field [0001] The invention relates to the field of video technology, in particular to a perceptual high-definition video coding method based on salient object detection and saliency guidance. Background technique [0002] In the information age, with the rapid development of video technology and applications, visual information carriers such as video and images have wider practicability and higher use efficiency, giving full play to their intuition, certainty, efficiency and The high-bandwidth characteristics of video signals have penetrated into all aspects of our work and life. [0003] At present, the videos that people watch on various channels and devices are all compressed videos. If there is no step of video compression, the original image quality and original code stream video will have a considerable amount of data, which is unacceptable for data transmission. At present, the fastest transmission medium optical fiber can only reach 100Mbps. Compression, ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): H04N19/103H04N19/124H04N19/147H04N19/176
CPCH04N19/103H04N19/124H04N19/147H04N19/176
Inventor 祝世平谢文韬赵丛杨
Owner 深圳市北辰星途科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products