Perceptual high-definition video coding method based on salient target detection and saliency guidance

A salient target detection and high-definition video coding technology, applied in the video field, that addresses the problem that the multi-scale features of CNN models are not fully utilized and integrated.

Active Publication Date: 2020-07-17

深圳市北辰星途科技有限公司



AI Technical Summary

Problems solved by technology

[0005] In addition, existing state-of-the-art salient object detection networks are based on convolutional neural networks (CNNs) pre-trained on massive datasets, and they do not fully utilize and integrate the multi-scale features of the CNN models. The prediction results of salient object detection are also not well used in engineering applications such as video processing. At the same time, in video compression, the bitstream size and image quality produced by the most advanced standard, High Efficiency Video Coding (HEVC), still leave room for improvement.


Embodiment Construction

[0089] Exemplary embodiments of the present disclosure will be described in more detail below with reference to the accompanying drawings. Although exemplary embodiments of the present disclosure are shown in the drawings, it should be understood that the present disclosure may be embodied in various forms and should not be limited by the embodiments set forth herein. Rather, these embodiments are provided for more thorough understanding of the present disclosure and to fully convey the scope of the present disclosure to those skilled in the art.

[0090] In order to facilitate a more accurate understanding of the technical solution of the present invention, the conventional terms in the field used in the present invention are explained:

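[0091] channels: the channels (feature maps) of a convolutional layer;

[0092] shuffle: the channel shuffle operation, which reorders channels across groups;

[0093] shufflenet: shuffle network, a network built around the channel shuffle operation;

[0094] group convolution: a convolution in which the channels are split into groups and each group is convolved separately;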

[0095] ground truth: In machine learning, the term "ground truth" refers to the classification ac...
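The terms "shuffle" and "group convolution" are easier to follow with a small example. The following is a minimal sketch of the channel shuffle operation as it is commonly described for ShuffleNet-style networks; it is not taken from the patent, and the function name `channel_shuffle` and the string channel labels are hypothetical. After a group convolution, each group only sees its own channels, so the shuffle interleaves channels so that the next group convolution receives inputs from every group.

```python
def channel_shuffle(channels, groups):
    """Interleave channels so each output group mixes inputs from every group.

    `channels` is a flat list of per-channel items (labels here, feature
    maps in a real network). Hypothetical helper, for illustration only.
    """
    if len(channels) % groups != 0:
        raise ValueError("channel count must be divisible by groups")
    per_group = len(channels) // groups
    # Treat the flat list as a (groups, per_group) grid and read it
    # column by column, which is the reshape -> transpose -> flatten trick.
    return [channels[g * per_group + i]
            for i in range(per_group)
            for g in range(groups)]


# Two groups of three channels: "a*" from group 0, "b*" from group 1.
shuffled = channel_shuffle(["a0", "a1", "a2", "b0", "b1", "b2"], groups=2)
print(shuffled)  # ['a0', 'b0', 'a1', 'b1', 'a2', 'b2']
```

In a tensor library the same effect is usually obtained by reshaping the channel axis to `(groups, channels_per_group)`, swapping those two axes, and flattening back.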


Abstract

The invention discloses a perceptual high-definition video coding method based on salient target detection and saliency guidance. The method comprises the following steps: constructing a salient target detection model based on a multi-scale pyramid shuffle network; predicting the salient regions of video data with that model; and using the prediction result to guide the HEVC video compression standard, performing video coding with an adaptive quantization parameter and a saliency-based coding unit partitioning strategy. The salient target detection model of the multi-scale pyramid shuffle network generalizes better and outputs more accurate salient target segmentation maps. Guided by the prediction map, the HEVC encoder divides the video image into salient and non-salient regions and dynamically adjusts rate-distortion optimization and quantization parameter selection. The resulting video coding has better metrics: the video bitstream is smaller and the image quality is better.
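The idea of a saliency-driven adaptive quantization parameter can be sketched as follows. This is only an illustration under stated assumptions, not the patented procedure: the abstract does not give the mapping, so the linear rule, the `max_delta` parameter, and the function names `block_mean_saliency` and `adaptive_qp` are all hypothetical. The only HEVC fact relied on is that QP values for 8-bit video lie in the range 0 to 51, with a lower QP meaning finer quantization.

```python
def block_mean_saliency(saliency, block):
    """Average a 2-D saliency map (values in [0, 1]) over block x block tiles."""
    rows, cols = len(saliency), len(saliency[0])
    means = []
    for top in range(0, rows, block):
        row_means = []
        for left in range(0, cols, block):
            tile = [saliency[y][x]
                    for y in range(top, min(top + block, rows))
                    for x in range(left, min(left + block, cols))]
            row_means.append(sum(tile) / len(tile))
        means.append(row_means)
    return means


def adaptive_qp(base_qp, mean_saliency, max_delta=6):
    """Assumed linear rule: salient blocks get a lower QP, others a higher one."""
    # mean_saliency = 1 gives base_qp - max_delta; 0 gives base_qp + max_delta.
    delta = round(max_delta * (1.0 - 2.0 * mean_saliency))
    # Clamp to the HEVC QP range for 8-bit video.
    return max(0, min(51, base_qp + delta))


# Toy 2x4 saliency map: left half salient, right half background.
saliency_map = [[1, 1, 0, 0],
                [1, 1, 0, 0]]
for row in block_mean_saliency(saliency_map, block=2):
    print([adaptive_qp(32, s) for s in row])  # [26, 38]
```

Under this assumed rule the salient block is quantized more finely than the background, which is the general behavior the abstract describes: bits are spent where viewers look, and saved elsewhere.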

Description

Technical field

[0001] The invention relates to the field of video technology, in particular to a perceptual high-definition video coding method based on salient object detection and saliency guidance.

Background

[0002] In the information age, with the rapid development of video technology and applications, visual information carriers such as video and images have become more widely practical and more efficient to use, giving full play to their intuitiveness, certainty, and efficiency. Video signals, with their high-bandwidth characteristics, have penetrated into all aspects of our work and life.

[0003] At present, the videos that people watch on various channels and devices are all compressed videos. Without the video compression step, video at its original image quality and original bitstream would carry a considerable amount of data, which is unacceptable for data transmission. At present, the fastest transmission medium optical fiber can only reach 100Mbps. Compression, ...
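A short calculation makes the data-volume argument concrete. The resolution, bit depth, and frame rate below are illustrative assumptions (the source does not state them); only the 100 Mbps link figure comes from the text above. The function name `raw_bitrate_bps` is hypothetical.

```python
def raw_bitrate_bps(width, height, bits_per_pixel, fps):
    """Bit rate of uncompressed video: pixels per frame x bits x frames per second."""
    return width * height * bits_per_pixel * fps


# Assumed example: 1080p, 24 bits per pixel (8-bit RGB), 30 frames per second.
raw = raw_bitrate_bps(1920, 1080, 24, 30)
link = 100_000_000  # the 100 Mbps figure quoted in the text

print(raw)                   # 1492992000 bits per second, about 1.49 Gbps
print(round(raw / link, 2))  # 14.93, i.e. roughly 15 times the link capacity
```

Under these assumptions a single uncompressed 1080p stream needs about fifteen times the quoted link capacity, which is why compression is treated as unavoidable.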


Application Information


IPC(8): H04N19/103, H04N19/124, H04N19/147, H04N19/176

CPC: H04N19/103, H04N19/124, H04N19/147, H04N19/176

Inventor: 祝世平, 谢文韬, 赵丛杨

Owner: 深圳市北辰星途科技有限公司
