Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Rate distortion optimization rapid decision-making system and method based on deep learning in HEVC intra-frame coding

A rate-distortion optimization and intra-frame coding technology, which is applied in the field of video coding, can solve the problems of HEVC encoders such as high computational complexity, unreasonableness, and reduced complexity.

Active Publication Date: 2020-06-30
蔡晓刚
View PDF3 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

First, from the perspective of complexity reduction, the CU / PU partition decision and prediction mode selection are an overall traversal process, which together leads to a large computational complexity of the HEVC encoder.
Therefore, focusing on only one of the tasks does not minimize the complexity
Second, there is an obvious correlation between the CU / PU partition decision and the prediction mode selection, and it is unwise to simply treat these decisions as a separate process and make decisions in two stages

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Rate distortion optimization rapid decision-making system and method based on deep learning in HEVC intra-frame coding
  • Rate distortion optimization rapid decision-making system and method based on deep learning in HEVC intra-frame coding
  • Rate distortion optimization rapid decision-making system and method based on deep learning in HEVC intra-frame coding

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0068] Whole flow chart of the present invention is attached figure 1 Shown, the present invention is elaborated below in conjunction with accompanying drawing:

[0069] Step 1. Dataset Preparation

[0070] The present invention selects 86 video sequences as data sets from the Joint Working Group on Video Coding (JCT-VC) and Xiph.org[1]. In order to ensure the diversity of data sets, these videos have different resolution formats, including: SIF , CIF, NTSC, 4CIF, 240p, 480p, 720p, 1080p, WQXGA. A total of 86 video sequences are divided into two non-overlapping sets for training and testing, 72 of which are used to construct the training dataset and 14 are used for testing. In order to ensure the generalization performance of the network model and evaluate the model performance fairly, the videos used for training and testing are randomly selected from different resolutions. Four QP values ​​{22, 27, 32, 37} were selected, and all sequences were encoded in full intra mode w...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a rate distortion optimization rapid decision-making system and method based on deep learning in HEVC intra-frame coding, and belongs to the technical field of video coding. According to the method, the internal relation existing during coding parameter selection is considered, and the CU / PU partition and the prediction mode of the current coding unit are determined at thesame time in combination with the spatial-temporal correlation of the video content, so that the traversal process in the rate distortion optimization process in HEVC intra-frame coding is avoided, the calculation complexity can be reduced to the maximum extent, and the coding time is saved. The method specifically comprises the following steps: 1) preparing a video data set for training and testing a decision network; 2) respectively training the single-step decision network under different QPs by utilizing the training data set, and storing trained network model parameters; 3) embedding thesingle-step decision network into an HEVC reference model HM.15. 0 to realize a low-complexity HEVC encoder, and 4) respectively using the single-step decision network models trained under different QPs to encode the video on the test set, and testing the encoding complexity and RD performance.

Description

technical field [0001] The invention belongs to the technical field of video coding, and in particular relates to a rate-distortion optimization fast decision-making system based on deep learning in HEVC intra-frame coding and a method thereof. Background technique [0002] Video coding technology is the foundation of video services. Due to the huge amount of raw video data directly obtained from visual sensors, in order to effectively transmit and store video, the raw video is compressed with high bit rate-distortion (RD) quality and low complexity. Coding is necessary. In recent years, the further improvement of video resolution and the real-time requirements of video services have put forward higher requirements for video coding technology. As the latest video coding standard, the high-efficiency video coding standard H.265 / HEVC can achieve a significantly high compression ratio. However, H.265 / HEVC introduces more encoding parameters, and it is necessary to determine t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): H04N19/149H04N19/159H04N19/14
CPCH04N19/149H04N19/159H04N19/14Y02T10/40
Inventor 蔡晓刚
Owner 蔡晓刚
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products