Learning-based video coding and decoding framework

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
A video codec and coding technology, applied in the direction of digital video signal modification, electrical components, image communication, etc., can solve problems that are difficult to meet the needs of intelligent media applications, and achieve the effect of distortion optimization

Active Publication Date: 2018-06-15

UNIV OF SCI & TECH OF CHINA

View PDF4 Cites 12 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

After years of development, the improvement of coding performance is accompanied by the continuous increase of complexity, and it is also facing more and more challenges to further improve the coding performance under the existing hybrid coding architecture.

[0003] However, the current hybrid coding framework usually uses a heuristic method to optimize the coding of images and videos, which is increasingly difficult to meet the needs of complex and intelligent media applications such as face recognition, object tracking, and image retrieval.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0021] The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0022] Embodiments of the present invention provide a learning-based video encoding and decoding framework, the video encoding and decoding framework mainly includes: an encoding end and a decoding end; figure 1 As shown, the encoding end mainly includes: space-time domain reconstruction memory, space-time domain prediction network, iterative analyzer, iterative synthesizer, binarizer, entropy encoder and entropy decoder;

[0023] The space-time do...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a learning-based video coding and decoding framework, which comprises a space-time domain reconstruction memory, a space-time domain prediction network, an iterative analyzer,an iterative synthesizer, a binarization device, an entropy coder and an entropy decoder, wherein the space-time domain reconstruction memory is used for storing a reconstructed video content after coding and decoding; the space-time domain prediction network is used for utilizing the space-time domain correlation of the reconstructed video content, modeling the reconstructed video content througha convolutional neural network and a circulating neural network, and outputting a predicted value of a current coding block, wherein a residual error is formed by subtraction of the predicted value and an original value; the iterative analyzer and the iterative synthesizer are used for coding and decoding the input residual error step by step; the binarization device is used for quantizing the output of the iterative analyzer into a binary representation; the entropy coder is used for carrying out entropy coding on the quantized coding output in order to obtain an output code stream; and theentropy decoder is used for carrying out entropy decoding on the output code stream and outputting the output code stream to the iterative synthesizer. According to the coding framework, the prediction of a space-time domain is realized through the learning-based VoxelCNN (namely space-time domain prediction network), and the control of video coding rate distortion optimization is realized througha residual iterative coding method.

Description

technical field [0001] The invention relates to the technical field of video coding and decoding, in particular to a learning-based video coding and decoding framework. Background technique [0002] Existing image and video coding standards such as JPEG, H.261, MPEG-2, H.264, and H.265 are all based on a hybrid coding framework. After years of development, the improvement of coding performance is accompanied by the continuous increase of complexity, and it is facing more and more challenges to further improve the coding performance under the existing hybrid coding architecture. [0003] However, the current hybrid coding framework usually uses a heuristic method to optimize the coding of images and videos, which is increasingly difficult to meet the needs of complex and intelligent media applications such as face recognition, object tracking, and image retrieval. Contents of the invention [0004] The purpose of the present invention is to provide a learning-based video e...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): H04N19/503H04N19/593H04N19/192H04N19/147H04N19/70

CPCH04N19/147H04N19/192H04N19/503H04N19/593H04N19/70

Inventor陈志波何天宇金鑫刘森

OwnerUNIV OF SCI & TECH OF CHINA

Learning-based video coding and decoding framework

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology