Learning-based video coding and decoding framework

A video codec and coding technology, applied in the direction of digital video signal modification, electrical components, image communication, etc., can solve problems that are difficult to meet the needs of intelligent media applications, and achieve the effect of distortion optimization

Active Publication Date: 2018-06-15
UNIV OF SCI & TECH OF CHINA
View PDF4 Cites 12 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

After years of development, the improvement of coding performance is accompanied by the continuous increase of complexity, and it is also facing more and more challenges to further improve the coding performance under the existing hybrid coding architecture.
[0003] However, the current hybrid coding framework usually uses a heuristic method to optimize the coding of images and videos, which is increasingly difficult to meet the needs of complex and intelligent media applications such as face recognition, object tracking, and image retrieval.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Learning-based video coding and decoding framework
  • Learning-based video coding and decoding framework
  • Learning-based video coding and decoding framework

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0021] The technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are only some of the embodiments of the present invention, not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0022] Embodiments of the present invention provide a learning-based video encoding and decoding framework, the video encoding and decoding framework mainly includes: an encoding end and a decoding end; figure 1 As shown, the encoding end mainly includes: space-time domain reconstruction memory, space-time domain prediction network, iterative analyzer, iterative synthesizer, binarizer, entropy encoder and entropy decoder;

[0023] The space-time do...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a learning-based video coding and decoding framework, which comprises a space-time domain reconstruction memory, a space-time domain prediction network, an iterative analyzer,an iterative synthesizer, a binarization device, an entropy coder and an entropy decoder, wherein the space-time domain reconstruction memory is used for storing a reconstructed video content after coding and decoding; the space-time domain prediction network is used for utilizing the space-time domain correlation of the reconstructed video content, modeling the reconstructed video content througha convolutional neural network and a circulating neural network, and outputting a predicted value of a current coding block, wherein a residual error is formed by subtraction of the predicted value and an original value; the iterative analyzer and the iterative synthesizer are used for coding and decoding the input residual error step by step; the binarization device is used for quantizing the output of the iterative analyzer into a binary representation; the entropy coder is used for carrying out entropy coding on the quantized coding output in order to obtain an output code stream; and theentropy decoder is used for carrying out entropy decoding on the output code stream and outputting the output code stream to the iterative synthesizer. According to the coding framework, the prediction of a space-time domain is realized through the learning-based VoxelCNN (namely space-time domain prediction network), and the control of video coding rate distortion optimization is realized througha residual iterative coding method.

Description

technical field [0001] The invention relates to the technical field of video coding and decoding, in particular to a learning-based video coding and decoding framework. Background technique [0002] Existing image and video coding standards such as JPEG, H.261, MPEG-2, H.264, and H.265 are all based on a hybrid coding framework. After years of development, the improvement of coding performance is accompanied by the continuous increase of complexity, and it is facing more and more challenges to further improve the coding performance under the existing hybrid coding architecture. [0003] However, the current hybrid coding framework usually uses a heuristic method to optimize the coding of images and videos, which is increasingly difficult to meet the needs of complex and intelligent media applications such as face recognition, object tracking, and image retrieval. Contents of the invention [0004] The purpose of the present invention is to provide a learning-based video e...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): H04N19/503H04N19/593H04N19/192H04N19/147H04N19/70
CPCH04N19/147H04N19/192H04N19/503H04N19/593H04N19/70
Inventor 陈志波何天宇金鑫刘森
Owner UNIV OF SCI & TECH OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products