A 3D video intelligent multi-domain joint predictive coding method and device

What is AI technical title?
AI technical title is built by PatSnap AI team. It summarizes the technical point description of the patent document.
A technology of predictive coding and viewpoint synthesis prediction, which is applied in the field of 3D video coding to achieve the effect of improving coding efficiency and saving code rate

Active Publication Date: 2022-02-08

TIANJIN UNIV

View PDF11 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0006] The invention provides a 3D video intelligent multi-domain joint predictive coding method and device. The invention comprehensively analyzes and mines the time domain, space domain and viewpoint domain correlation of 3D video, proposes to use CNN to fuse multi-domain reference information, and proposes a hierarchical A multi-domain prediction mechanism is used to solve the problem of multi-domain reference information fusion; in addition, in the hierarchical prediction mechanism, an effective multi-domain joint prediction network is constructed, and a multi-scale coding unit is designed in the network to extract features. Use CNN to solve the multi-domain joint prediction problem of 3D video, see the description below for details:

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0040] The embodiment of the present invention proposes a 3D video intelligent multi-domain joint predictive coding method, see figure 1 , the method builds a suitable multi-domain joint prediction network, takes multi-domain reference information as input, and outputs the multi-domain prediction result of the current coding block. The specific implementation steps are as follows:

[0041] 1. Obtain multi-domain reference information

[0042] 3D video sequences have rich multi-domain correlations within frames, between frames, and between viewpoints. For the current coded block, it has spatial correlation with the adjacent coded pixel area in the frame, has temporal correlation with the co-located block in the inter-frame coded reference frame, and has a viewpoint with the co-located block in the coded reference frame of the adjacent view domain dependencies. The representation of the multi-domain reference information in the present invention and how to obtain it are explai...

Embodiment 2

[0070] Combine below Figure 1-Figure 5 Carry out feasibility verification to the scheme in embodiment 1, see the following description for details:

[0071] figure 1 The technical flow chart of the present invention is given, which mainly includes obtaining multi-domain reference information, constructing a hierarchical multi-domain prediction mechanism, constructing a spatio-temporal prediction network, obtaining spatio-temporal domain prediction results, obtaining viewpoint synthesis prediction blocks, constructing a multi-domain joint prediction network, obtaining There are six parts: comparison of multi-domain prediction results and rate-distortion cost, and selection of the optimal mode.

[0072] figure 2 The hierarchical prediction framework proposed by the present invention is given. It can be seen from the figure that the method includes a spatio-temporal prediction network and a multi-domain joint prediction network. Together with the view synthesis prediction bl...

Embodiment 3

[0077] A 3D video intelligent multi-domain joint predictive coding device, the device includes: a memory, a processor, and a computer program stored in the memory and operable on the processor, and the method described in Embodiment 1 is implemented when the processor executes the program step.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a 3D video intelligent multi-domain joint predictive coding method and device, including: 1) Obtaining multi-domain reference information: reconstructing the left side, the upper side and the upper left side of the current coding block within the range of step size As spatial reference information; use the inter-frame prediction block of temporal correlation of adjacent frames as temporal reference information; use the view synthesis prediction block obtained through view synthesis prediction technology as inter-view reference information; 2) build a spatio-temporal prediction network to The spatio-temporal domain reference information is used as input to obtain the spatio-temporal domain prediction result; 3) A multi-domain joint prediction network is constructed according to the spatio-temporal domain prediction result and the viewpoint synthesis prediction block to obtain the final multi-domain prediction result. The device includes: a memory, a processor, and a computer program stored in the memory and operable on the processor. The processor implements the steps of the method when executing the program.

Description

technical field [0001] The present invention relates to the field of 3D video coding, in particular to a 3D video intelligent multi-domain joint predictive coding method and device. Background technique [0002] With the development of 3D technology, 3D video coding has become a major research hotspot in the field of multimedia. Compared with 2D video, 3D video has more data volume, which brings great challenges to video storage and transmission. Therefore, how to realize efficient 3D video compression coding has important theoretical research significance and practical application value. [0003] As a new generation video coding standard, HEVC (High Efficiency Video Coding) effectively improves compression efficiency. As a 3D extension of HEVC, 3D-HEVC adopts a coding architecture based on the MVD (Multiview Videoplus Depth, multi-viewpoint plus depth) video format. Based on HEVC’s existing technology, it adds a new technology for multi-viewpoint video and depth video cod...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityPatents(China)

IPC IPC(8): H04N19/597H04N19/147H04N19/149H04N19/103H04N19/50G06N3/04G06N3/08

CPCH04N19/597H04N19/147H04N19/149H04N19/103H04N19/50G06N3/08G06N3/045

Inventor雷建军石雅南侯春萍张宗千彭勃

OwnerTIANJIN UNIV

A 3D video intelligent multi-domain joint predictive coding method and device

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology