A method for predicting user field of view based on deep learning

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A deep learning, user-friendly technology, applied in the field of computer vision and deep learning, which can solve the problems of high bandwidth and low latency not being solved, affecting the accuracy of video features, and awkward field of view prediction.

Active Publication Date: 2022-04-22

NANJING UNIV

View PDF5 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0002] At present, many innovative applications have appeared in the VR industry, and VR is also gradually entering mobile terminals such as mobile phones, but the problems of high bandwidth and low latency required for smooth VR playback have not been resolved.

In addition, isometric mapping etc. make the distortion of objects in the panorama very obvious, thus also affecting the accuracy of the obtained video features, which is an embarrassing problem for field of view prediction

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0020] In order to make the purpose, technical solution and advantages of the present invention clearer, the implementation method of the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0021] A method of predicting the user's field of view based on deep learning in this embodiment, the steps are as follows:

[0022] (1) Map the panoramic video from the spherical surface to the 6 faces of the cube inscribed on the sphere, and obtain the video corresponding to the 6 faces of the cube from the 2D panoramic video. Number the faces of the cube from 1 to 6, and expand them in sequence from 1 to 6 (see attached image 3 ).

[0023] (2) Use the optical flow algorithm to generate the dynamic feature sequence diagram of the six faces of the cube corresponding to the video, and then use the coordinate transformation relationship from the cube to the 2D plane and its numbering sequence to synthesize the panoramic dynamic featu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a method for predicting a user's field of view based on deep learning. The steps are: (1) Map the panoramic video from the spherical surface to the 6 surfaces of the spherical inscribed cube to obtain the videos corresponding to the 6 surfaces, generate the dynamic features and saliency sequence diagrams of the videos respectively, and perform block and numbering ; (2) Judging the severity w of video content viewpoint switching according to the dynamic features; (3) Recording the user's head turning with the helmet and processing it; (4) Selecting the prediction network through the value of w, and using the network prediction to obtain the user's The field of view of the next n frames of video frames can be processed to obtain the number of video blocks that overlap with the field of view; (5) render and transmit the predicted video blocks, and repeat the steps until the last n frames are predicted. The method of the present invention reduces the influence of panorama distortion on the input video features, and at the same time adds the pre-judgment and classification of video information, and can predict the field of view when the user watches the video in the VR HMD with high accuracy.

Description

technical field [0001] The invention relates to the fields of computer vision and deep learning, in particular to a method for predicting a user's field of view based on deep learning. Background technique [0002] At present, many innovative applications have appeared in the VR industry, and VR is gradually entering mobile terminals such as mobile phones. However, the problems of high bandwidth and low latency required for smooth VR playback have not been resolved. Human perception requires smooth and accurate movement of vision, so unsmooth playback and high delay may cause VR users to experience symptoms such as nausea and dizziness, seriously affecting the user's immersive experience. Adding field of view prediction during VR video rendering and transmission can reduce the amount of transmitted data, thereby reducing the rendering and transmission time and effectively reducing transmission delay. [0003] LSTM (Long Short Term Memory) network is a special type of recurr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G06T7/246G06V20/40G06F3/01

CPCG06F3/012G06T7/246G06T2207/10016G06V20/40

Inventor 蒲志远沈秋郭佩瑶马展

Owner NANJING UNIV

A method for predicting user field of view based on deep learning

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology