Unlock instant, AI-driven research and patent intelligence for your innovation.

Monocular video based object depth extraction method

A technology of depth extraction and video, applied in the field of computer vision, can solve the problems of unfavorable practicality, low precision of depth data, large amount of calculation, etc.

Active Publication Date: 2015-06-17
北京融合未来技术有限公司
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] At present, in practical applications, multi-eye cameras or depth cameras are used to directly collect depth information. This collection method has the following four types of problems: 1) The amount of data is very large
2) The accuracy of the depth data is not high, especially the data accuracy of the depth camera drops sharply under the condition of violent movement
3) Existing large amounts of precious monocular video materials cannot be reused
However, multiple frames are required to be jointly optimized, which has high requirements for the continuity of frames in the scene, and at the same time, the amount of calculation is huge, which is not conducive to practical use.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Monocular video based object depth extraction method
  • Monocular video based object depth extraction method
  • Monocular video based object depth extraction method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment approach

[0032] Step 1: Depth map initialization

[0033] Two adjacent frames in a monocular video sequence cannot simply be regarded as images corresponding to the left and right eyes of a person. At present, binocular stereo matching is a relatively mature depth information extraction technology, but it has inherent characteristics: 1) If the baseline (Baseline) of the two images is small, the matching is easy, but the depth accuracy of recovery is not high; If it is too large, it is easy to cause matching difficulties; 2) It is difficult to reliably infer the depth of the occluded part due to lack of information. In comparison, using multi-view stereo matching for depth recovery has more advantages. When initializing the depth map, in order to find the optimal matching, we first need to find the matching pixels. The selection of matching pixels can use the epipolar geometry in the multi-view geometric projection to simplify the search of the entire surface to only the search on the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a monocular video based object depth extraction method, which comprises the following steps: firstly, carrying out pixel projection between adjacent key frames through using the self-calibration results of a camera so as to obtain a matching cost minimum and then obtain a locally-optimized initialized depth map; then, defining an energy function expressing color consistency constraints, geometric consistency constraints, smoothness constraints and initialization depth map information, and converting a depth map extraction problem into an energy function minimization solving problem, so that the obtained depth map is a global optimum map when the energy function is subjected to optimum solution; and then, carrying out anisotropic diffusion on the map, and then obtaining a better map segmentation result by using a Meanshift algorithm. Credible pixels in the global optimum depth map are subjected to plane fitting by using the segmentation result, thereby improving the quality of the depth map better. Meanwhile, the depth continuity of a video sequence on a time shaft is taken into consideration so as to carry out optimization on the time shaft; and finally, non-key frames are performed by using a simplifying algorithm.

Description

technical field [0001] The invention relates to a monocular video-based object depth extraction method, which belongs to the technical field of computer vision. Background technique [0002] Depth information is the main carrier of stereoscopic perception, which can play an important role in many fields such as virtual view synthesis, scene layering, multi-view video compression, and object extraction. [0003] At present, in practical applications, multi-eye cameras or depth cameras are used to directly collect depth information. This collection method has the following four types of problems: 1) The amount of data is very large. 2) The accuracy of the depth data is not high, especially the data accuracy of the depth camera drops sharply under the condition of violent movement. 3) A large number of existing precious monocular video materials cannot be reused. 4) Renewal of the industrial chain is required. The object depth extraction method based on monocular video is a ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06T7/00
Inventor 李炜黄超程浩
Owner 北京融合未来技术有限公司