Unlock instant, AI-driven research and patent intelligence for your innovation.

Key point thermodynamic diagram guided self-supervised monocular visual odometer method

A monocular vision and key point technology, applied in the field of computer vision, can solve the problems of cumulative error, no focus of deep neural network, and no consideration of the consistency of video image sequence pose, etc., to achieve the effect of improving accuracy and reducing cumulative error

Active Publication Date: 2022-06-24
UNIV OF SCI & TECH BEIJING +1
View PDF14 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Existing self-supervised methods use too many redundant pixels during the training process, making the deep neural network unfocused during the learning process, resulting in cumulative errors in the estimated pose of the network
In addition, these methods only consider the pose consistency between adjacent frames, and do not consider the pose consistency of the video image sequence

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Key point thermodynamic diagram guided self-supervised monocular visual odometer method
  • Key point thermodynamic diagram guided self-supervised monocular visual odometer method
  • Key point thermodynamic diagram guided self-supervised monocular visual odometer method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0059] In order to make the objectives, technical solutions and advantages of the present invention clearer, the embodiments of the present invention will be further described in detail below with reference to the accompanying drawings.

[0060] like figure 1 and figure 2 As shown, an embodiment of the present invention provides a self-supervised monocular visual odometry method guided by a key point heat map, including:

[0061] S101, construct a pose estimation network (PoseNet) and a depth estimation network (DepthNet);

[0062] In this embodiment, in order to control the memory occupation, the input images (referring to RGB images) of the pose estimation network and the depth estimation network are scaled to a size of 416×128.

[0063] In this embodiment, the pose estimation network includes: an encoder and a decoder, wherein ResNet50 can be selected as the encoder, and the encoder outputs 2048 channels of encoded input pose estimation network decoder. The decoder inpu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a self-supervision monocular vision odometer method guided by a key point thermodynamic diagram, and belongs to the field of computer vision. The method comprises the following steps: constructing a pose estimation network and a depth estimation network; inputting the video image sequence into a pose estimation network and a depth estimation network; extracting key points of each frame of image in the video image sequence and generating a key point thermodynamic diagram; multiplying the poses of each pair of adjacent frame images output by the pose estimation network to obtain the pose of a relatively long time period, and calculating a luminosity error loss function of the video image sequence pose consistency constraint based on the depth image output by the depth estimation network and the generated key point thermodynamic diagram; training the pose estimation network and the depth estimation network based on the obtained luminosity error loss function; and estimating the camera pose corresponding to each frame of image in the video image sequence with the pose to be estimated by using the trained pose estimation network. According to the invention, the precision of camera pose estimation can be improved.

Description

technical field [0001] The invention relates to the field of computer vision, in particular to a self-supervised monocular visual odometry method guided by a key point heat map. Background technique [0002] Visual odometry refers to a method of estimating the current position and attitude of a camera based on input video image frames. It can be widely used in robot navigation, autonomous driving, augmented reality, wearable computing and other fields. According to the type and number of sensors used, visual odometry can be divided into monocular visual odometry, binocular visual odometer and visual odometer fused with inertial information. Among them, the monocular visual odometry has the advantages of only needing one camera, less hardware requirements, and no need for correction. [0003] The traditional visual odometry method first extracts and matches image features, and then estimates the relative pose between two adjacent frames according to the geometric relationshi...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06T7/73G06T3/40G06T7/55G06N3/04G06N3/08
CPCG06T7/73G06T3/4007G06T7/55G06N3/08G06T2207/10016G06T2207/10028G06T2207/20081G06T2207/20084G06N3/045
Inventor 曾慧修海鑫刘红敏樊彬张利欣
Owner UNIV OF SCI & TECH BEIJING