Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Based on CNN semantic segmentation depth prediction method and device

A technology of depth prediction and semantic segmentation, applied in neural learning methods, image analysis, image enhancement, etc., can solve problems such as inability to solve depth problems, insufficient robustness of geometric consistency, etc., and achieve the effect of optimizing the inaccurate absolute scale

Active Publication Date: 2022-08-09
BEIJING YINGPU TECH CO LTD
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, the semantic SLAM model based on deep learning has no concept of key frames, and is calculated one by one. The monocular SLAM based on geometry cannot solve the depth problem, and is not robust enough in geometric consistency.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Based on CNN semantic segmentation depth prediction method and device
  • Based on CNN semantic segmentation depth prediction method and device
  • Based on CNN semantic segmentation depth prediction method and device

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0045] The experimental data set used in this application is the KITTI data set (co-founded by Karlsruhe Institute of Technology in Germany and Toyota American Institute of Technology), which is currently the largest computer vision algorithm evaluation data set in the world for autonomous driving scenarios. The acquisition platform of the KITTI dataset includes: 2 grayscale cameras, 2 color cameras, a Velodyne 3D lidar, 4 optical lenses, and a GPS navigation system. The entire dataset consists of 389 pairs of stereo images and optical flow maps, 39.2 km visual odometry sequences, and more than 200,000 images of 3D annotated objects, each of which includes up to 15 vehicles and 30 pedestrians, and also contains varying degrees of occlude.

[0046] figure 1 This is a flowchart of a method for depth prediction based on CNN semantic segmentation according to an embodiment of the present application. see figure 1 , the method includes:

[0047] 101: Obtain the current frame fr...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The present application discloses a depth prediction method and device based on CNN semantic segmentation, and relates to the field of semantic segmentation. The method includes: acquiring a current frame from a data set, calculating an ORB feature and a map point cloud of the current frame; according to the ORB feature and map point cloud of the current frame, determining that the current frame is selected as a key when a specified condition is satisfied frame; the CNN convolutional neural network is used to associate the key frame with the depth map to make depth prediction, and the CNN is used to label the depth map with semantic labels to achieve semantic association. The device includes: a calculation module, a selection module and a prediction module. This application no longer inputs each image into the model, which solves the memory problem, and adjusts the depth regression through CNN, which optimizes the problem of inaccurate absolute scale during 3D reconstruction.

Description

technical field [0001] The present application relates to the field of semantic segmentation, and in particular, to a method and device for depth prediction based on CNN semantic segmentation. Background technique [0002] Semantic SLAM (Simultaneous Localization And Mapping) is applied in image semantic segmentation, semantic map construction and other directions, expanding the research content of traditional SLAM problems. At present, there have been some researches that integrate semantic information into SLAM. For example, using the geometric consistency between the images obtained in the SLAM system to promote image semantic segmentation, or using the results of semantic segmentation / mapping to promote SLAM localization / loop closure, etc. [0003] Existing semantic SLAM models such as MaskFusion are essentially RGBD-SLAM+ semantic segmentation mask-rcnn, that is, the fusion of geometric SLAM model and deep learning SLAM model, which mainly solves the problem of understa...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06T7/11G06N3/04G06N3/08G06T7/33G06T7/50
CPCG06T7/11G06T7/50G06T7/33G06N3/084G06T2207/10028G06N3/044G06N3/045
Inventor 吴霞
Owner BEIJING YINGPU TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products