Real-time large-scene three-dimensional semantic modeling method

A large-scene and semantic technology, applied in the field of computer vision, can solve the problems of poor SLAM system effect and failure to solve 3D semantic reconstruction.

Active Publication Date: 2020-06-19
SHENZHEN GRADUATE SCHOOL TSINGHUA UNIV
View PDF2 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Due to the limitations of 2D semantic segmentation on 3D space understanding, semantic SLAM systems based on 2D semantic se

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Real-time large-scene three-dimensional semantic modeling method
  • Real-time large-scene three-dimensional semantic modeling method
  • Real-time large-scene three-dimensional semantic modeling method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] In order to make the technical problems, technical solutions and beneficial effects to be solved by the embodiments of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0041] It should be noted that when an element is referred to as being “fixed” or “disposed on” another element, it may be directly on the other element or be indirectly on the other element. When an element is referred to as being "connected to" another element, it can be directly connected to the other element or indirectly connected to the other element. In addition, the connection can be used for both fixing function and circuit communication function.

[0042] It is to be understood that the terms "length", "width", "top", "bottom", "front"...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a real-time large-scene three-dimensional semantic modeling method. The method comprises the following steps of S1, constructing a three-dimensional geometric model through an RGB image and a depth image obtained by scanning a scene through a sensor; s2, inputting the three-dimensional geometric model into a three-dimensional convolutional neural network to complete semanticsegmentation; s3, integrating semantic tags output by the three-dimensional convolutional neural network into the three-dimensional geometric model to complete semantic modeling; wherein the step ofconstructing the three-dimensional geometric model and the step of semantic segmentation are combined in a multi-thread mode and are carried out at the same time. Joint real-time three-dimensional geometric reconstruction and semantic reconstruction are realized; by adopting the sparse convolutional neural network and accelerating the calculation of the convolutional network, the performance of real-time operation can be achieved; according to the method, two-dimensional convolution in UNet is replaced by three-dimensional convolution, and meanwhile, the depth of the UNet network is increased,so that the capacity of the convolutional network is larger, and the semantic segmentation accuracy is improved.

Description

technical field [0001] The invention relates to the technical field of computer vision, in particular to a method for real-time three-dimensional semantic modeling of a large scene. Background technique [0002] 3D reconstruction of large scenes is an important problem in the field of computer vision. Technologies such as autonomous driving, indoor robot navigation, AR, and VR all rely on large-scene 3D reconstruction technology. Its goal is to use portable devices to dynamically scan the scene, and through algorithm processing, it can easily generate a 3D model of the entire scene. [0003] In addition to obtaining the geometric information of the scene, in practical applications, another information we are interested in is semantic information, that is, we also want to know which parts the scene can be divided into and what objects they are. This is the goal of 3D semantic segmentation. Good semantic information is very important to realize the complex interaction of in...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06T7/55G06T7/10G06T17/20G06N3/04G06N3/08
CPCG06T7/55G06T7/10G06T17/20G06N3/08G06T2200/24G06T2207/10016G06T2207/10024G06T2207/10028G06T2207/20081G06T2207/20084G06N3/045Y02T10/40
Inventor 方璐韩磊郑添王好谦
Owner SHENZHEN GRADUATE SCHOOL TSINGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products