Unlock instant, AI-driven research and patent intelligence for your innovation.

A system and method for semantic scene completion

A semantic scene and completion technology, applied in the field of 3D semantic scene completion, can solve the problems of difficult performance, low representation resolution, poor scalability, etc., to improve the accuracy of semantic scene completion, improve the completion effect, and be practical The effect of sexual security

Inactive Publication Date: 2019-03-08
INST OF COMPUTING TECH CHINESE ACAD OF SCI
View PDF4 Cites 19 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0007] To sum up, the current semantic scene completion has the following problems: it is more difficult to directly extract the features of color images and depth images in 3D space, because the limitation of storage and calculation time makes the voxel representation resolution of the scene not high, thus It will become more difficult to extract high-quality features; for different inputs such as RGB or Depth, using a customized network structure will lead to poor scalability, which is not conducive to the use of more sensors in the future; tightly coupled network design The mode makes it very difficult to improve the performance in the future. If you want to improve the overall performance, you need to change the structure of the entire framework

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A system and method for semantic scene completion
  • A system and method for semantic scene completion
  • A system and method for semantic scene completion

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0073] Such as figure 1 As shown, embodiment one: single-branch structure for Depth or RGB-D input:

[0074] 1) Prepare the training data set.

[0075] The training data set should contain a sufficient number of training samples. The training samples are collected by devices such as RGB-D cameras, or they can come from general data sets. In addition, each image needs to be manually annotated (to obtain the semantic segmentation of each image and the 3D voxel representation of the corresponding scene). This example is mainly aimed at the semantic scene completion of indoor scenes. In this embodiment, the NYU v2 dataset is used for training. The toolbox provided in the NYU v2 dataset can be used to obtain the image pairs corresponding to the synchronized scene color map and depth map, and The semantic segmentation results corresponding to each color image and the 3D voxel representation of the corresponding scene. When training the network, the color map and depth map are use...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to a method and system for completing a semantic scene. The method comprises obtaining a labeled color image and a depth image as training data, wherein the color image and the depth image are labeled with a semantic segmentation label and a semantic scene complement label; training convolution neural network with training data to get semantic segmentation model and semanticscene complement model, input the image to be complemented to semantic segmentation model to get semantic segmentation result; according to the camera parameters and depth images of color images, obtaining the mapping relationship between pixels in color images and voxels in depth images; according to the mapping relationship, projecting the result of semantic segmentation to three-dimensional space, and obtaining the semantic scene surface of the image to be completed; discretizing the surface of the semantic scene and inputting to the semantic scene completion model to obtain the three-dimensional structure of the image to be completed and the category of the object in the image to be completed, and outputting the three-dimensional structure and the category as the semantic scene completion results of the image to be completed.

Description

technical field [0001] The invention relates to the technical field of three-dimensional semantic scene completion, in particular to a multi-sensor three-dimensional semantic scene completion system and method based on a convolutional neural network. Background technique [0002] Semantic scene completion, that is, the restoration and semantic understanding of 3D scenes, has always been an important issue in computer vision technology. For example, robots can interact more with the surrounding environment only if they have the ability to perceive the three-dimensional structure of the scene and understand three-dimensional objects like humans. Therefore, if the three-dimensional structure of the scene can be well restored and the object categories in the scene can be obtained, it will greatly promote the development of artificial intelligence. In addition, semantic scene completion can also be widely used in augmented reality, path planning and navigation, construction of e...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06T5/50G06T7/10G06N3/04G06N3/08
CPCG06N3/08G06T5/50G06T7/10G06T2207/20221G06T2207/20084G06T2207/20081G06T2207/10024G06T2207/10028G06N3/045
Inventor 刘世策胡瑜曾一鸣唐乾坤金贝贝韩银和李晓维
Owner INST OF COMPUTING TECH CHINESE ACAD OF SCI