Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Indoor RGB-D image semantic segmentation method based on wavelet transform

An RGB-D and wavelet transform technology, applied in the field of computer vision, can solve the problems such as not making full use of the complementarity of RGB color information and depth information, affecting the segmentation accuracy of small target objects with high efficiency, and losing multiple scale information features. Achieve the effect of speeding up inference, reducing the amount of parameters and calculations, and achieving good semantic segmentation

Pending Publication Date: 2022-08-02
GUIZHOU UNIV
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, there are still many problems in the above method: 1) The depth image is simply fused with the color image as the fourth channel, and the complementarity between RGB color information and depth information is not fully utilized; 2) There are problems such as loss of multiple scale information features in different methods, However, for the indoor scene semantic segmentation problem, the efficiency of multi-scale feature extraction affects the segmentation accuracy of small target objects
3) The depth image is more sparse than the RGB image, which represents the depth value of each pixel in the RGB image, and the edge contour information, that is, high-frequency information, can be better obtained from the depth image, but traditional convolution and pooling Such operations tend to lose high-frequency features

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Indoor RGB-D image semantic segmentation method based on wavelet transform
  • Indoor RGB-D image semantic segmentation method based on wavelet transform
  • Indoor RGB-D image semantic segmentation method based on wavelet transform

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0021] In order to deepen the understanding of the present invention, the present invention will be described in further detail below with reference to the embodiments. The embodiments are only used to explain the present invention and do not constitute a limitation on the protection scope of the present invention.

[0022] according to figure 1 , 2 , 3, this embodiment provides an indoor RGB-D image semantic segmentation method based on wavelet transform, which makes full use of the high-frequency features that the past methods failed to utilize, and retains the contour details and other information in the image information, effectively It greatly improves the accuracy of indoor image semantic segmentation and inference speed, and reduces the amount of parameters of the network.

[0023] First, a convolutional neural network based on wavelet transform is constructed, and a network with an encoder-decoder structure is used. The encoder adopts ResNet-50 as the skeleton network...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an indoor RGB-D image semantic segmentation method based on wavelet transform. The method comprises the following steps: firstly, constructing a convolutional neural network based on wavelet transform; secondly, performing multi-scale fusion on image features of different frequencies of the original image at the joint of the encoder and the decoder through a wavelet transform and inverse transform combined module; and finally, carrying out multiple times of up-sampling on the features through a decoder network, carrying out double up-sampling on the features by each module of a decoder, better carrying out feature mapping through convolution and jump connection of an encoder, gradually recovering a high-resolution image, and outputting a semantic segmentation result. According to the method, high-frequency features which are not utilized by a conventional method are fully utilized, information such as contour details in image information is reserved, the indoor image semantic segmentation accuracy and the reasoning speed are effectively improved, and the parameter quantity of a network is reduced.

Description

technical field [0001] The invention relates to the field of computer vision, in particular to an indoor RGB-D image semantic segmentation method based on wavelet transform. Background technique [0002] With the development of the computer vision field, semantic segmentation of images has become an important topic in this field, classifying each pixel in the image and predicting the shape of its label and position, and providing the relevant scene through the above operations. complete understanding. Semantic segmentation is now widely used in autonomous driving, remote sensing analysis, and medical image processing. For indoor scene semantic segmentation, due to the complex factors affecting indoor scene semantic segmentation (lighting, occlusion, etc.), reducing the influence of these factors in indoor scene research and improving the accuracy of semantic segmentation has become the main problem. . Previous deep learning methods achieve end-to-end image semantic segmen...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06V10/52G06V10/80G06V10/82G06N3/08G06N3/04
CPCG06V10/52G06N3/08G06V10/806G06V10/82G06N3/045
Inventor 张荣芬范润泽刘宇红李景玉
Owner GUIZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products