Indoor RGB-D image semantic segmentation method based on wavelet transform

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
An RGB-D and wavelet transform technology, applied in the field of computer vision, can solve the problems such as not making full use of the complementarity of RGB color information and depth information, affecting the segmentation accuracy of small target objects with high efficiency, and losing multiple scale information features. Achieve the effect of speeding up inference, reducing the amount of parameters and calculations, and achieving good semantic segmentation

Pending Publication Date: 2022-08-02

GUIZHOU UNIV

View PDF0 Cites 3 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, there are still many problems in the above method: 1) The depth image is simply fused with the color image as the fourth channel, and the complementarity between RGB color information and depth information is not fully utilized; 2) There are problems such as loss of multiple scale information features in different methods, However, for the indoor scene semantic segmentation problem, the efficiency of multi-scale feature extraction affects the segmentation accuracy of small target objects

3) The depth image is more sparse than the RGB image, which represents the depth value of each pixel in the RGB image, and the edge contour information, that is, high-frequency information, can be better obtained from the depth image, but traditional convolution and pooling Such operations tend to lose high-frequency features

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0021] In order to deepen the understanding of the present invention, the present invention will be described in further detail below with reference to the embodiments. The embodiments are only used to explain the present invention and do not constitute a limitation on the protection scope of the present invention.

[0022] according to figure 1 , 2 , 3, this embodiment provides an indoor RGB-D image semantic segmentation method based on wavelet transform, which makes full use of the high-frequency features that the past methods failed to utilize, and retains the contour details and other information in the image information, effectively It greatly improves the accuracy of indoor image semantic segmentation and inference speed, and reduces the amount of parameters of the network.

[0023] First, a convolutional neural network based on wavelet transform is constructed, and a network with an encoder-decoder structure is used. The encoder adopts ResNet-50 as the skeleton network...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses an indoor RGB-D image semantic segmentation method based on wavelet transform. The method comprises the following steps: firstly, constructing a convolutional neural network based on wavelet transform; secondly, performing multi-scale fusion on image features of different frequencies of the original image at the joint of the encoder and the decoder through a wavelet transform and inverse transform combined module; and finally, carrying out multiple times of up-sampling on the features through a decoder network, carrying out double up-sampling on the features by each module of a decoder, better carrying out feature mapping through convolution and jump connection of an encoder, gradually recovering a high-resolution image, and outputting a semantic segmentation result. According to the method, high-frequency features which are not utilized by a conventional method are fully utilized, information such as contour details in image information is reserved, the indoor image semantic segmentation accuracy and the reasoning speed are effectively improved, and the parameter quantity of a network is reduced.

Description

technical field [0001] The invention relates to the field of computer vision, in particular to an indoor RGB-D image semantic segmentation method based on wavelet transform. Background technique [0002] With the development of the computer vision field, semantic segmentation of images has become an important topic in this field, classifying each pixel in the image and predicting the shape of its label and position, and providing the relevant scene through the above operations. complete understanding. Semantic segmentation is now widely used in autonomous driving, remote sensing analysis, and medical image processing. For indoor scene semantic segmentation, due to the complex factors affecting indoor scene semantic segmentation (lighting, occlusion, etc.), reducing the influence of these factors in indoor scene research and improving the accuracy of semantic segmentation has become the main problem. . Previous deep learning methods achieve end-to-end image semantic segmen...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G06V10/52G06V10/80G06V10/82G06N3/08G06N3/04

CPCG06V10/52G06N3/08G06V10/806G06V10/82G06N3/045

Inventor 张荣芬范润泽刘宇红李景玉

Owner GUIZHOU UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Indoor RGB-D image semantic segmentation method based on wavelet transform

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology