A 3D Semantic Predictive Segmentation Method for Object Image Based on Asymmetric Coding Network

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A coding network and object image technology, which is applied in the field of 3D semantic prediction and segmentation of object images in asymmetric coding networks, can solve the problems of no substantial progress in technology, unsatisfactory results, and low accuracy of manual features. high precision effect

Active Publication Date: 2022-05-31

ZHEJIANG UNIVERSITY OF SCIENCE AND TECHNOLOGY

View PDF9 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, the low precision of manual features prevents the technology from making substantial progress, which leads to the unsatisfactory results of traditional methods in the field of semantic segmentation.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0041] The present invention will be further described in detail below with reference to the embodiments of the accompanying drawings.

[0042] A semantic segmentation method based on convolutional neural network proposed by the present invention, the overall implementation block diagram is as follows figure 1 shown, it includes the following steps:

[0043] Step 1-1: Select I original RGB images and their corresponding depth maps, and combine the semantic labels corresponding to each original RGB image to form a training set, and mark the i-th original RGB image in the training set as {L i RGB (p,q)}, compare the training set with {L i RGB (p,q)} The corresponding depth image is denoted as The corresponding semantic label is denoted as Among them, I is a positive integer, I≥700, if I=795, i is a positive integer, 1≤i≤I, 1≤p≤W, 1≤q≤H, W represents {L i RGB (p,q)}, and The width of , H represents {L i RGB (p,q)}, and The height of , W and H are both divisible...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses an object image 3D semantic prediction and segmentation method of an asymmetric coding network. Select RGB images, depth maps and corresponding semantic labels to form a training set, construct a convolutional neural network of an asymmetric encoding network, including two input layers, a hidden layer and three output layers, and input the training set to the convolutional neural network The training is carried out in the training set to obtain the predicted segmented image corresponding to each RGB image in the training set, and the loss function value between the predicted segmented image corresponding to each RGB image in the training set and the corresponding real depth image is calculated, and the minimum value is continuously trained The weight vector and bias item corresponding to the loss function value; the RGB image and depth image to be predicted are input into the trained deep separable convolutional neural network training model to obtain the predicted segmented image. The invention constructs a network structure of an asymmetric coding network, realizes 3D semantic prediction segmentation of an object image, and has high precision of segmentation results.

Description

technical field [0001] The invention relates to a semantic segmentation technology, in particular to a 3D semantic prediction and segmentation method of an object image using an asymmetric coding network. Background technique [0002] The economic take-off has prompted the vigorous development of technology, and artificial intelligence has emerged in response to the needs of the times, and has continuously improved people's quality of life. Various technologies of artificial intelligence have also attracted more and more people's attention. As one of the representatives of artificial intelligence, computer vision tasks have also received increasing attention. 3D semantic segmentation technology is one of the basic computer vision tasks. The importance of visual scene understanding is becoming more and more important. [0003] The goal of indoor 3D semantic segmentation is to predict the class label for each pixel in the input image. It is a fundamental task of computer vis...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G06T7/10

CPCG06T7/10G06T2207/10028G06T2207/20081G06T2207/20084

Inventor 周武杰袁建中吕思嘉雷景生钱亚冠何成王海江

Owner ZHEJIANG UNIVERSITY OF SCIENCE AND TECHNOLOGY

A 3D Semantic Predictive Segmentation Method for Object Image Based on Asymmetric Coding Network

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology