Unlock instant, AI-driven research and patent intelligence for your innovation.

3D attitude estimation method based on graph cavity convolution encoder and decoder

A convolutional coding and pose estimation technology, applied in the field of computer vision, can solve the problems of non-local information ignoring position coding information with rich semantic information, ignoring multi-scale context information and semantic information, etc., to achieve the effect of improving prediction performance

Active Publication Date: 2021-08-13
CHENGDU KOALA URAN TECH CO LTD
View PDF12 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] Based on the above problems, the present invention provides a 3D pose estimation method based on a graph-caused convolutional encoder-decoder, which solves the problem of ignoring multi-scale context information and semantic information in existing methods based on graph neural networks and extracting non-local Information ignores the problem of positional encoding information with rich semantic information

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • 3D attitude estimation method based on graph cavity convolution encoder and decoder
  • 3D attitude estimation method based on graph cavity convolution encoder and decoder
  • 3D attitude estimation method based on graph cavity convolution encoder and decoder

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0036] The present invention will be further described below in conjunction with the accompanying drawings. Embodiments of the present invention include, but are not limited to, the following examples.

[0037] In this embodiment, a 3D pose estimation method based on a graph hole convolution encoder and decoder includes a graph hole convolution encoder decoder model, and the graph hole convolution encoder decoder model is composed of a graph hole convolution GAC and a The graph transformer GTL is combined and stacked to form an encoder-decoder network structure, which can effectively extract local multi-scale context and global long-range connections in pose, and can greatly improve the performance of 3D pose estimation, where:

[0038] The graph hole convolution focuses on expanding the receptive field of the convolution kernel and learns the local multi-scale context, which is used to extract the multi-scale context information in the skeleton. In the graph hole convolution,...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention relates to the field of computer vision, in particular to a 3D attitude estimation method based on a graph cavity convolution encoder and decoder. The method comprises the following steps: S1, selecting a training data set; S2, constructing a graph cavity convolution encoder and decoder model; S3, preprocessing the training data set; S4, carrying out initialization operation on the graph cavity convolution encoder and decoder model; S5, training a graph cavity convolution encoder and decoder model; S6, verifying a graph cavity convolution encoder and decoder model on the selected training data set. According to the method, the multi-scale context information can be effectively extracted, the overall and global range connection can be accurately captured, the information is very helpful for 3D attitude estimation, and the prediction performance of 3D attitude estimation can be greatly improved.

Description

technical field [0001] The invention relates to the field of computer vision, in particular to a 3D attitude estimation method based on a graph hole convolution encoder and decoder. Background technique [0002] For decades, Human Pose Estimation has attracted much attention in the computer vision community. It is a key step in understanding the behavior of characters in images and videos. Human pose estimation includes 2D pose estimation and 3D pose estimation. Among them, 2D pose estimation is mainly 2D human body joint point coordinates are estimated from images, and 3D pose estimation aims to regress from 2D joint point coordinates (or 2D images) to 3D pose estimation. 3D pose estimation is now attracting more and more attention in many computer vision fields , such as intelligent monitoring, human-computer interaction, video understanding, and VR, etc. In this task, the data used is skeleton data, a series of 2D coordinates of human joint points. Compared with RGB data,...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/00G06N3/04G06N3/08
CPCG06N3/08G06V40/20G06N3/045
Inventor 沈复民朱怡燃徐行申恒涛
Owner CHENGDU KOALA URAN TECH CO LTD