Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

An inference network for 3D coordinate estimation of human joint and a method thereof

A technology of human joints and networks, applied in the field of virtual reality, can solve problems such as 3D inference result error, error accumulation, and loss of information, and achieve the effects of avoiding accumulated error, reducing the amount of calculation, and reducing the degree of nonlinearity

Inactive Publication Date: 2019-02-01
QINGDAO RES INST OF BEIHANG UNIV
View PDF3 Cites 13 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Divide a task into two stages, and the errors generated in each stage will accumulate, which will cause greater errors in the final result performance
The second is that in this solution, the performance of 3D pose estimation depends entirely on the 2D results, which will cause a part of the information to be lost.
However, the methods used in the prior art do not make full use of intermediate features, resulting in large errors in the final 3D inference results.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • An inference network for 3D coordinate estimation of human joint and a method thereof
  • An inference network for 3D coordinate estimation of human joint and a method thereof
  • An inference network for 3D coordinate estimation of human joint and a method thereof

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0049] Example 1, such as figure 2 As shown, in order to make full use of the value of CNN in 3D human pose estimation, this application regards 3D pose estimation as a key point positioning problem in discretized 3D space.

[0050] In human 2D pose estimation, the output structure of the neural network is iteratively processed to generate predictions in multiple processing stages. These intermediate forecasts are gradually refined to produce more accurate estimates.

[0051] The "hourglass network" is such a design structure, which uses a cascading scheme to predict the results multiple times and gradually correct the results.

[0052] In the 3D pose estimation of this application, a prediction scheme from "coarse" to "fine" is designed.

[0053] Given the highest 3D resolution of 64x64x64 with 16 articulation points, the possibility of more than 4 million voxels needs to be estimated. In order to solve the problem of large resolution, the prediction scheme adopted in thi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

A 3D coordinate estimation inference network for human joint and a method thereof, taking 3D coordinate estimation as the key point location problem of discretized 3D spatial attitude inference, Instead of directly regressing the 3D coordinates of joints (x, y, z), CNN is trained to predict the probability of each voxel of each joint in the volume, so as to form a 3D thermodynamic map, so as to improve the accuracy of 3D coordinate data for human posture estimation, reduce the non-linearity of the task of directly regressing joints, and improve the learning effect. The inference network is a model structure with n (n >= 2) order hourglass network (Hourglass) as the center and m order (m >= 2) cascade.

Description

technical field [0001] The invention relates to an inference network and method for estimating 3D coordinates of human body joints, and belongs to the technical field of virtual reality. Background technique [0002] The 3D pose estimation of the human body is to accurately estimate the 3D positions of several joints of the human body (such as the head, shoulder, elbow, etc.). Due to the loss of depth information, estimating the position of the 3D joint points of the human body from the RGB video stream is a great challenge in the field of computer vision. [0003] With the development of convolutional neural network (Convolutional Neural Networks, hereinafter referred to as CNN) technology, especially in the field of pattern classification, since the network avoids the complicated preprocessing of images, it can directly input the original image, thus obtaining more Wide range of applications. [0004] At present, more and more tasks in computer vision are solved using CN...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/00G06K9/62
CPCG06V40/20G06V2201/07G06F18/217G06F18/253
Inventor 李帅孟文明于洋付延生
Owner QINGDAO RES INST OF BEIHANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products