Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Precise three-dimension hand and human body attitude estimating method based on single depth image

A technology of depth image and human pose, applied in the field of 3D hand and human pose estimation

Inactive Publication Date: 2018-06-01
SHENZHEN WEITESHI TECH
View PDF2 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] Aiming at the problems of perspective distortion and nonlinear mapping, the purpose of the present invention is to provide an accurate 3D hand and human pose estimation method based on a single depth image. Firstly, the overall architecture of the network is given, and then the convolutional neural network The position of the target is improved, and then the input of the system is constructed using the back-projection technique, and finally the voxel pair is composed of four types of building blocks: the volume basic block, the volume residual block, the volume down-sampling block and the volume up-sampling block, as well as the encoder and the decoder. Voxel prediction network

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Precise three-dimension hand and human body attitude estimating method based on single depth image
  • Precise three-dimension hand and human body attitude estimating method based on single depth image
  • Precise three-dimension hand and human body attitude estimating method based on single depth image

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0033] It should be noted that, in the case of no conflict, the embodiments in the present application and the features in the embodiments can be combined with each other. The present invention will be further described in detail below in conjunction with the drawings and specific embodiments.

[0034] figure 1 It is an overall architecture diagram of a voxel-to-voxel prediction network for an accurate 3D hand and human pose estimation method based on a single depth image of the present invention. First, by back-projecting the points into the 3D space and discretizing the continuous space, the 2D depth map is transformed into a 3D volume representation; then, the 3D voxelized data is used as the input of the voxel-to-voxel prediction network , used to estimate the likelihood value of each voxel of each joint; finally, find out the position corresponding to the maximum likelihood value of each joint and the real coordinates it represents, and use this as the final result of the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a precise three-dimension hand and human body attitude estimating method based on a single depth image; the method mainly comprises the following contents: a network model, an improved target position, system input, a voxel-to-voxel prediction network; the method comprises the following steps: firstly providing a network whole framework, using a convolution nerve network-based method to improve the target position, using a back projection technology to build the system input, finally using four structure blocks and an encoder and a decoder to form the voxel-to-voxel prediction network, wherein the four blocks comprises a volume basic block, a volume residual block, a volume lower sampling block and a volume upper sampling block. The method can solve the perspective distortion and nonlinear mapping problems, can obtain high precision three-dimension hand and human body attitude estimations, uses less time consumption, and can realize real time human body behaviorprediction and estimation.

Description

technical field [0001] The invention relates to the field of three-dimensional hand and human posture estimation, in particular to an accurate three-dimensional hand and human posture estimation method based on a single depth image. Background technique [0002] Human behavior interaction means that the computer understands and responds to human actions and behaviors by locating and identifying humans, tracking human limb movement trajectories, and tracking facial expression features. Its application background is very extensive, mainly focusing on human-computer interaction, virtual reality, smart home, smart security, smart video surveillance, patient monitoring system, auxiliary training for athletes, and content-based video retrieval and smart image compression. The method of human behavior interaction. For example, by detecting and estimating suspicious hand movements or postures of people in public places such as train stations and airports, it can assist security per...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/00
CPCG06V20/64G06V40/107G06V40/103
Inventor 夏春秋
Owner SHENZHEN WEITESHI TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products