Hand posture estimation system and method based on RGBD fusion network

A pose estimation, fusion network technology, applied in the field of computer vision and deep learning, can solve the problems of time-consuming, complex 3D data conversion process, unknown generalization ability of other objects, etc., to achieve high efficiency, good applicability, good generalization The effect of sexual ability

Active Publication Date: 2019-08-27
DALIAN UNIV OF TECH
View PDF4 Cites 30 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Moon et al. converted depth data into 3D voxel representation, and used a more complex 3D CNN for learning, achieving higher prediction accuracy. However, the conversion process for constructing 3D data is complex and time-consuming.
[0005] The above research on hand posture detection limits the environment to empty-handed movements and a third perspective facing the camera, reducing occlusion as much as possible, which brings great benefits to joint detection. Convenience, but when the scen

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Hand posture estimation system and method based on RGBD fusion network
  • Hand posture estimation system and method based on RGBD fusion network
  • Hand posture estimation system and method based on RGBD fusion network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0062] Such as figure 1 As shown, the present invention provides a hand posture estimation system based on RGBD fusion network, comprising:

[0063] Global deep feature extraction modules such as image 3 As shown, a parallel two-way cross-fusion residual network is used. The upper road is a high-resolution feature map, and the lower road is a low-resolution feature map. Multi-scale feature fusion is performed by cross-fusion of multi-resolution information, and finally in The high-resolution feature map predicts the network output; the global deep feature extraction module uses fewer residual blocks to ensure the extraction of low-level features. The residual block compares the original input information pixel by pixel with the features processed by the subsequent network add. Most of the existing networks have a serial structure, that is, the feature map is reduced from high resolution to low resolution and then restored from low resolution to high resolution. The resoluti...

Embodiment 2

[0069] The invention provides a hand posture estimation method based on RGBD fusion network, comprising:

[0070] Step 1: Prepare the dataset;

[0071] Step 11: obtain image data set; Described image data set comprises color image and depth image; In the present embodiment, select the FAHD data set of open source sharing, this data set is the data set of the task-oriented operation that Imperial College of Technology establishes, with The first-person perspective captures the daily actions of manipulating objects, and provides 3D position annotations of 21 joint points. The image data uses Realsense SR300 to collect 45 kinds of daily actions interacting with 26 kinds of objects in the three environments of kitchen, social life and work from the first-person perspective, such as pouring milk, opening a bottle, writing, etc., through the magnetic sensor system attached to the hand Automatic labeling. This dataset provides color images corresponding to depth images, totaling 10...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a hand posture estimation system and method based on an RGBD fusion network. The system comprises a global depth feature extraction module, a residual module, a multi-mode feature fusion module and a branch parallel interference elimination module. The global depth feature extraction module adopts two parallel paths of cross-fused residual networks, wherein the upper path isa high-resolution feature map, the lower path is a low-resolution feature map, carries out the multi-scale feature fusion by the cross-fusing multi-resolution information, and finally predicts the network output in a high-resolution feature map. An input part of the system is divided into a depth image processing branch and an RGB color image processing branch, the features extracted by the two branches are subjected to multi-mode fusion to form the global features, the global features are sent to a branch parallel interference elimination module to perform feature extraction of hand branches, and the reinforced hand branch features are obtained and used for final joint position prediction. According to the method, the hand posture estimation with higher accuracy is achieved mainly through the information synthesis of the color images and the depth images.

Description

technical field [0001] The present invention relates to the technical fields of computer vision and deep learning, in particular, to a hand posture estimation system and method based on RGBD fusion network. Background technique [0002] Vision-based 3D hand pose estimation is a hot research issue in computer vision, virtual reality, robotics and other fields, and there have been many research results. However, so far, the estimation of human hand pose based on visual information is still a problem that has not been perfectly solved. The highly flexible changes of finger joints, the high similarity between different fingers, the occlusion of each other when fingers move, and the objects The occlusion of fingers brings great challenges to hand pose estimation and hinders the development of intelligence and full automation in corresponding application fields. Therefore, it is of great significance to develop a hand pose estimation method with better performance. [0003] Rese...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/00G06K9/62G06N3/04
CPCG06V40/28G06V40/11G06N3/045G06F18/214G06F18/253
Inventor 林相波周一丹孙怡马晓红
Owner DALIAN UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products