Multi-view feature fusion method and system for 3D human body posture estimation
A human body posture and feature fusion technology, applied in neural learning methods, calculations, computer components, etc., can solve problems such as partial occlusion of different viewing angles, failure to construct learning models, and neglect of local spatial correlation, etc., to achieve high flexibility.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0040] In this embodiment, a multi-view feature fusion method based on a hybrid attention mechanism is disclosed, such as figure 1 shown.
[0041] The main goal of this embodiment is to obtain the position of the 3D human body posture in absolute world coordinates, that is, the set of three-dimensional coordinates of each joint point of the human body posture A specific number is assigned to each joint point, and the reconstructed joint points are connected in sequence to form a three-dimensional human skeleton.
[0042] The steps of the multi-view feature fusion method based on the hybrid attention mechanism of this embodiment are as follows:
[0043] S1. Obtain target images from different perspectives that require attitude estimation;
[0044] During the specific implementation, the target image that needs attitude estimation can be obtained through a device such as a camera, and the camera can be placed at different positions to obtain images of different perspectives. In...
Embodiment 2
[0084] For the constructed multi-view feature fusion model based on hybrid attention mechanism, the example process in practical application is as follows: image 3 , the process is as follows:
[0085] S1. Obtain the data of the original image, and then preprocess the original image data to obtain the preprocessed image;
[0086] S2, performing data enhancement on the preprocessed image to obtain an enhanced image;
[0087] S3. Input the preprocessed image as an input into the model constructed above to obtain the predicted 3D human posture representation, Figure 4 is the visualization result of the model;
[0088] allowable Figure 4 See the comparison effect of the 2D human body posture predicted by the present invention and the ground truth (ground truth), and the predicted 3D human body posture;
[0089] S4. Visually display the output on the user's mobile phone or computer screen.
Embodiment 3
[0091] The present invention also provides a computer system, including a memory, a processor, and a computer program stored in the memory and running on the processor, where the processor implements the steps of any of the foregoing embodiments when the processor executes the computer program.
[0092] In summary, the present invention learns the channel correlation of multi-view depth feature sets through SENet, uses a neural network module to learn the local space correlation of channel feature maps, and generates weight features in the form of learning masks for each channel feature map element, And it is integrated into a unified 3D human posture representation according to the perspective, which has the characteristics of self-adaptation and high flexibility.
[0093] The invention can solve the "partial occlusion" problem in the technical solution of 3D human posture estimation with the idea of feature fusion, and can be easily embedded into the multi-view 3D human pos...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


