Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Human skeleton action prediction method based on multi-task non-autoregressive decoding

A technology for human skeleton and action prediction, applied in the field of computer vision, can solve the problem of not considering the prediction error and so on

Pending Publication Date: 2020-11-13
ZHEJIANG UNIV
View PDF0 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Traditional methods often use autoregressive decoding, that is, the prediction result of the next frame is completely dependent on the prediction result of the previous frame, and this recursive method is used to predict the future time, without considering the prediction error will also follow the non-autoregressive The decoding process propagates the fact that

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Human skeleton action prediction method based on multi-task non-autoregressive decoding
  • Human skeleton action prediction method based on multi-task non-autoregressive decoding
  • Human skeleton action prediction method based on multi-task non-autoregressive decoding

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0098] The implementation method of this embodiment is as described above, and the specific steps will not be described in detail. The following only shows the effect of the case data. The present invention is implemented on two data sets with ground-truth labels, namely:

[0099] Human3.6M dataset: This dataset contains about 3,600,000 human 3D poses, including about 15 actions;

[0100] CMU Mocap dataset: This dataset contains about 86,000 human 3D poses, which contain about 8 actions.

[0101] In the above prediction method, the parameters are set as follows N=25, T 1 =50,T 2 =10~25. Finally, the results of the method of this embodiment (referred to as mNAT) on the two data sets are shown in Tables 1 and 2, and the results of some methods in the prior art are also listed in the table for comparison.

[0102] Table 1. Comparison of evaluation indicators in this embodiment on the Human3.6M data set

[0103]

[0104] Table 2. Comparison of evaluation indicators in this...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a human skeleton motion prediction method based on multi-task non-autoregressive decoding, which is used for solving the problem of motion prediction of a human 3D skeleton. The method specifically comprises the following steps of obtaining a human body 3D skeleton key point data set for training, and defining an algorithm target; establishing a graph convolution encoder, and performing feature learning on the input human body 3D skeleton to obtain features of the input skeleton; establishing a classifier, and performing behavior recognition on the input human body 3D skeleton input; establishing a non-autoregressive decoder, and predicting a human body 3D skeleton at a future moment; performing behavior recognition on the predicted human body 3D skeleton by using ashared graph convolution encoder and a classifier; and using the joint learning framework to carry out human body action prediction at a future moment. The method is used for human body action prediction analysis in a real video, and has good effect and robustness for various complex conditions.

Description

technical field [0001] The invention belongs to the field of computer vision, in particular to an action prediction method of a human skeleton based on multi-task non-autoregressive decoding. Background technique [0002] The action prediction problem based on the human skeleton is defined as the following problem: in a set of human skeleton key point sequences containing multiple frames, predict the human key point sequence at a future moment. Human skeleton key points are often used as auxiliary information for some high-level visual tasks, such as human-computer interaction, abnormal behavior detection, etc. The key factors of human skeleton action prediction include modeling the temporal structure. Traditional methods often use autoregressive decoding, that is, the prediction result of the next frame is completely dependent on the prediction result of the previous frame, and this recursive method is used to predict the future time, without considering the prediction err...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/00G06K9/62G06N3/04G06N3/08
CPCG06N3/084G06V40/20G06N3/045G06F18/241
Inventor 李玺李斌田健张仲非
Owner ZHEJIANG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products