The invention relates to a
human body behavior recognition method based on an RGB video and a skeleton sequence, and belongs to the technical field of
computer vision and
pattern recognition. The method comprises the following steps: step 1, extracting
features from an input video segment through a feature
stream to obtain a spatio-temporal feature map; step 2, generating a skeleton region
heat map by means of an attention
stream;
step 3, extracting the spatial and temporal features of the skeleton region through a bilinear operation; step 4, generating a local decision result by using the local decision block; and step 5, fusing the local decision results by using the
decision fusion block to obtain a global decision result. According to the invention, two plug-and-play modules, namely a local decision block and a decision fusion block, are used to realize
decision fusion: the local decision block makes
decisions on the spatial and temporal features of each key region separately, and the decision fusion block fuses all
local decision results to obtain a final
decision result. The method effectively improves the accuracy of
behavior recognition on the Penn Action and NTU RGB+D data sets.
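
As a non-limiting illustration, the flow from skeleton region heat maps to a global decision can be sketched as follows. The abstract does not specify the internal operations of either block, so everything below is an assumption made for the example: heat-map-weighted average pooling stands in for the region feature extraction, a single shared linear classifier with softmax stands in for the local decision block, and a weighted average of class probabilities stands in for the decision fusion block. The function names `attention_pool`, `local_decision`, `decision_fusion` and `recognize` are hypothetical.

```python
import math
from typing import List, Optional, Sequence, Tuple


def softmax(scores: Sequence[float]) -> List[float]:
    """Numerically stable softmax over a list of class scores."""
    peak = max(scores)
    exps = [math.exp(s - peak) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def attention_pool(feature_map: Sequence[Sequence[float]],
                   heat_map: Sequence[float]) -> List[float]:
    """Assumed region feature extraction: heat-map-weighted average.

    feature_map holds one channel vector per (flattened) space-time location;
    heat_map holds one non-negative weight per location for a single region.
    """
    total = sum(heat_map)
    if total <= 0:
        raise ValueError("heat map must contain positive mass")
    channels = len(feature_map[0])
    return [
        sum(w * loc[c] for w, loc in zip(heat_map, feature_map)) / total
        for c in range(channels)
    ]


def local_decision(region_feature: Sequence[float],
                   weights: Sequence[Sequence[float]],
                   bias: Sequence[float]) -> List[float]:
    """Assumed local decision block: linear classifier followed by softmax."""
    logits = [
        sum(w * x for w, x in zip(row, region_feature)) + b
        for row, b in zip(weights, bias)
    ]
    return softmax(logits)


def decision_fusion(local_probs: Sequence[Sequence[float]],
                    region_weights: Optional[Sequence[float]] = None) -> List[float]:
    """Assumed decision fusion block: weighted average of local decisions."""
    if region_weights is None:
        region_weights = [1.0] * len(local_probs)
    total = sum(region_weights)
    num_classes = len(local_probs[0])
    return [
        sum(w * p[k] for w, p in zip(region_weights, local_probs)) / total
        for k in range(num_classes)
    ]


def recognize(feature_map: Sequence[Sequence[float]],
              heat_maps: Sequence[Sequence[float]],
              weights: Sequence[Sequence[float]],
              bias: Sequence[float]) -> Tuple[int, List[float]]:
    """Run steps 3 to 5 for every region and return (label, probabilities)."""
    local_probs = [
        local_decision(attention_pool(feature_map, hm), weights, bias)
        for hm in heat_maps
    ]
    global_probs = decision_fusion(local_probs)
    label = max(range(len(global_probs)), key=global_probs.__getitem__)
    return label, global_probs


# Toy input: 2 locations, 2 channels, 2 regions, 2 behavior classes.
toy_features = [[1.0, 0.0], [0.0, 1.0]]
toy_heat_maps = [[1.0, 0.0], [1.0, 1.0]]
toy_weights = [[2.0, 0.0], [0.0, 2.0]]
toy_bias = [0.0, 0.0]
label, probs = recognize(toy_features, toy_heat_maps, toy_weights, toy_bias)
print(label, probs)
```

In this sketch the two blocks only consume per-region features and per-region class probabilities, which is one way to read the "plug-and-play" property: either block could be swapped for a different classifier or fusion rule without touching the feature or attention streams.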