A Video Action Recognition Method Based on Spatiotemporal Fusion Features and Attention Mechanism
A technology of time-space fusion and recognition method, applied in character and pattern recognition, computer parts, instruments, etc., can solve problems such as inability to handle sequence problems, and achieve the effect of improving accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment
[0050] For the convenience of description, the relevant technical terms appearing in the specific implementation manner are explained first:
[0051] LSTM (Long Short-Term Memory): long short-term memory network;
[0052] figure 1 This is the flow chart of the video behavior recognition method based on the spatiotemporal fusion feature and the attention mechanism of the present invention.
[0053] In this embodiment,
[0054] The LSVRC2012 dataset is used for pre-training of the Inception V3 network, and the HMDB-51 and UCF-101 datasets are used for model simulation and validation analysis.
[0055] The HMDB-51 dataset contains 6849 videos, mainly from movie clips, divided into 51 categories, of which 5222 are used as training set, 300 are used as validation set, and 1327 are used as test set.
[0056] The UCF-101 dataset is a video action recognition dataset collected from real life. The video content is all derived from YouTube videos, including 13,320 videos and a total ...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


