A video group behavior recognition method based on cascaded Transformers
The recognition method belongs to the field of computer vision and deep learning. It addresses the problem that video sequence features cannot be extracted well, avoids manual feature extraction and offline training, is robust, and improves recognition accuracy.
Embodiment Construction
[0046] In order to make the object, technical solution and technical effect of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments.
[0047] As shown in Figure 2, in a video group behavior recognition method based on cascaded Transformers, a video data set is first collected and generated, 3D spatio-temporal features are extracted from the video data set through a 3D backbone network, and the key-frame image spatial feature map is selected. After the key-frame image spatial feature map is preprocessed, it is sent to the human target detection Transformer, which outputs the human target frames in the key-frame image. Then, after mapping and filtering, the sub-feature maps corresponding to the human target frames on the key-frame image feature map are combined with the feature maps of the frames surrounding the key frame to calculate the query / key / value, which are input to the group behavior recognition T...
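The data flow from human target frames to query / key / value can be illustrated with a minimal NumPy sketch. It is not the patented implementation: the source text is truncated and does not state how the query / key / value are formed, so the sketch assumes that queries come from average-pooled sub-feature maps under each human target frame and that keys and values come from the feature maps of the frames surrounding the key frame. The function names, the normalized box format, the score threshold used for filtering, and the random stand-in features are all hypothetical, and the learned projection matrices of a real Transformer are omitted.

```python
import numpy as np

def crop_and_pool(feature_map, box):
    """Average-pool the sub-feature map under a normalized (x1, y1, x2, y2) box."""
    h, w, _ = feature_map.shape
    x1, y1, x2, y2 = box
    # Map normalized coordinates to feature-map cells; keep at least one cell.
    c1 = min(int(np.floor(x1 * w)), w - 1)
    r1 = min(int(np.floor(y1 * h)), h - 1)
    c2 = max(int(np.ceil(x2 * w)), c1 + 1)
    r2 = max(int(np.ceil(y2 * h)), r1 + 1)
    return feature_map[r1:r2, c1:c2].mean(axis=(0, 1))

def softmax(x, axis=-1):
    x = x - x.max(axis=axis, keepdims=True)  # subtract max for numerical stability
    e = np.exp(x)
    return e / e.sum(axis=axis, keepdims=True)

def attend(queries, keys, values):
    """Scaled dot-product attention without learned projections (assumption)."""
    d = queries.shape[-1]
    weights = softmax(queries @ keys.T / np.sqrt(d))
    return weights @ values, weights

def person_context_features(features, boxes, scores, key_index, min_score=0.5):
    """features: (T, H, W, C) spatio-temporal features from a 3D backbone."""
    # Filtering (assumed here to be a confidence threshold on detections).
    kept = [b for b, s in zip(boxes, scores) if s >= min_score]
    key_map = features[key_index]
    # One query vector per retained human target frame.
    queries = np.stack([crop_and_pool(key_map, b) for b in kept])
    # Keys and values: every spatial cell of the frames around the key frame.
    surrounding = np.delete(features, key_index, axis=0)
    tokens = surrounding.reshape(-1, features.shape[-1])
    return attend(queries, tokens, tokens)

rng = np.random.default_rng(0)
features = rng.standard_normal((5, 8, 8, 16))   # stand-in for backbone output
boxes = [(0.1, 0.2, 0.4, 0.9), (0.5, 0.1, 0.8, 0.7), (0.0, 0.0, 0.2, 0.2)]
scores = [0.9, 0.8, 0.2]                        # stand-in detector confidences
attended, weights = person_context_features(features, boxes, scores, key_index=2)
print(attended.shape, weights.shape)
```

Each row of `attended` is a per-person feature that mixes information from the surrounding frames; in a complete system such rows would be passed on to the recognition stage.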