A method and system for video behavior recognition based on spatial enhancement module

A recognition method and space technology, applied in character and pattern recognition, neural learning methods, biological neural network models, etc., can solve problems such as weak classification accuracy and attenuation of extraction ability, achieve strong versatility, enhance spatial characteristics, and optimize space. The effect of feature extraction capabilities

Active Publication Date: 2022-03-29
SOUTH CHINA UNIV OF TECH +1
View PDF8 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, its joint learning of spatio-temporal features leads to a significant attenuation in the ability to extract spatial features alone compared with 2D convolution, so the classification accuracy on data sets with strong spatial features and weak timing is weaker than that of 2D convolution.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method and system for video behavior recognition based on spatial enhancement module
  • A method and system for video behavior recognition based on spatial enhancement module
  • A method and system for video behavior recognition based on spatial enhancement module

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0058] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0059] The data set used in this embodiment is the Kinetics-400 data set. Kinetics is a very large video classification dataset. The videos in it are all from the Youtube video website. It contains a total of about 230,000 training set videos and about 20,000 verification set videos. The duration of all videos is 3-10s, and the resolution is uniform. It is 340 × 256 pixels or 256 × 340 pixels; the embodiment runs on the Linux system, the system version is Ubuntu16.04, mainly based on the Caffe framework, the dependent library OpenCV version of Caffe is 3.0, and the python version is 2.7; the graphi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a video behavior recognition method and system based on a spatial enhancement module. The method includes the following steps: decoding the video to be tested into a frame sequence, storing the decoded frame sequence in the form of an image; taking sparse sampling Strategy, divide the video into multiple video clips, extract a frame from each video clip, and combine them into a stacked frame sequence; calculate the mean value of the three channels of all training video frames in the behavior recognition data set, and divide the sampled frames Subtract the calculated mean value from the image; use the residual neural network 3D-ResNet-18 as the backbone network, and construct the behavior recognition and classification network with the space enhancement module; set the training parameters, input the training set into the behavior recognition and classification network for training, and save the training Network parameters; in the model deployment stage, the linear operator of the fusion space enhancement module; the video to be tested is input to the behavior recognition classification network, and the final classification result is output. The invention improves the behavior recognition effect, and has both effectiveness and versatility.

Description

technical field [0001] The invention relates to the technical field of video behavior recognition of computer vision, in particular to a method and system for video behavior recognition based on a spatial enhancement module. Background technique [0002] Human behavior recognition has always been a key research issue in the field of computer vision. Through the study of human body posture and behavior, abnormal behaviors in public places can be detected, such as: running fast, falling, hitting people, etc. Applying deep learning theory to computer vision applications can design high-precision and high-efficiency behavior recognition algorithms. A highly practical behavior recognition algorithm has played an important role in guaranteeing public safety, and it also enables the computer to automatically complete the detection of abnormal human behaviors, saving the time for managers to search manually. The purpose of video behavior recognition is to identify the specific cat...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G06V20/40G06N3/04G06N3/08
CPCG06N3/08G06V20/41G06N3/045
Inventor 胡永健蔡德利刘琲贝王宇飞
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products