Behavior classification method based on twin three-dimensional convolutional neural network
A three-dimensional convolution and neural network technology, applied in the field of video speech understanding, can solve the problems of ignoring video time characteristics, unsatisfactory classification algorithm effect, and inability to apply calculation speed.
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0020] The technical solutions of the present invention will be described in detail below with reference to the accompanying drawings and embodiments.
[0021] like figure 1 As shown, it is a schematic diagram of the framework of the Siamese three-dimensional convolutional neural network on which the present invention is based. Including the abstract temporal feature branch network and the abstract spatial feature branch network, the structures of the two branches are the same, and the convolution kernels used are all three-dimensional convolution kernels. The abstract temporal features are passed through a deconvolutional network to generate an optical flow field. The splicing of abstract features is end-to-end, i.e. where f cat represents the feature after concatenation, f s represents the abstract spatial feature, f t represents an abstract temporal feature, and f s , The classifier is composed of a fully connected layer, and the output dimension of the fully conne...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


