Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A method and system for action video classification based on multi-level motion modeling

A video classification, multi-level technology, applied in neural learning methods, character and pattern recognition, biological neural network models, etc., can solve the problems of large video frame jumps, affecting the video classification effect, difficult motion modeling, etc., to improve the expression effect of ability

Active Publication Date: 2022-08-05
ZHEJIANG LAB
View PDF5 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the problem with these methods is that only the inter-segment motion information is considered, and it is difficult to carry out effective motion modeling due to the large jump of video frames between each segment, which affects the effect of video classification

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A method and system for action video classification based on multi-level motion modeling
  • A method and system for action video classification based on multi-level motion modeling
  • A method and system for action video classification based on multi-level motion modeling

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] The specific embodiments of the present invention will be described in detail below with reference to the accompanying drawings. It should be understood that the specific embodiments described herein are only used to illustrate and explain the present invention, but not to limit the present invention.

[0028]The method of the present invention uses the Pytorch framework for experiments, and uses the stochastic gradient descent SGD optimizer and the MultiStepLR scheduler with an initial learning rate of 0.01. Set up training on the Something-Something V1 dataset for 60 iterations, adjusting the learning rate at the 30th, 45th and 55th iterations. batch size is 64, number of video segments , both branches of the network are initialized using the ResNet50 model pre-trained on ImageNet, where the 1D channel-wise convolution in each layer is initialized in a manner equivalent to the Temporal Shift operation in the TSM network. Following common settings, the sampled video...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an action video classification method and system based on multi-level motion modeling, which performs multi-level comprehensive modeling for intra-segment and inter-segment motion information, which includes two neural network branches: The video frames sampled in the segment are processed to extract the apparent information and inter-segment motion information of the foreground target; the intra-segment branch processes the difference between adjacent video frames in each video segment and is used to extract the segment of the foreground target. Intra-sports information. The frame difference features extracted by the intra-segment branch are used to weight the inter-segment branch features by channel, and the convolutional features of the last two branches are fused and jointly input into the classifier for video classification. The invention is simple in implementation method and flexible in means, and achieves a significant improvement in classification effect on the action video data set.

Description

technical field [0001] The present invention relates to the technical field of video classification, in particular to an action video classification method and system based on multi-level motion modeling. Background technique [0002] With the popularity of cameras and the explosion of video applications (such as Douyin, etc.), video accounts for an increasing proportion of network data, and research on action video classification tasks is widely used in technologies such as intelligent monitoring, autonomous driving, and human-computer interaction. The field has important application value. After 2012, video classification methods based on deep learning, especially convolutional neural networks (CNN), have gradually replaced traditional hand-designed features (such as IDT, etc.). There are two main ideas for processing video data: [0003] One is to model continuous video segments. The main methods are 3D convolution, (2+1)D convolution, etc. Among them, the (2+1)D convo...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G06V20/40G06N3/04G06N3/08
CPCG06N3/08G06N3/045
Inventor 卢修生鲍虎军程乐超杨非宋明黎
Owner ZHEJIANG LAB
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products