Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Video behavior identification method based on hybrid multi-scale time sequence separable convolution operation

A recognition method and multi-scale technology, applied in the field of machine vision and deep learning, can solve the problem of different action lengths

Active Publication Date: 2020-06-09
PEKING UNIV
View PDF6 Cites 20 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] In order to overcome the shortcomings of the above-mentioned prior art, the present invention provides a spatio-temporal modeling method based on hybrid multi-scale temporal depth separable convolution operation, which is used to solve the different lengths of actions in videos and different temporal changes of different semantic features in space. The problem of scale can be applied to but not limited to the video understanding task of video behavior recognition, and can efficiently realize video behavior recognition
[0005] The present invention fuses depth-separable convolutions of different sizes into one depth-separable convolution operation, and performs timing modeling of different scales on the features of different channels, so as to solve the different lengths of actions in videos and different semantic features in space. Problems with different scales of temporal variation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Video behavior identification method based on hybrid multi-scale time sequence separable convolution operation
  • Video behavior identification method based on hybrid multi-scale time sequence separable convolution operation
  • Video behavior identification method based on hybrid multi-scale time sequence separable convolution operation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] Below in conjunction with accompanying drawing, further describe the present invention through embodiment, but do not limit the scope of the present invention in any way.

[0042] The present invention provides a high-efficiency video behavior recognition based on hybrid multi-scale time-series separable convolution. By integrating depth-separable 1D convolution kernels of different sizes into one convolution operation, the simultaneous recognition of long-sequence actions and short-sequence actions is realized. Modeling of actions.

[0043] Such as figure 1 As shown, the high-efficiency video behavior recognition based on the hybrid multi-scale time-series separable convolutional network established by the present invention is adopted. figure 2 Shown is the flow process of the video behavior recognition provided by the present invention, and specific implementation includes the following steps:

[0044] 1) Video frame extraction;

[0045] Extract the original video...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a video behavior identification method based on hybrid multi-scale time sequence separable convolution operation. The method comprises the following steps: extracting an original video into a picture sequence, dividing the picture sequence into a plurality of intervals, extracting a picture from each interval to form a picture sub-sequence, performing feature extraction onthe picture sub-sequence, and classifying the features of the picture sub-sequence to obtain a classification result serving as the category of behaviors occurring in the video; adopting a convolutional neural network model in which hybrid multi-scale time sequence separable convolution is added as a learner, extracting semantic features of picture sub-sequences, and using full connection layers in the convolutional neural network model for classifying the extracted picture sub-sequence features. The method is used for solving the problems that the lengths of actions in the video are differentand different semantic features in the space have different time sequence change scales, can be applied to video understanding tasks in the aspects of video behavior recognition and the like, and canefficiently realize video behavior recognition.

Description

technical field [0001] The invention belongs to the technical field of machine vision and deep learning, and relates to video behavior recognition technology, in particular to a method for efficient video behavior recognition using mixed multi-scale timing depth separable convolution. Background technique [0002] The purpose of video behavior recognition is to, for a given piece of video, analyze the action category that occurs in the video and give the corresponding label. This task can be considered as a kind of video classification task. For subsequent tasks of understanding other video content, the video classification task is the foundation, so it plays a very important role. As a basic task, video behavior recognition is widely used in human-computer interaction, urban security monitoring and other scenarios. [0003] In the process of video behavior recognition, existing methods use neural networks for feature extraction. It is mainly divided into three types: 1) ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/00G06K9/62G06N3/04G06N3/08
CPCG06N3/084G06V20/40G06V20/46G06N3/045G06F18/24
Inventor 王勇涛单开禹汤帜
Owner PEKING UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products