Video action detection method based on scale attention hole convolution network

A convolutional network and action detection technology, applied in the field of video analysis, that addresses problems such as the increased time and space costs of network construction and training, interference between different semantics, and the splitting of contextual semantics.

Active Publication Date: 2020-09-01

HANGZHOU DIANZI UNIV

7 Cites, 17 Cited by


Problems solved by technology

[0004] Existing video action detection methods have three main deficiencies. First, in the feature extraction stage, the three-dimensional convolution operation used to extract the temporal features of an action reduces the temporal dimension of the input video by a fixed amount, layer by layer, in the constructed network model. This constrains the temporal scale of the extracted features: too small a scale may split the contextual semantics, and too large a scale may introduce interference from different semantics. Second, the key points that determine whether an action is present and what type it is, namely the key-frame positions and their duration (such as runs of consecutive key frames), often differ from one action to another, yet the conventional average pooling operation ignores the weights of these key points. Third, existing methods use different network structures (such as dilated convolutional networks) for fragments of different scales to extract feature representations of action clips, which greatly increases the time and space costs of network construction and training.
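The first deficiency concerns the temporal dimension shrinking layer by layer. As a minimal sketch of the alternative that dilated (hole) convolution offers, the pure-Python example below applies a one-dimensional temporal convolution with zero padding, so the output keeps the input length while the dilation rate widens the temporal span each output sees. The function names `dilated_conv1d` and `receptive_field`, the kernel values, and the dilation rates are illustrative assumptions and are not taken from the patent, whose network operates on video features rather than a scalar sequence.

```python
def dilated_conv1d(signal, kernel, dilation=1):
    """Same-length 1D cross-correlation with zero padding and a dilation rate.

    Hypothetical helper for illustration; the kernel length must be odd so
    that the padding is symmetric around each time step.
    """
    k = len(kernel)
    if k % 2 == 0:
        raise ValueError("kernel length must be odd")
    half = (k // 2) * dilation
    out = []
    for t in range(len(signal)):
        acc = 0.0
        for j, w in enumerate(kernel):
            idx = t - half + j * dilation  # taps are spaced `dilation` steps apart
            if 0 <= idx < len(signal):     # positions outside the signal count as zero
                acc += w * signal[idx]
        out.append(acc)
    return out


def receptive_field(kernel_size, dilations):
    """Temporal receptive field of stacked stride-1 dilated convolutions."""
    rf = 1
    for d in dilations:
        rf += (kernel_size - 1) * d
    return rf


if __name__ == "__main__":
    frames = [0, 0, 0, 0, 1, 0, 0, 0, 0]
    print(dilated_conv1d(frames, [1, 1, 1], dilation=2))
    print(receptive_field(3, [1, 2, 4]))
```

With stride 1 and zero padding, the sequence length is unchanged at every layer, so the temporal scale is controlled by the dilation rates rather than by a fixed layer-by-layer reduction.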

Embodiment Construction

[0029] The present invention is further described below with reference to the accompanying drawings.

[0030] A video action detection method based on a scale-attention hole (dilated) convolutional network. First, the video is sampled to obtain a frame image sequence, and video segments are obtained according to the action segment annotations. Next, a layer-scale attention action segment model and a frame-position attention action recognition model are constructed. Finally, the models are combined with the watershed algorithm to determine the action category to which each video clip belongs. The method uses the dilated convolutional network to capture the temporal and spatial motion information of video data more accurately, uses the layer-scale attention mechanism to describe the temporal context of video frames, and uses the frame-position attention mechanism to learn appropriate weights for the video frames of the action clips. Good reflection of the content of the action cl...
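The difference between conventional average pooling and a frame-position attention weighting can be sketched as follows. This is a minimal illustration under stated assumptions: the per-frame scores are supplied by hand here, whereas in the method they would be learned by the model, and the names `softmax`, `average_pool`, and `attention_pool` are hypothetical helpers rather than components named in the patent.

```python
import math


def softmax(scores):
    """Turn raw per-frame scores into weights that sum to 1."""
    m = max(scores)                       # subtract the max for numerical stability
    exps = [math.exp(s - m) for s in scores]
    total = sum(exps)
    return [e / total for e in exps]


def average_pool(frame_features):
    """Conventional pooling: every frame contributes equally."""
    n = len(frame_features)
    dim = len(frame_features[0])
    return [sum(f[i] for f in frame_features) / n for i in range(dim)]


def attention_pool(frame_features, frame_scores):
    """Weighted pooling: frames with higher scores contribute more.

    frame_scores are assumed inputs for this sketch; a real model would
    produce them from the frame features.
    """
    weights = softmax(frame_scores)
    dim = len(frame_features[0])
    pooled = [0.0] * dim
    for w, feat in zip(weights, frame_features):
        for i in range(dim):
            pooled[i] += w * feat[i]
    return pooled, weights


if __name__ == "__main__":
    feats = [[1.0, 0.0], [0.0, 1.0], [1.0, 1.0]]
    print(average_pool(feats))
    print(attention_pool(feats, [0.0, 3.0, 0.0]))
```

When all scores are equal the weighted pooling reduces to average pooling, which makes the average a special case that cannot emphasize key frames.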


Abstract

The invention discloses a video action detection method based on a scale attention hole convolutional network. The method comprises the following steps: first, a video is sampled to obtain a frame image sequence, and video fragments are obtained according to the fragment position annotations; then a layer-scale attention action fragment model and a frame position attention action recognition model are constructed, and, according to these models and in combination with a watershed algorithm, the weighted feature representation of each frame image and the action category to which each video fragment belongs are obtained in turn, thereby completing the video action detection task. The method uses the hole convolution network to extract spatio-temporal motion information that better reflects the intrinsic structure of the video data in the time and space dimensions. The layer-scale attention describes more appropriately how the temporal context relationship of the video frames changes with the size of the scale, and the designed frame position attention mechanism gives the video frames of the action clips weights that more accurately represent the key content of the action clips. As a result, both the accuracy and the efficiency of video action detection are improved.
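The abstract does not spell out how the watershed algorithm is applied, so the following is only a simplified sketch of the general idea on a one-dimensional sequence: per-frame action scores are cut at a single level, and each run of consecutive frames at or above that level becomes a candidate segment. The scores, the threshold values, and the function name `segments_above` are assumptions made for illustration; this is a single-threshold step, not the full procedure of the patent.

```python
def segments_above(scores, threshold):
    """Return (start, end) frame index pairs, inclusive, for each run of
    consecutive frames whose score is at or above the threshold.

    Hypothetical helper; a simplified, single-level stand-in for a
    watershed-style grouping of frames into candidate action segments.
    """
    segments = []
    start = None
    for t, s in enumerate(scores):
        if s >= threshold and start is None:
            start = t                          # a new run begins
        elif s < threshold and start is not None:
            segments.append((start, t - 1))    # the run ended at the previous frame
            start = None
    if start is not None:                      # close a run that reaches the last frame
        segments.append((start, len(scores) - 1))
    return segments


if __name__ == "__main__":
    scores = [0.1, 0.8, 0.9, 0.2, 0.7, 0.7, 0.6, 0.1]
    print(segments_above(scores, 0.5))
    print(segments_above(scores, 0.15))
```

Lowering the threshold merges neighbouring runs into longer segments, which is one way candidate segments of different temporal extents can be produced from the same score sequence.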

Description

Technical field

[0001] The invention belongs to the technical field of video analysis, in particular to the technical field of time series action detection, and relates to a video action detection method based on a scale-attention hole convolution network.

Background technique

[0002] The understanding of human motion video plays an important role in many fields such as security monitoring and behavior analysis, and has become a frontier research topic in the field of computer vision. However, unedited real videos often contain background segments unrelated to human actions, which will affect the correct understanding of video content. To solve this problem, the video action detection method not only classifies the actions in the video, but also locates the start and end time of the action instance in the video. Video action detection tasks usually use video frame sequences as input, and output the detection results of multiple groups of segments in the form of "action typ...


Application Information


Patent Type & Authority: Applications (China)

IPC (8): G06K9/00, G06K9/62, G06N3/04, G06N3/08

CPC: G06N3/08, G06V40/20, G06V40/10, G06V20/46, G06V20/52, G06N3/047, G06N3/045, G06F18/241, G06F18/2415

Inventor: 李平, 曹佳晨, 陈乐聪, 徐向华

Owner: HANGZHOU DIANZI UNIV
