Video behavior recognition method based on weighted fusion of multiple image tasks

A weighted-fusion recognition technology, applied to character and pattern recognition, instruments, computer components, and related fields. It addresses the time-consuming and labor-intensive nature of manual video annotation and achieves the effect of expanding the distance between dissimilar samples while reducing the distance between similar ones.

Pending Publication Date: 2021-10-22
BEIJING INSTITUTE OF TECHNOLOGY

AI Technical Summary

Problems solved by technology

[0010] The present invention fully considers the correlation between different visual tasks in the field of computer vision and the fact that prior knowledge from related tasks can be transferred and reused, and proposes a video behavior recognition method based on weighted fusion of multiple image tasks.



Examples


Example Embodiment

[0051] Example

[0052] Step 1: Collect a human activity video data set, divide it according to the human behavior categories in the videos, assign category labels, normalize the video data, and split it into a training set and a test set. The specific method is as follows:

[0053] Step 1.1: Collect video data from a self-built video data set or an existing public data set. For a public data set, first download the relevant files from the official website. A specific example is HMDB51, a video behavior recognition data set with 51 action labels and 6,849 videos, in which each action class contains at least 51 videos. The actions mainly include: facial actions such as smiling, chewing, and talking; facial actions with object interaction such as eating and drinking; body movements such as clapping, climbing, jumping, and running; body movements with object interaction such as combing hair, driving, and playing golf; and interactions between people such as hugging and kissing. Each type of action is performed by 25 people, with 13,320 videos ...
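As a concrete illustration of Step 1.1 (not the patent's own preprocessing code), the following minimal Python sketch loads clips from the public HMDB51 data set with torchvision, normalizes the pixel values, and obtains the training and test partitions of an official split; the directory paths, clip length, spatial size, and batch size are placeholder assumptions.

```python
import torch
from torch.utils.data import DataLoader
from torchvision.datasets import HMDB51
import torchvision.transforms.functional as TF

# Placeholder paths: the extracted HMDB51 video folders and the official
# train/test split annotation files downloaded from the data set website.
VIDEO_ROOT = "data/hmdb51/videos"
SPLIT_ROOT = "data/hmdb51/splits"

def preprocess(video):
    # video: uint8 tensor of shape (T, H, W, C)
    video = video.permute(0, 3, 1, 2).float() / 255.0    # -> (T, C, H, W), values in [0, 1]
    video = TF.resize(video, [112, 112], antialias=True)  # common spatial size so clips can be batched
    return video

# fold=1 selects the first of the three official splits;
# train=True / train=False give the training and test partitions.
train_set = HMDB51(VIDEO_ROOT, SPLIT_ROOT, frames_per_clip=16,
                   step_between_clips=16, fold=1, train=True, transform=preprocess)
test_set = HMDB51(VIDEO_ROOT, SPLIT_ROOT, frames_per_clip=16,
                  step_between_clips=16, fold=1, train=False, transform=preprocess)

train_loader = DataLoader(train_set, batch_size=8, shuffle=True)
test_loader = DataLoader(test_set, batch_size=8, shuffle=False)
```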



Abstract

The invention relates to a video behavior recognition method based on weighted fusion of multiple image tasks. The method comprises the following specific steps: 1, constructing an initialized teacher network; 2, downloading and selecting, as the initialized teacher network, a plurality of pre-trained models and parameters from common visual image task data sets that are positively correlated with video behavior recognition; 3, establishing a multi-teacher video behavior recognition knowledge base; 4, under the guidance of the multi-teacher network with re-assigned weights, carrying out self-supervised training based on contrastive learning on the student network; 5, testing the video behavior recognition performance of the model on the test data set. The method uses image tasks that are positively correlated with the target task of video behavior recognition as teacher tasks and adopts a contrastive self-supervised learning training scheme, thereby solving the video behavior recognition problem when high-quality labeled video samples are insufficient and effectively improving the accuracy of video behavior recognition.
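To make step 4 of the abstract more concrete, here is a minimal, hypothetical sketch (not the patented implementation) of a weighted multi-teacher contrastive loss: the student's clip embeddings are pulled toward the matching embeddings produced by several frozen image-task teachers, with each teacher's InfoNCE-style term scaled by a per-teacher weight. All tensor shapes, weight values, and the temperature are illustrative assumptions.

```python
import torch
import torch.nn.functional as F

def multi_teacher_contrastive_loss(student_emb, teacher_embs, teacher_weights, temperature=0.07):
    """Weighted sum of InfoNCE-style losses between the student embeddings of a
    batch of clips and the embeddings produced by several frozen teachers.
    student_emb:     (B, D) student features
    teacher_embs:    list of (B, D) tensors, one per teacher (already projected to D)
    teacher_weights: iterable of floats, one per teacher
    """
    student = F.normalize(student_emb, dim=1)
    total = 0.0
    for w, t_emb in zip(teacher_weights, teacher_embs):
        teacher = F.normalize(t_emb, dim=1).detach()          # teachers are frozen
        logits = student @ teacher.t() / temperature           # (B, B) similarity matrix
        targets = torch.arange(student.size(0), device=student.device)
        total = total + w * F.cross_entropy(logits, targets)   # positives lie on the diagonal
    return total

# Hypothetical usage with random tensors standing in for real features.
if __name__ == "__main__":
    B, D = 8, 128
    s = torch.randn(B, D, requires_grad=True)
    teachers = [torch.randn(B, D) for _ in range(3)]   # e.g. classification / detection / segmentation teachers
    weights = [0.5, 0.3, 0.2]                          # hypothetical re-assigned teacher weights
    loss = multi_teacher_contrastive_loss(s, teachers, weights)
    loss.backward()
    print(float(loss))
```

In the method described above, the per-teacher weights correspond to the re-assigned weights of the multi-teacher network; here they are simply hard-coded for illustration.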

Description

Technical field

[0001] The invention relates to the technical field of video behavior analysis, and in particular to the design of a video behavior recognition method based on weighted fusion of multiple image tasks.

Background technique

[0002] Behavior recognition is an attractive and challenging research direction of recent years: given a cropped video, the task is to judge the human behavior category in that video through computer vision techniques. The development of deep convolutional neural networks and the emergence of large-scale labeled data sets have significantly improved the accuracy of behavior recognition. Behavior recognition technology plays an increasingly important role in fields such as intelligent security, human-computer interaction, video understanding, and medical health.

[0003] Existing model algorithms based on supervised learning with deep convolutional neural networks (Deep CNNs) have achieved relatively satisfactory results. However, in order to obtain ...


Application Information

IPC(8): G06K9/00
CPC: G06F18/22; G06F18/2155; G06F18/2415; G06F18/253
Inventor: 高广宇, 刘驰, 李金洋
Owner: BEIJING INSTITUTE OF TECHNOLOGY