Deep learning-based human body motion recognition method of multi-channel image feature fusion

A human action recognition and image feature fusion technology, applied in character and pattern recognition, instruments, biological neural network models, etc., which addresses the problems of small inter-class differences, large intra-class differences, and the resulting inability to achieve high-precision human action recognition.

Status: Inactive
Publication Date: 2018-07-17
SOUTH CHINA UNIV OF TECH
Cites: 2 | Cited by: 60

AI Technical Summary

Problems solved by technology

[0003] Although traditional methods based on hand-crafted features and methods based on deep learning have achieved good classification performance in human action recognition, due to the complexity of human actions, the interference of background factors, small inter-class differences, and large intra-class differences, existing methods cannot achieve high-precision human action recognition.

Examples

Embodiment

[0052] As shown in Figures 1 and 2, the human action recognition method based on deep learning with multi-channel image feature fusion of the present invention is used to recognize human actions in video. It comprises the following four steps (illustrative sketches of each step follow the list):

[0053] (1) Extract the original RGB frames from the video, and compute the dynamic images and optical flow maps of the segmented video from the RGB frames;

[0054] (2) Crop the input images to augment the training data set;

[0055] (3) Construct a three-channel convolutional neural network, and feed the resulting video segments into the three-channel convolutional neural network for training to obtain the corresponding network model;

[0056] (4) For the video segment to be recognized, extract the original RGB frames, compute the corresponding dynamic images and optical flow maps, use the three-channel convolutional neural network trained in step (3) to extract features, and obtain the final action category recognition result.
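
The excerpt does not say how the dynamic images or optical flow maps of step (1) are computed. The sketch below illustrates one plausible reading under stated assumptions: frames are sampled evenly from the clip, optical flow comes from OpenCV's Farneback estimator (a common choice, not confirmed by the patent), and the dynamic image is formed by approximate rank pooling, which weights frame t of T by (2t - T - 1); all function names are illustrative.

    # Sketch of step (1): frame extraction, dynamic image, optical flow.
    # Assumptions: even 16-frame sampling, Farneback flow, approximate
    # rank pooling for the dynamic image -- none are specified by the patent.
    import cv2
    import numpy as np

    def extract_frames(video_path, num_frames=16):
        """Read num_frames RGB frames, evenly spaced over the video."""
        cap = cv2.VideoCapture(video_path)
        total = int(cap.get(cv2.CAP_PROP_FRAME_COUNT))
        idxs = np.linspace(0, max(total - 1, 0), num_frames).astype(int)
        frames = []
        for i in idxs:
            cap.set(cv2.CAP_PROP_POS_FRAMES, int(i))
            ok, bgr = cap.read()
            if ok:
                frames.append(cv2.cvtColor(bgr, cv2.COLOR_BGR2RGB))
        cap.release()
        return frames

    def dynamic_image(frames):
        """Approximate rank pooling: weight frame t of T by (2t - T - 1)."""
        T = len(frames)
        coeffs = np.array([2 * t - T - 1 for t in range(1, T + 1)], np.float32)
        stack = np.stack(frames).astype(np.float32)       # (T, H, W, 3)
        di = np.tensordot(coeffs, stack, axes=1)          # weighted sum over T
        return cv2.normalize(di, None, 0, 255, cv2.NORM_MINMAX).astype(np.uint8)

    def optical_flow_maps(frames):
        """Dense Farneback flow between consecutive grayscale frames."""
        grays = [cv2.cvtColor(f, cv2.COLOR_RGB2GRAY) for f in frames]
        return [cv2.calcOpticalFlowFarneback(a, b, None, 0.5, 3, 15, 3, 5, 1.2, 0)
                for a, b in zip(grays, grays[1:])]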
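
Step (2) only says that cropping is used to enlarge the training set. A minimal sketch, assuming random 224x224 crops plus a horizontal flip (the crop size, crop count, and flip are all assumptions):

    import random
    import numpy as np

    def augment_crops(img, crop_size=224, num_crops=5):
        """Random crops (plus occasional flips) to enlarge the data set.
        Assumes the input image is at least crop_size on each side."""
        h, w = img.shape[:2]
        out = []
        for _ in range(num_crops):
            top = random.randint(0, h - crop_size)
            left = random.randint(0, w - crop_size)
            crop = img[top:top + crop_size, left:left + crop_size]
            if random.random() < 0.5:
                crop = crop[:, ::-1]          # horizontal flip
            out.append(np.ascontiguousarray(crop))
        return out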
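
Step (3) names a three-channel convolutional neural network with dense multi-channel fusion in the middle of the network, but gives no layer details. The PyTorch sketch below shows one plausible shape: one convolutional stream per modality (RGB frame, dynamic image, optical flow), channel-wise concatenation as the mid-network fusion, and a shared classification head. All layer sizes, and the use of PyTorch itself, are assumptions.

    import torch
    import torch.nn as nn

    def conv_block(cin, cout):
        return nn.Sequential(nn.Conv2d(cin, cout, 3, padding=1),
                             nn.ReLU(inplace=True), nn.MaxPool2d(2))

    class ThreeStreamNet(nn.Module):
        """One stream per modality; features fused mid-network by concatenation."""
        def __init__(self, num_classes, flow_channels=2):
            super().__init__()
            self.rgb  = nn.Sequential(conv_block(3, 32), conv_block(32, 64))
            self.dyn  = nn.Sequential(conv_block(3, 32), conv_block(32, 64))
            self.flow = nn.Sequential(conv_block(flow_channels, 32),
                                      conv_block(32, 64))
            # mid-network fusion: concatenate the three 64-channel maps
            self.fused = nn.Sequential(conv_block(3 * 64, 128),
                                       conv_block(128, 128))
            self.head = nn.Sequential(nn.AdaptiveAvgPool2d(1), nn.Flatten(),
                                      nn.Linear(128, num_classes))

        def forward(self, rgb, dyn, flow):
            f = torch.cat([self.rgb(rgb), self.dyn(dyn), self.flow(flow)], dim=1)
            return self.head(self.fused(f))

    # Shape check: batch of 4 clips, e.g. 101 action classes (assumed).
    net = ThreeStreamNet(num_classes=101)
    logits = net(torch.randn(4, 3, 224, 224), torch.randn(4, 3, 224, 224),
                 torch.randn(4, 2, 224, 224))    # -> (4, 101)

Training would then minimize a cross-entropy loss over action categories on the cropped segments produced in step (2).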
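
Given a network trained as in step (3), step (4) reduces to running the three modalities of an unseen clip through the model. A minimal sketch, reusing the hypothetical ThreeStreamNet and step-(1) helpers above:

    import torch

    @torch.no_grad()
    def recognize(model, rgb, dyn, flow, class_names):
        """Classify one clip; each input tensor has shape (1, C, H, W)."""
        model.eval()
        probs = torch.softmax(model(rgb, dyn, flow), dim=1)
        return class_names[int(probs.argmax(dim=1))]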

Abstract

The invention discloses a deep-learning-based human action recognition method with multi-channel image feature fusion. The method comprises: (1) extracting the original RGB frames from videos, and computing the dynamic images and optical flow maps of the segmented videos from the RGB frames; (2) cropping the input images to augment the training data set; (3) constructing a three-channel convolutional neural network, and inputting the resulting video segments into the three channels respectively for training to obtain the corresponding network model; and (4) for a video segment to be recognized, extracting the original RGB frames, computing the corresponding dynamic images and optical flow maps, and obtaining the final action category recognition result. The method uses the three-channel convolutional neural network to learn the essential features of the data from original inputs of different modalities, and performs multi-channel dense fusion operations on the three modalities in the middle of the network, which improves the expressive power of the features and achieves multi-channel information sharing and high recognition accuracy.

Description

Technical Field

[0001] The present invention relates to the technical field of image processing and analysis, and more specifically to a human action recognition method based on deep learning with multi-channel image feature fusion.

Background Technique

[0002] Human action recognition in video refers to the technology of recognizing and classifying human actions by analyzing and processing the visual feature information in video. This technology is widely used in intelligent video surveillance, behavior analysis, video retrieval, and so on. Traditional human action recognition trains classifiers on manually designed features for action classification. At present, the best-performing traditional strategy is to extract features based on improved Dense Trajectories (iDT), combined with Fisher Vector (FV) modeling, to recognize human actions. In recent years, with the rapid development of deep learning ...

Application Information

IPC(8): G06K9/00, G06K9/62, G06N3/04
CPC: G06V40/23, G06V20/40, G06N3/045, G06F18/241, G06F18/253, G06F18/214
Inventors: 张见威, 钟佳琪
Owner: SOUTH CHINA UNIV OF TECH