Bidirectional long short-term memory unit-based behavior identification method for video

A long-short-term memory and recognition method technology, which is applied in the field of automatic recognition of the behavior of characters in videos, can solve problems such as inability to accurately describe videos, and achieve the effect of ensuring accuracy and improving accuracy

Inactive Publication Date: 2017-06-13
SUZHOU UNIV
View PDF4 Cites 96 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Videos cannot be accurately desc

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Bidirectional long short-term memory unit-based behavior identification method for video
  • Bidirectional long short-term memory unit-based behavior identification method for video
  • Bidirectional long short-term memory unit-based behavior identification method for video

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0040] Embodiment one: see figure 1 As shown, it is a behavior recognition method based on two-way long-short-term memory unit for video, which uses deep learning features combined with two-way long-short-term memory unit for behavior recognition. In order to select a powerful feature representation, we use multi-layer deep learning features to replace traditional hand-designed features to improve the performance of action recognition. In order to fully explore the temporal information, a bidirectional long-short-term memory unit (Bi-LSTM) is used for modeling, which can capture changes in temporal sequences in two directions, and the information provided is better than that of a unidirectional long-short-term memory unit.

[0041]1. Convolutional neural network and its multi-layer features

[0042] In order to extract effective expressive features, a deep convolutional network needs to be trained. This embodiment uses the simple and effective Caffe architecture to build the...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a bidirectional long short-term memory unit-based behavior identification method for a video. The method comprises the steps of (1) inputting a video sequence and extracting an RGB (Red, Green and Blue) frame sequence and an optical flow image from the video sequence; (2) respectively training a deep convolutional network of an RGB image and a deep convolutional network of the optical flow image; (3) extracting multilayer characteristics of the network, wherein characteristics of a third convolutional layer, a fifth convolutional layer and a seventh fully connected layer are at least extracted, and the characteristics of the convolutional layers are pooled; (4) training a recurrent neural network constructed by use of a bidirectional long short-term memory unit to obtain a probability matrix of each frame of the video; and (5) averaging the probability matrixes, finally fusing the probability matrixes of an optical flow frame and an RGB frame, taking a category with a maximum probability as a final classification result, and thus realizing behavior identification. According to the method, the conventional artificial characteristics are replaced with multi-layer depth learning characteristics, the depth characteristics of different layers represent different pieces of information, and the combination of multi-layer characteristics can improve the accuracy rate of classification; and the time information is captured by use of the bidirectional long short-term memory, many pieces of time domain structural information are obtained and a behavior identification effect is improved.

Description

technical field [0001] The invention relates to a video processing method, in particular to a method for automatically recognizing the behavior of characters in a video. Background technique [0002] Behavior recognition refers to the analysis of the behavior of the target by extracting the feature information in the video or image sequence to identify the behavior patterns of the characters in the video. [0003] Action recognition is an important and difficult subject in computer vision and pattern recognition. It has broad application prospects in many aspects, such as intelligent monitoring, human-computer interaction, virtual reality, intelligent security and so on. In today's society, with the rapid development of the economy, people pay more and more attention to safety issues. More and more places have installed video surveillance cameras, and a large number of surveillance videos are generated every day. Now people generally use manual surveillance. This It needs ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/00G06K9/62G06N3/04
CPCG06N3/049G06V40/20G06V20/46G06F18/25G06F18/2415
Inventor 刘纯平葛瑞季怡刘海宾龚声蓉
Owner SUZHOU UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products