Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Video recognition method based on space-time pyramid network

A video recognition, space-time technology, applied in character and pattern recognition, biological neural network models, instruments, etc., can solve problems such as poor recognition effect, achieve the effects of avoiding recognition errors, improving computing efficiency, and saving computing time

Inactive Publication Date: 2018-04-13
TSINGHUA UNIV
View PDF3 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0004] However, these methods perform poorly when two videos have similar backgrounds or are similar in short time segments.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Video recognition method based on space-time pyramid network
  • Video recognition method based on space-time pyramid network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0018] The specific implementation manners of the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. The following examples are used to illustrate the present invention, but are not intended to limit the scope of the present invention.

[0019] figure 1 It is a flowchart of a video recognition method based on a spatio-temporal pyramid network according to an embodiment of the present invention, such as figure 1 As shown, the method includes: including: S1, acquiring a video segment sample set and an image sample, the video segment sample set including video segment samples corresponding to each time zone in the video to be tested, and each time zone corresponds to a sample point, the image sample is an image corresponding to the midpoint of the video clip, and the video clip is composed of all video clip samples; S2, extracting the image of each video clip sample in the video clip sample set by the first ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a video recognition method based on a space-time pyramid network. The method comprises steps of extracting characteristics of each video clip sample in a video sample set through the convolution neural network, carrying out time-space linear operator processing so as to acquire a first vector and through a second convolution neural network, acquiring image information of image samples and acquiring a second vector; carrying out time-space linear operator processing on the vector obtained by splicing the first vector and the second vector; carrying out weighting pooling on an output result and the second vector so as to acquire a third vector; through average pooling, acquiring a fourth vector and a fifth vector and then carrying time-space linear operator processingso as to acquire a sixth vector; and according to a loss value, recognizing a to-be-detected video. According to the invention, through the dimension reduction operation and inverse transformation operation, problems of curse of dimensionality of bilinearity fusion and high operation complexity are solved; and by improving the bilinearity fusion operators, under the condition that two videos havethe similar background or the similar short films, better recognition effects are acquired.

Description

technical field [0001] The invention relates to the field of computer data analysis, and more specifically, to a video recognition method based on a space-time pyramid network. Background technique [0002] With the development of artificial intelligence, machine learning has been applied in various fields, such as face recognition, weather prediction, etc. Nowadays, artificial intelligence has been widely used in the field of computer vision, and image recognition technology has achieved very good accuracy, but there is still room for improvement in video data processing and video action recognition. [0003] Obtaining a better representation of video data is the basis of many computer vision tasks, such as action recognition and video subtitle recognition. Video data analysis is not only image analysis but also depends on the joint model of time and space. Many methods now take advantage of convolutional networks to build such models. However, since the framework of Con...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/00G06K9/62G06N3/04
CPCG06V20/40G06N3/045G06F18/213
Inventor 龙明盛王建民王韫博黄向东
Owner TSINGHUA UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products