Spatial-temporal feature aggregation method and system combined with attention mechanism and terminal

A technology of spatiotemporal features and aggregation methods, applied in the field of computer vision, can solve the problems of recurrent neural network gradient divergence, inability to synthesize early image frame information well, and limit recognition accuracy, so as to improve the pedestrian recognition rate.

Pending Publication Date: 2020-11-20
SHANGHAI JIAO TONG UNIV
View PDF2 Cites 9 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the recurrent neural network has the problem of gradient divergence, cannot well synthesize the information

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Spatial-temporal feature aggregation method and system combined with attention mechanism and terminal
  • Spatial-temporal feature aggregation method and system combined with attention mechanism and terminal
  • Spatial-temporal feature aggregation method and system combined with attention mechanism and terminal

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] The following is a detailed description of the embodiments of the present invention: this embodiment is implemented on the premise of the technical solution of the present invention, and provides detailed implementation methods and specific operation processes. It should be noted that those skilled in the art can make several modifications and improvements without departing from the concept of the present invention, and these all belong to the protection scope of the present invention.

[0045] An embodiment of the present invention provides a spatio-temporal feature aggregation method with a joint attention mechanism. The method uses an attention mechanism to make the recognition network pay more attention to high-quality pedestrian image samples to solve the problems existing in the prior art. Extract more effective spatial features. However, in temporal feature extraction, different samples in video frames also contain different contribution levels, and different att...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a spatial-temporal feature aggregation method and system combined with an attention mechanism, and a terminal, and the method comprises the steps: extracting the spatial domainfeatures of a pedestrian in a deep network through a convolutional neural network, and obtaining the time domain features of the pedestrian through the spatial domain features comprehensively extracted through a recurrent neural network; respectively generating corresponding quality-sensitive attention scores and frame-sensitive attention scores by adopting a feature extraction network so as to dynamically fuse spatial domain and time domain features; carrying out linear superposition fusion to obtain quality-sensitive spatial domain features and frame-sensitive time domain features to obtainpedestrian space-time feature expression; carrying out network training on the upper, middle and lower parts of a pedestrian to obtain corresponding local features with complementary properties, and obtaining feature expressions with higher discrimination through splicing. The method and system have good robustness, and can better solve and adapt to the conditions of shielding, light change and the like; and by combining the spatial domain and time domain characteristics of the pedestrian, the detail characteristics of the pedestrian are mined, so that the method and system can play better performance and efficiency in the next step of pedestrian recognition.

Description

technical field [0001] The invention belongs to the technical field of computer vision, and in particular relates to a spatio-temporal feature aggregation method, system and terminal of a joint attention mechanism. Background technique [0002] Pedestrian re-identification is a key task in intelligent video surveillance, and it has been a research hotspot in the field of computer vision in recent years. It is suitable for security and public places and other technical fields. Pedestrian re-identification can be defined as: in a non-overlapping video surveillance network, for a given person in one camera, determine whether it appears in other cameras or not. It is an automatic target recognition technology, which can quickly locate interested human targets in the surveillance network, and is an important step in applications such as intelligent video surveillance and human behavior analysis. [0003] How to extract enough discriminative features from limited data is a key ch...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/00G06N3/04
CPCG06N3/049G06V40/25G06V40/10G06N3/044
Inventor 杨华陈琳
Owner SHANGHAI JIAO TONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products