Spatial-temporal feature aggregation method and system combined with attention mechanism and terminal

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of spatiotemporal features and aggregation methods, applied in the field of computer vision, can solve the problems of recurrent neural network gradient divergence, inability to synthesize early image frame information well, and limit recognition accuracy, so as to improve the pedestrian recognition rate.

Pending Publication Date: 2020-11-20

SHANGHAI JIAO TONG UNIV

View PDF2 Cites 9 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, the recurrent neural network has the problem of gradient divergence, cannot well synthesize the information of the early image frames, and lacks the necessary spatial information, which limits the final recognition accuracy

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0044] The following is a detailed description of the embodiments of the present invention: this embodiment is implemented on the premise of the technical solution of the present invention, and provides detailed implementation methods and specific operation processes. It should be noted that those skilled in the art can make several modifications and improvements without departing from the concept of the present invention, and these all belong to the protection scope of the present invention.

[0045] An embodiment of the present invention provides a spatio-temporal feature aggregation method with a joint attention mechanism. The method uses an attention mechanism to make the recognition network pay more attention to high-quality pedestrian image samples to solve the problems existing in the prior art. Extract more effective spatial features. However, in temporal feature extraction, different samples in video frames also contain different contribution levels, and different att...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a spatial-temporal feature aggregation method and system combined with an attention mechanism, and a terminal, and the method comprises the steps: extracting the spatial domainfeatures of a pedestrian in a deep network through a convolutional neural network, and obtaining the time domain features of the pedestrian through the spatial domain features comprehensively extracted through a recurrent neural network; respectively generating corresponding quality-sensitive attention scores and frame-sensitive attention scores by adopting a feature extraction network so as to dynamically fuse spatial domain and time domain features; carrying out linear superposition fusion to obtain quality-sensitive spatial domain features and frame-sensitive time domain features to obtainpedestrian space-time feature expression; carrying out network training on the upper, middle and lower parts of a pedestrian to obtain corresponding local features with complementary properties, and obtaining feature expressions with higher discrimination through splicing. The method and system have good robustness, and can better solve and adapt to the conditions of shielding, light change and the like; and by combining the spatial domain and time domain characteristics of the pedestrian, the detail characteristics of the pedestrian are mined, so that the method and system can play better performance and efficiency in the next step of pedestrian recognition.

Description

technical field [0001] The invention belongs to the technical field of computer vision, and in particular relates to a spatio-temporal feature aggregation method, system and terminal of a joint attention mechanism. Background technique [0002] Pedestrian re-identification is a key task in intelligent video surveillance, and it has been a research hotspot in the field of computer vision in recent years. It is suitable for security and public places and other technical fields. Pedestrian re-identification can be defined as: in a non-overlapping video surveillance network, for a given person in one camera, determine whether it appears in other cameras or not. It is an automatic target recognition technology, which can quickly locate interested human targets in the surveillance network, and is an important step in applications such as intelligent video surveillance and human behavior analysis. [0003] How to extract enough discriminative features from limited data is a key ch...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06K9/00G06N3/04

CPCG06N3/049G06V40/25G06V40/10G06N3/044

Inventor 杨华陈琳

Owner SHANGHAI JIAO TONG UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Spatial-temporal feature aggregation method and system combined with attention mechanism and terminal

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology