Cross-modal pedestrian re-identification method based on double-transformation alignment and blocking

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A pedestrian re-identification, cross-modal technology, applied in the direction of character and pattern recognition, instruments, computer components, etc., can solve the semantic dislocation of image space, limit the robustness and effectiveness of person re-identification technology, and different content semantics, etc. question

Pending Publication Date: 2021-12-07

SICHUAN UNIV

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Many practical problems will lead to spatial semantic dislocation between images, that is, the content semantics of two matching images corresponding to the same spatial location are different, thus limiting the robustness and effectiveness of person re-identification technology

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0016] Attached below figure 1 , attached figure 2 And attached image 3 The present invention is described further:

[0017] The network structure and principle of the DTASN model are as follows:

[0018] This network model framework learns feature representations and distance metrics in an end-to-end manner through multipath dual-alignment and block networks while maintaining high discriminability. The framework consists of three components: (1) feature extraction module, (2) feature embedding module, and (3) loss calculation module. The backbone network of all paths is the deep residual network ResNet50. Due to the lack of available data, in order to speed up the convergence speed of the training process, the present invention uses a pre-trained ResNet50 model to initialize the network. To strengthen the attention on local features, the present invention applies a position attention module on each path.

[0019] For visible light and infrared cross-modal pedestrian r...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a cross-modal pedestrian re-identification method based on double transformation alignment and blocking. The method comprises steps of firstly, a base branch network being used for extracting input infrared and visible light pedestrian image features, a group of affine transformation parameters being linearly regressed by using high-level features of the images, then an alignment image being generated by using the parameters, and the image being capable of effectively relieving modal difference of misalignment; next, horizontally dividing the aligned image into three blocks, then taking out the features of the three block images, and fusing the features with the aligned global features and original image features as the total features of the visible light and infrared images; next, the total features of the infrared and visible light images being mapped to the same embedding space; and finally, carrying out joint training by combining identity loss and the most difficult batch sampling loss function with the weight so as to improve the recognition precision. The method is mainly applied to a video monitoring intelligent analysis application system, and has wide application prospects in the fields of image retrieval, intelligent security and protection and the like.

Description

technical field [0001] The present invention relates to a cross-modal pedestrian re-identification method based on double transform alignment and segmentation, and a new network model DTASN (Dual transform alignment and segmentation network), which relates to cross-modal pedestrian re-identification in the field of video intelligent monitoring The recognition problem belongs to the field of computer vision and intelligent information processing. Background technique [0002] Person Re-Identification (Person Re-Identification, ReID) is a technology in the field of computer vision. The purpose of person re-identification is to retrieve the person of interest in multiple non-overlapping cameras, which is usually considered as a subclass of image retrieval. question. An efficient ReID algorithm can ease the pain of video viewing and speed up the investigation process. The broad application prospects of pedestrian re-identification in video surveillance, intelligent security an...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06K9/00G06K9/62

CPCG06F18/253G06F18/214

Inventor 陈洪刚刘强滕奇志何小海卿粼波吴晓红

Owner SICHUAN UNIV

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Cross-modal pedestrian re-identification method based on double-transformation alignment and blocking

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology