Cross-modal pedestrian re-identification method based on double-transformation alignment and blocking

A pedestrian re-identification, cross-modal technology, applied in the direction of character and pattern recognition, instruments, computer components, etc., can solve the semantic dislocation of image space, limit the robustness and effectiveness of person re-identification technology, and different content semantics, etc. question

Pending Publication Date: 2021-12-07
SICHUAN UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Many practical problems will lead to spatial semantic dislocation between images, that is, the content semantics of two matching images corresponding to the same spatial location are different, thus limiting the robustness and effectiveness of person re-identification technology

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cross-modal pedestrian re-identification method based on double-transformation alignment and blocking
  • Cross-modal pedestrian re-identification method based on double-transformation alignment and blocking
  • Cross-modal pedestrian re-identification method based on double-transformation alignment and blocking

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0016] Attached below figure 1 , attached figure 2 And attached image 3 The present invention is described further:

[0017] The network structure and principle of the DTASN model are as follows:

[0018] This network model framework learns feature representations and distance metrics in an end-to-end manner through multipath dual-alignment and block networks while maintaining high discriminability. The framework consists of three components: (1) feature extraction module, (2) feature embedding module, and (3) loss calculation module. The backbone network of all paths is the deep residual network ResNet50. Due to the lack of available data, in order to speed up the convergence speed of the training process, the present invention uses a pre-trained ResNet50 model to initialize the network. To strengthen the attention on local features, the present invention applies a position attention module on each path.

[0019] For visible light and infrared cross-modal pedestrian r...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a cross-modal pedestrian re-identification method based on double transformation alignment and blocking. The method comprises steps of firstly, a base branch network being used for extracting input infrared and visible light pedestrian image features, a group of affine transformation parameters being linearly regressed by using high-level features of the images, then an alignment image being generated by using the parameters, and the image being capable of effectively relieving modal difference of misalignment; next, horizontally dividing the aligned image into three blocks, then taking out the features of the three block images, and fusing the features with the aligned global features and original image features as the total features of the visible light and infrared images; next, the total features of the infrared and visible light images being mapped to the same embedding space; and finally, carrying out joint training by combining identity loss and the most difficult batch sampling loss function with the weight so as to improve the recognition precision. The method is mainly applied to a video monitoring intelligent analysis application system, and has wide application prospects in the fields of image retrieval, intelligent security and protection and the like.

Description

technical field [0001] The present invention relates to a cross-modal pedestrian re-identification method based on double transform alignment and segmentation, and a new network model DTASN (Dual transform alignment and segmentation network), which relates to cross-modal pedestrian re-identification in the field of video intelligent monitoring The recognition problem belongs to the field of computer vision and intelligent information processing. Background technique [0002] Person Re-Identification (Person Re-Identification, ReID) is a technology in the field of computer vision. The purpose of person re-identification is to retrieve the person of interest in multiple non-overlapping cameras, which is usually considered as a subclass of image retrieval. question. An efficient ReID algorithm can ease the pain of video viewing and speed up the investigation process. The broad application prospects of pedestrian re-identification in video surveillance, intelligent security an...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/00G06K9/62
CPCG06F18/253G06F18/214
Inventor 陈洪刚刘强滕奇志何小海卿粼波吴晓红
Owner SICHUAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products