Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Cross-modal pedestrian re-identification method based on adaptive pedestrian alignment

A pedestrian re-recognition, cross-modal technology, applied in character and pattern recognition, instruments, calculations, etc., can solve the problems of image misalignment, visible light image difference, misalignment, etc., to achieve the effect of improving accuracy

Active Publication Date: 2021-04-13
SICHUAN UNIV
View PDF6 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in the process of image data collection, due to reasons such as unsatisfactory camera shooting angles and image post-processing errors, there may be a large number of image dislocations and misalignment phenomena between images in a single modality, resulting in There can be huge differences within visible light images

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cross-modal pedestrian re-identification method based on adaptive pedestrian alignment
  • Cross-modal pedestrian re-identification method based on adaptive pedestrian alignment
  • Cross-modal pedestrian re-identification method based on adaptive pedestrian alignment

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0012] The present invention will be further described below in conjunction with accompanying drawing:

[0013] The network structure and principle of the MAPAN model are as follows:

[0014] This network model framework learns feature representations and distance metrics in an end-to-end manner while maintaining high discriminability. It mainly consists of two parts: a multipath network for feature extraction and a fully connected layer for feature embedding. Specifically, the multipath network consists of three branches: the visible light affine transformation branch, the visible light base branch and the infrared image branch, none of which share weights. The visible light basic branch has the same structure as the infrared image branch, and both use the residual network ResNet50 as the pre-training model, which includes 5 downsampling blocks and 1 average pooling layer. The visible light affine transformation branch consists of a grid network, a bilinear sampler and a re...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a cross-modal pedestrian re-identification method based on adaptive pedestrian alignment. The method comprises the following steps: firstly, respectively extracting features of an infrared image and a visible light image by utilizing a multi-path network based on a residual network pre-training model ResNet50; then, linearly regressing a group of affine transformation parameters by utilizing the high-level characteristics of the visible light image to carry out adaptive affine transformation on the visible light image; after an aligned and corrected image is generated, extracting and fusing features of the image with features extracted from an original visible light image to serve as final features of the visible light image; and finally, mapping the features of the infrared image and the visible light image into the same feature space, and training in combination with an identity loss function and a most difficult batch sampling loss function to finally achieve higher recognition precision compared with a general cross-modal pedestrian re-recognition method. The method is mainly applied to a video monitoring intelligent analysis application system, and has wide application prospects in the fields of image retrieval, intelligent security and the like.

Description

technical field [0001] The present invention relates to a cross-modal pedestrian re-identification method based on adaptive pedestrian alignment, and a new network model MAPAN (Multipath Adaptive Pedestrian Alignment Network), which relates to the cross-modal pedestrian re-identification problem in the field of video intelligent monitoring, It belongs to the field of computer vision and intelligent information processing. Background technique [0002] Pedestrian re-identification (Re-Identification) is a technology in the field of computer vision, which aims to match specific pedestrians with the same identity (usually replaced by numbers) in different camera surveillance videos, and is generally considered to be a sub-problem of image retrieval. Pedestrian images captured by different cameras may cause differences in the appearance of pedestrians due to reasons such as viewing angles, pedestrian posture changes, and illumination changes. There may also be challenges such as...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/00G06K9/62
CPCG06V40/103G06F18/214G06F18/253
Inventor 何小海李波吴晓红刘强陈洪刚卿粼波滕奇志
Owner SICHUAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products