Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

RGBT visual tracking method and system based on two-stage fusion structure search

A visual tracking and stage technology, applied in the field of computer vision, can solve a lot of repeated experiments, consume a lot of manpower and material resources, ignore the potential benefits of cross-layer integration, etc., to avoid repeated experiments and improve tracking performance

Pending Publication Date: 2021-12-24
ANHUI UNIVERSITY
View PDF1 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

First, these manually designed RGBT trackers require extensive repeated experiments, expert experience, and scientific intuition
Second, these fusion strategies ignore the potential benefits of cross-layer fusion
Finally, due to the fixed structure, these trackers are usually difficult to deal with various challenges in the tracking process
[0004] The existing methods have the following disadvantages: 1) The manually designed fusion network requires a lot of repeated experiments, expert experience and scientific intuition, the fusion of different convolutional layers has different effects, and finding an optimal fusion structure requires a lot of manpower 2) Since the structure of the hand-designed model is fixed, it is difficult to deal with various challenges in tracking

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • RGBT visual tracking method and system based on two-stage fusion structure search
  • RGBT visual tracking method and system based on two-stage fusion structure search
  • RGBT visual tracking method and system based on two-stage fusion structure search

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0037] Such as figure 1 , 2As shown, a RGBT visual tracking method based on two-stage fusion structure search, which specifically includes two stages of offline search and online tracking:

[0038] Such as figure 1 As shown, a general search space is designed in the offline search stage, including different fusion methods of the VGG-M convolutional layer, and five activation functions Tanh, ReLU, PReLU, LReLU, and ReLU6. The size of the search space is exponentially related to the number of possible fusion layers, so the search space is gradually explored according to the number of fusion layers, which is consistent with the idea of ​​​​progressive neural structure search, starting from a simple fusion layer of 1, and sequentially expanding the fusion number of layers. Train a proxy function to further guide the exploration of the search space. In order to learn the commonality of targets in different videos, a multi-domain learning training method is used. Assume that K vi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an RGBT visual tracking method and system based on two-stage fusion structure search, belongs to the technical field of computer vision, and solves the problem of how to find an optimal fusion network structure for RGBT tracking based on robust instance representation so as to further improve the tracking performance. In order to extract shared feature representation having robustness to various challenges such as illumination variation, movement blur and scale variation, a multi-domain learning framework is introduced to search for a fusion network structure in a general fusion space offline. In the online tracking stage, the fusion structure of each video sequence is searched online in the fusion space perceived by the instance so as to cope with challenges specific to the instance; the two-stage search algorithm can dynamically update a video fusion strategy, so that an appropriate fusion network structure is found for RGBT tracking based on robust instance representation, and the tracking performance is further improved.

Description

technical field [0001] The invention belongs to the technical field of computer vision, and relates to a RGBT visual tracking method and system based on two-stage fusion structure search. Background technique [0002] Object tracking is a hot issue in the field of computer vision. Object tracking is also one of the key technologies for unmanned driving, intelligent transportation and intelligent surveillance. Object tracking is to estimate the position of the object in subsequent frames given the bounding box of the initial frame. Most of the current tracking algorithms are based on the single-mode condition of visible light, which will be greatly affected under some extreme conditions, such as severe weather and strong changes in illumination, etc. The performance of single-mode tracking algorithms is often unsatisfactory. The modal fusion tracking of visible light and thermal infrared is called RGBT (Red Green Blue Thermal) tracking. Since visible light information and t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62G06N3/04G06N3/08
CPCG06N3/08G06N3/047G06N3/045G06F18/25
Inventor 汤进朱立顺李成龙
Owner ANHUI UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products