Training and visible light infrared visual tracking method based on adapter mutual learning model

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology for learning models and training methods, applied in the field of computer vision, can solve the problems of insufficient modal fusion in the RGBT tracking method, and achieve the effects of overcoming parameter redundancy, suppressing noise, and improving tracking performance.

Active Publication Date: 2020-03-10

ANHUI UNIVERSITY

View PDF3 Cites 7 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0008] The technical problem to be solved by the present invention is to provide training based on the adapter mutual learning model and a visible light and infrared visual tracking method to solve the insufficient fusion of modes in the RGBT tracking method The problem

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0055] Such as figure 1 , image 3 , Figure 4 , figure 1 It is a flow block diagram of Embodiment 1 of the present invention; image 3 It is a flowchart of the network model in the present invention; Figure 4 It is a flow chart of the adapter mutual learning module in the present invention; the training process based on the adapter mutual learning model includes the following steps;

[0056] S11. Build a network model; the network model is composed of multi-level adapter modules, Concatnate functions, and instance adapters connected in series in sequence. The multi-level adapter modules output feature maps of different modalities and obtain a whole by splicing the Concatnate function according to the channel dimension. The feature map is passed to the instance adapter for calculation;

[0057] image 3 It is a flow chart of the adapter mutual learning module in the present invention; as image 3 , in this embodiment, the multi-level adapter modules are respectively co...

Embodiment 2

[0084] Such as figure 2 , image 3 and Figure 4 , figure 2 It is a flow block diagram of Embodiment 2 of the present invention; image 3 It is a flowchart of the network model in the present invention; Figure 4 It is a flow chart of the adapter mutual learning module in the present invention; the visible light infrared vision tracking method based on the adapter mutual learning model comprises the following steps:

[0085] S21. Input the currently tracked video frame, and use Gaussian sampling to obtain candidate samples of the current frame around the target position predicted in the previous frame;

[0086] The first frame image provided by the tracking video sequence is used as the previous frame; from the previous frame and the truth box that frames the target location area, several samples are randomly generated according to the Gaussian distribution, and several iterations of training are performed to complete the network model initialization.

[0087] Specific...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to a training and visible light infrared visual tracking method based on an adapter mutual learning model. The method comprises the following steps: S11, constructing a network model; S12, training the whole network model by using the manually labeled visible light thermal infrared data set; S21, obtaining a candidate sample of the current frame; S22, predicting a target position according to the candidate sample; and S23, judging whether the current frame is tracked successfully or not. According to the invention, an adapter mutual learning module is introduced on the basis of multiple adapters (including a mode adapter, a general adapter and an instance adapter) to realize bidirectional cross-modal information transmission, so that the complementary advantages of different feature learning modes in RGBT tracking are fully utilized to further improve the traceability.

Description

technical field [0001] This application relates to the field of computer vision, in particular to the training of adapter-based mutual learning models and the visible light infrared vision tracking method. Background technique [0002] Object tracking is a hot issue in the field of computer vision; object tracking is also one of the key technologies for unmanned driving, intelligent transportation and intelligent monitoring. [0003] The current method based on detection and tracking is mainly to train a binary classifier, which can effectively distinguish the background and the foreground. First, set a specific threshold under the condition of the bounding box (bounding box) of the first frame in the video sequence. To determine the positive and negative samples; in the subsequent frames of the video, Gaussian sampling is still performed near the predicted result of the previous frame, and the obtained samples are classified into positive and negative samples. The highest s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06K9/20G06K9/62G06N3/04

CPCG06V10/143G06V2201/07G06N3/048G06N3/045G06F18/253G06F18/214

Inventor 李成龙钱存鹿安东汤进

Owner ANHUI UNIVERSITY

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Training and visible light infrared visual tracking method based on adapter mutual learning model

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology