Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Real-time video target tracking algorithm based on multilayer attention mechanism

A real-time video and target tracking technology, applied in the field of image processing, can solve the problems of model drift, tracking failure, and weak model discrimination, and achieve the effect of improving robustness and discrimination.

Pending Publication Date: 2019-07-05
NANJING UNIV OF INFORMATION SCI & TECH
View PDF0 Cites 38 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the target tracking algorithm (SiamFC) method based on the fully convolutional twin network uses a shallower convolutional neural network designed for classification tasks during offline training, which makes the representation ability of the model weak, and serious deformation occurs in the target. Or when similar objects appear, the model is prone to drift due to the weak discriminative power of the model, resulting in tracking failure

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Real-time video target tracking algorithm based on multilayer attention mechanism
  • Real-time video target tracking algorithm based on multilayer attention mechanism
  • Real-time video target tracking algorithm based on multilayer attention mechanism

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0029] The real-time video target tracking algorithm based on the twin network of multi-layer attention mechanism provided by the present invention, the process is as follows figure 1 , figure 2 Shown, specifically include the following steps:

[0030] Step 1: Obtain the input image, the first frame of the input image is the template image, and the subsequent frames are the search image;

[0031] Step 2: The present invention embeds the attention mechanism into the 2nd, 7th, and 12th layers of the convolutional network as a feature extraction network, which is used to simultaneously capture the low-level information, middle-level information, and high-level semantic information of the target, And use the attention mechanism to establish the channel and spatial connection between the feature maps. Its specific implementation is as follows:

[0032] At the same time, the channel and spatial information of the input feature map are processed. The channel information processin...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a real-time video target tracking algorithm based on a multilayer attention mechanism. The real-time video target tracking algorithm comprises the following steps of inputting aseries of template images and search images; constructing a multilayer attention convolutional neural network as a matching function; calculating the similarity between the template image and the search image by utilizing related operations; using a logic loss function as an optimized objective function; optimizing the network parameters by adopting momentum random gradient descent; fixing the model parameters during the online tracking of the convolutional network, taking a first frame of target as a template to be inputted into the network, inputting the subsequent frames sequentially intothe network to carry out the correlation calculation with the output obtained by the first frame, and obtaining a response score graph; and finding the position with the highest score in the responsegraph as the position of the target, and amplifying the target to the size the same as that of the original graph to obtain the tracking target. According to the multilayer attention twinning networktracking algorithm provided by the invention, an attention mechanism can be effectively utilized to separate a target from a background, and the accuracy and the generalization of the tracker are remarkably improved under the conditions of target rotation, deformation and partial shielding.

Description

technical field [0001] The invention relates to a real-time video target tracking algorithm based on a multi-layer attention mechanism, which belongs to the field of image processing. Background technique [0002] The goal of object tracking is to track any temporally changing object specified with a bounding box only in the first frame. Object tracking is one of the important components of various applications in the field of computer vision, such as unmanned driving, video surveillance, human-computer interaction and other fields. Although research in this field has made significant progress in the past decade, due to the arbitrary potential changes of the target and the unknown and continuously changing background of the video, partial occlusion, fast motion, Due to the interference of factors such as scale changes, the tracking of arbitrary targets has been very challenging so far. [0003] In recent years, as deep learning has achieved great success in the field of co...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06T7/246
CPCG06T7/246G06T2207/10016G06T2207/20081G06T2207/20084
Inventor 宋慧慧杨康张开华刘青山
Owner NANJING UNIV OF INFORMATION SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products