Target tracking method based on coding and decoding structure

A target tracking, encoding and decoding technology, which is applied in the field of target tracking based on the encoder-decoder structure, can solve problems such as inequalities and feature extraction mismatches, and achieve the effect of reducing losses and fast convergence speed

Active Publication Date: 2020-09-22
UNIV OF ELECTRONICS SCI & TECH OF CHINA
View PDF23 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The L1 loss function regresses the four coordinate values ​​separately, but the loss function has the following disadvantages: first, when using the loss function to calculate the bounding box regression loss of the target detection, the loss of the four points is independently calculated, and then added to obtain The final bounding box regression loss, the assumption of this approach is that the four points are independent of each other, but in fact the four points are correlated; secondly, the actual evaluation frame detection index is to use the intersection ratio, which is not equivalent to L1
However, the pre-training task based on the ImageNet image database is based on the classification task, and the features extracted by the convolutional neural network are also more suitable for the classification task, which does not match the feature extraction requirements for the tracking task.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Target tracking method based on coding and decoding structure
  • Target tracking method based on coding and decoding structure
  • Target tracking method based on coding and decoding structure

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0017] The specific implementation manner and working principle of the present invention will be further described in detail below in conjunction with the accompanying drawings.

[0018] The present invention proposes a target tracking method based on a codec structure, which is realized through a target tracking network model. The structure of the target tracking network model is mainly composed of three parts, the first part is an encoder-decoder part: wherein the The encoder composed of a convolutional neural network extracts the deep features of the template frame (the image of the searched object) and the search frame (the position of the searched object in the frame), and the decoder, in the training part, takes the deep features of the template frame The feature is restored to the object image; the second part is the identification network, which is used to identify whether the image output by the decoder and the image input to the decoder belong to the same object; the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a target tracking method based on an encoding and decoding structure. According to the method, a similar generative adversarial network structure is generated through the combination of an encoder-decoder and a discriminator, the features extracted by an encoder are more generalized, and the essential features of a tracked object are learned. Due to the fact that the objects which are semi-shielded and affected by illumination and motion blur exist in the object frames, the influence on the network is smaller, and the robustness is higher. According to the method, FocalLoss is used for replacing a traditional cross entropy loss function, so that the loss of easy-to-classify samples in the network is reduced, the model pays more attention to difficult and misclassified samples, and meanwhile the number of positive and negative samples is balanced. Distance-U loss is used as regression loss, an overlapping region is concerned, other non-overlapping regions are concerned, scale invariance is achieved, the moving direction can be provided for a bounding box, and meanwhile the convergence speed is high.

Description

technical field [0001] The invention belongs to the field of image processing and computer vision, and in particular relates to an object tracking method based on an encoder-decoder structure. Background technique [0002] One of the main goals of computer vision is to enable computers to replicate basic functions of human vision, such as motion perception and scene understanding. To achieve the goal of intelligent motion perception, much effort has been devoted to visual object tracking, which is one of the most important and challenging research topics in computer vision. Essentially, the core of visual object tracking is to reliably estimate the motion state (i.e., position, orientation, size, etc.) of the target object in each frame of the input image sequence. At this stage, the target tracking algorithm mainly has two major branches, one is based on the correlation filtering algorithm, and the other is based on the deep learning algorithm. The target tracking method ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06T7/246
CPCG06T7/248G06T2207/10016G06T2207/20081G06T2207/20084Y02T10/40
Inventor 王正宁曾浩潘力立赵德明曾仪刘怡君彭大伟
Owner UNIV OF ELECTRONICS SCI & TECH OF CHINA
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products