A video moving target tracking method and device based on two-way twin network

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A twin network and moving target technology, applied in biological neural network models, neural learning methods, image analysis, etc.

Active Publication Date: 2021-05-11

江苏移动信息系统集成有限公司

View PDF3 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0004] Purpose of the invention: To propose a video moving target tracking method and device based on a two-way twin network to solve the above-mentioned problems in the prior art, improve the tracker accuracy and slow down the tracker drift caused by similar targets

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0072] The overall design idea of the algorithm is as follows: figure 2 as shown, figure 2 Among them, 1 indicates the z-response map of the last layer, 2 indicates the x-response map of the last layer, and * indicates cross-correlation. This framework contains two sub-network branches, semantic network branch and appearance network branch. Among them, the improved version of CIRes22 is used for the semantic branch network, and the network structure is shown in Table 1; the standard AlexNet is used for the appearance branch network. On the output feature map of the last layer of the template image z of the semantic branch network, a channel attention module is embedded; on the output feature map of the last layer of the template image z of the appearance branch network, an adaptive spatial masking strategy is added. Finally, the APCE value of the response map is output by each branch, and the weighted average is performed to obtain the final response map. The position c...

Embodiment 2

[0074] On the basis of Example 1, the single-way twin network uses the same network to extract the semantic features output by the last convolutional layer of the target, ignoring the appearance information of the target. However, the appearance information of the target is also important for the recognition of the target. effect. Therefore, this embodiment designs a tracker based on a multi-way Siamese network, using two different networks to extract the appearance information and semantic information of the target respectively. Specifically, the improved version of CIRes22 is used as the extractor of target semantic features, and AlexNet is used as the extractor of target appearance information. CIRes22 is an improved version based on ResNet. Compared with AlexNet with only 5 layers, the feature semantic discrimination ability extracted by the last layer is significantly better than AlexNet. Therefore, using CIRes22 and AlexNet to extract the semantic and appearance informa...

Embodiment 3

[0081] On the basis of the first embodiment, this embodiment starts with the two parameters of padding and stride in the convolutional network, and modifies the initial network according to these two aspects.

[0082] For padding, it may cause positional deviation during model training. Specifically, when the target moves to the edge of the image, if the network contains a padding operation, then the features extracted by the network will contain the original target part and the filling part of the edge, but for the candidate area in the search image, part of it is It only includes the target itself, and a part is the package target + filling these two parts. Therefore, this leads to inconsistency between the template image and the search area, and thus the final output response cannot truly reflect the similarity of the input image pair. Fortunately, when the object is close to the center of the image, the padding will not have a bad effect. In order to solve the interferen...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention relates to a video moving target tracking method and device based on a two-way twin network. The tracking method includes the following steps: building a semantic network and an appearance network respectively, combining to form a two-way twin network, using the semantic network to extract semantic information, and using the appearance network Extracting appearance information; adjusting the network structure of the semantic network, and embedding an attention module in the semantic network; adding a spatial mask in the appearance network to improve the focusing ability of the appearance network to extract targets; The semantic network and the appearance network are trained separately; the mixed prediction of the target scale and rotation angle is performed, and the position of the tracked target is finally determined. The present invention divides and conquers the appearance and semantics, builds two different networks, and allows them to perform their duties, so that the learning performance of the model is stronger, so the accuracy of the tracker can be improved, and at the same time, the tracker can be slowed down due to similar targets. drift problem.

Description

technical field [0001] The invention relates to a video moving target tracking method and device based on a two-way twin network, and relates to the field of data processing systems or methods for prediction purposes. Background technique [0002] The video moving target tracking technology based on the two-way twin network can realize video tracking of objects, and can be applied to video surveillance, security, target behavior trajectory analysis, human-computer interaction, automatic driving and other fields. [0003] If the existing tracker based on correlation filtering uses deep features, the tracking speed is very slow due to online model update and cannot meet the real-time requirements; while the tracking algorithm based on single-way twin network generally uses the same network to learn the target at the same time. Semantic and appearance features, the learning goal is not clear. Contents of the invention [0004] Purpose of the invention: To propose a video mov...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Patents(China)

IPC IPC(8): G06T7/246G06T7/231G06N3/04G06N3/08

CPCG06T7/248G06T7/231G06N3/08G06T2207/10016G06T2207/20081G06T2207/20084G06N3/045

Inventor 唐志鸿王宏图孙迎春张超溢彭力郑长岭胡仁龙姚洁金花徐姝婷董陵赵玮徐浩

Owner 江苏移动信息系统集成有限公司

A video moving target tracking method and device based on two-way twin network

What is Al technical title? Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document. A twin network and moving target technology, applied in biological neural network models, neural learning methods, image analysis, etc.

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A twin network and moving target technology, applied in biological neural network models, neural learning methods, image analysis, etc.

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology