Target tracking method based on capsule network and natural language query

A target tracking technology driven by natural language queries, applied in the field of target tracking. It addresses the overly simple alignment strategy between text query and video frame and the failure to capture relationships between different entities and between parts and entities, and it achieves good robustness.

Pending Publication Date: 2022-01-14
HARBIN INST OF TECH

AI Technical Summary

Problems solved by technology

However, previous methods have not achieved competitive results on the dataset, and there are two problems: (1) the alignment strategy between the text query and the video frame is too simple; (2) the representations decoded by 2D convolution-based models often fail to capture the relationships between different entities and between parts and entities.

Embodiment Construction

[0045] The technical solution of the present invention is further described below in conjunction with the accompanying drawings, but it is not limited thereto. Any modification or equivalent replacement of the technical solution that does not depart from its spirit and scope shall fall within the scope of protection of the present invention.

[0046] The present invention provides a target tracking method based on capsule network and natural language query. The method proposes a capsule regression tracking network (CapsuleTNL) based on natural language query. First, feature representations of the word-level query and of the search region are obtained by a textual encoder and a visual encoder, respectively. Second, to guide the two-way interaction between video frames and natural language, a Vision-Text Routing Module (VTRM) and a Text-Visual Routing Module (TVRM) are proposed...
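The two-way interaction between visual capsules and text capsules can be illustrated with a minimal sketch. The listing does not show how VTRM and TVRM are actually defined, so the code below swaps in generic routing-by-agreement as a stand-in. The function `route`, the use of the other modality's capsules to seed the routing logits, the identity vote transform, and the toy capsule values are all assumptions made for illustration, not the patented modules.

```python
import math

def dot(a, b):
    """Inner product of two equal-length vectors."""
    return sum(x * y for x, y in zip(a, b))

def squash(v):
    """Capsule non-linearity: keeps direction, maps length into [0, 1)."""
    sq = dot(v, v)
    if sq == 0.0:
        return [0.0] * len(v)
    scale = math.sqrt(sq) / (1.0 + sq)
    return [scale * x for x in v]

def softmax(xs):
    """Numerically stable softmax over a list of floats."""
    m = max(xs)
    exps = [math.exp(x - m) for x in xs]
    total = sum(exps)
    return [e / total for e in exps]

def route(src_caps, guide_caps, iterations=3):
    """Route source capsules into one output capsule per guide capsule.

    Assumption: logits are seeded with source/guide agreement, and each
    source capsule votes with its own vector (no learned transform).
    """
    dim = len(src_caps[0])
    n_out = len(guide_caps)
    logits = [[dot(s, g) for g in guide_caps] for s in src_caps]
    outputs = []
    for _ in range(iterations):
        # Each source capsule distributes its vote across the outputs.
        coupling = [softmax(row) for row in logits]
        outputs = []
        for j in range(n_out):
            total = [sum(coupling[i][j] * s[d] for i, s in enumerate(src_caps))
                     for d in range(dim)]
            outputs.append(squash(total))
        # Raise the logits of sources that agree with the current outputs.
        logits = [[logits[i][j] + dot(s, outputs[j]) for j in range(n_out)]
                  for i, s in enumerate(src_caps)]
    return outputs

# Hypothetical toy capsules: 4 visual capsules and 2 text capsules, 3-D each.
visual_caps = [[0.9, 0.1, 0.0], [0.0, 0.8, 0.2], [0.1, 0.0, 0.7], [0.7, 0.2, 0.1]]
text_caps = [[1.0, 0.0, 0.0], [0.0, 0.0, 1.0]]

vt_out = route(visual_caps, text_caps)   # stand-in for the vision-to-text direction
tv_out = route(text_caps, visual_caps)   # stand-in for the text-to-vision direction
```

Running the routing in both directions yields one set of text-guided visual summaries and one set of vision-guided text summaries, which is the kind of bidirectional exchange the paragraph describes. A trained model would learn transformation matrices for the votes rather than use the raw capsule vectors.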

Abstract

The invention discloses a target tracking method based on a capsule network and natural language query. The method comprises the following steps: 1, given a search region of the current frame and a corresponding text query, sending the search region into a visual encoder to extract a visual feature representation, and sending the text query into a text encoder to extract a text feature representation; 2, constructing visual capsules from the visual feature representation extracted by the visual encoder, constructing text capsules from the text feature representation extracted by the text encoder, and designing a vision-text routing module and a text-visual routing module on the basis of the visual capsules and the text capsules; and 3, concatenating the output of the vision-text routing module and the output of the text-visual routing module, and generating a response map of the target through a decoder. When the tracker is initialized using only natural language, the method performs close to other methods; when it is initialized using both the natural language query and the initial bounding box, the results show good robustness.
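Step 3 of the abstract can be sketched in a few lines. The decoder architecture is not disclosed in this listing, so the sketch assumes a toy decoder (a per-location linear projection followed by a sigmoid). It also assumes that both routing outputs have already been arranged as feature grids of the same spatial size. The names `concat_features`, `decode_response`, `peak`, the weights, and the sample grids are hypothetical.

```python
import math

def concat_features(vt_map, tv_map):
    """Concatenate two H x W x C feature grids along the channel axis."""
    return [[vt_cell + tv_cell for vt_cell, tv_cell in zip(vt_row, tv_row)]
            for vt_row, tv_row in zip(vt_map, tv_map)]

def decode_response(fused, weights, bias=0.0):
    """Toy decoder (assumption): linear projection + sigmoid per location."""
    def sigmoid(z):
        return 1.0 / (1.0 + math.exp(-z))
    return [[sigmoid(sum(w * x for w, x in zip(weights, cell)) + bias)
             for cell in row]
            for row in fused]

def peak(response):
    """Return (row, col) of the highest value in the response map."""
    best = max((val, r, c)
               for r, row in enumerate(response)
               for c, val in enumerate(row))
    return best[1], best[2]

# Hypothetical 2 x 3 grids with 2 channels per location.
vt_map = [[[0.1, 0.0], [0.2, 0.1], [0.0, 0.1]],
          [[0.1, 0.2], [0.9, 0.8], [0.2, 0.1]]]
tv_map = [[[0.0, 0.1], [0.1, 0.1], [0.1, 0.0]],
          [[0.2, 0.1], [0.7, 0.9], [0.1, 0.2]]]

fused = concat_features(vt_map, tv_map)          # 2 x 3 grid, 4 channels
response = decode_response(fused, weights=[1.0, 1.0, 1.0, 1.0])
target_rc = peak(response)                       # (1, 1) for this toy input
```

The peak of the response map gives a coarse estimate of the target position inside the search region. A real tracker would map that grid cell back to image coordinates and would typically also regress a bounding box.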

Description

Technical field

[0001] The invention relates to a target tracking method, in particular to a target tracking method based on capsule network and natural language query.

Background art

[0002] Recently, Tracking by Natural Language Query (TNL) has gained increasing attention because it does not require a hand-labeled rectangular box to initialize a tracker to locate an object of interest. The combination of natural language understanding and visual target tracking has the following advantages: first, it removes the limitation of manually marking the initial frame; second, by jointly optimizing the two heterogeneous features of vision and language, the ability to accurately locate targets during long-term tracking can be significantly improved. However, previous methods have not achieved competitive results on the dataset, and there are two problems: (1) the alignment strategy between text query and video frame is too si...

Application Information

IPC(8): G06T7/246, G06K9/62, G06V10/764, G06F16/33, G06N3/04, G06N3/08
CPC: G06T7/246, G06F16/33, G06N3/08, G06N3/04, G06T2207/20081, G06T2207/20084, G06F18/24
Inventor: 邬向前, 卜巍, 马丁
Owner: HARBIN INST OF TECH