Target tracking method, system, device and medium based on two-stream convolution neural network
A convolutional neural network and neural network technology, applied in image data processing, instruments, computing, etc., can solve problems such as lack of grasp of motion information, insufficient video timing, lack of large amounts of data, etc., to achieve good network generalization ability, Universal and universal, accurate tracking effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0075] The three-dimensional convolutional neural network is a kind of convolutional neural network, which originated in the fields of motion, body, and gesture detection. It is different from the two-dimensional convolutional neural network commonly used in the field of image classification and detection. It adds a time dimension, so It has excellent time-series feature expression ability, and was later introduced into the field of video classification and retrieval.
[0076] Different from tasks such as image classification, visual target tracking tasks not only need to extract the features of the target itself, but also need to extract the motion change information of the target between video frames, that is, sequence features. The present invention provides a target tracking method based on a two-stream convolutional neural network. The method firstly applies a three-dimensional convolutional neural network to the field of visual target tracking, and combined with a two-dim...
Embodiment 2
[0122] Such as Figure 7As shown, this embodiment provides a target tracking system based on a two-stream convolutional neural network, which includes a first building block 701, a second building block 702, an additive fusion module 703, a third building block 704, and a bounding box The regression module 705, the offline training module 706 and the online fine-tuning module 707, the specific functions of each module are as follows:
[0123] The first building block 701 is used to build a spatial flow two-dimensional convolutional neural network to extract feature information of image blocks in the current frame. The first building block 701 is as follows: Figure 8 shown, including:
[0124] The first input unit 7011 is configured to perform Gaussian sampling of S image blocks in the current frame based on the target neighborhood in the previous frame of the current frame, as the input of the spatial flow two-dimensional convolutional neural network; wherein, the spatial fl...
Embodiment 3
[0149] This embodiment provides a computer device, which may be a desktop computer, which includes a processor, a memory, a display, and a network interface connected through a system bus, the processor of the computer device is used to provide computing and control capabilities, the computer The memory of the device includes a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system, computer programs and databases. The internal memory is the operating system and computer programs in the non-volatile storage medium. An environment is provided, and when the processor executes the computer program stored in the memory, the target tracking method of the above-mentioned embodiment 1 is realized, as follows:
[0150] Construct a spatial flow two-dimensional convolutional neural network to extract the feature information of the image block in the current frame;
[0151] Construct a time series flow three-dimensional convolutiona...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com