Target tracking method, system, device and medium based on two-stream convolution neural network
A convolutional neural network and neural network technology, applied in image data processing, instruments, computing, etc., can solve problems such as lack of grasp of motion information, insufficient video timing, lack of large amounts of data, etc., to achieve good network generalization ability, Universal and universal, accurate tracking effect
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Example Embodiment
[0074] Example 1:
[0075] The three-dimensional convolutional neural network is a type of convolutional neural network. It originated in the fields of motion, limb, and gesture detection. It is different from the two-dimensional convolutional neural network commonly used in image classification and detection. It adds a time dimension. It has excellent time-series feature expression ability and was later introduced into the field of video classification and retrieval.
[0076] Different from tasks such as image classification, the visual target tracking task not only needs to extract the characteristics of the target itself, but also needs to extract the movement change information of the target between video frames, that is, the timing characteristics. The present invention provides a target tracking method based on a dual-stream convolutional neural network. The method applies a three-dimensional convolutional neural network to the field of visual target tracking for the first ti...
Example Embodiment
[0121] Example 2:
[0122] Such as Figure 7 As shown, this embodiment provides a target tracking system based on a dual-stream convolutional neural network. The system includes a first building module 701, a second building module 702, an additive fusion module 703, a third building module 704, and a bounding box. Regression module 705, offline training module 706 and online fine-tuning module 707, the specific functions of each module are as follows:
[0123] The first construction module 701 is used to construct a spatial stream two-dimensional convolutional neural network to extract the feature information of the image block in the current frame. The first construction module 701 is as Figure 8 As shown, specifically including:
[0124] The first input unit 7011 is configured to perform Gaussian sampling of S image blocks in the current frame based on the target neighborhood in the previous frame of the current frame, as the input of the spatial stream two-dimensional convolutio...
Example Embodiment
[0148] Example 3:
[0149] This embodiment provides a computer device. The computer device may be a desktop computer, which includes a processor, a memory, a display, and a network interface connected through a system bus. The processor of the computer device is used to provide computing and control capabilities. The memory of the device includes a non-volatile storage medium and an internal memory. The non-volatile storage medium stores an operating system, a computer program, and a database. The internal memory is the operation of the operating system and the computer program in the non-volatile storage medium. An environment is provided, and when the processor executes the computer program stored in the memory, it realizes the target tracking method of the above embodiment 1, as follows:
[0150] Construct a spatial stream two-dimensional convolutional neural network to extract the characteristic information of the image block in the current frame;
[0151] Construct a time serie...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap