Video target detection method and device based on sparse foreground prior, and storage medium

A target detection and foreground technology, which is applied in the field of video target detection based on sparse foreground prior, can solve the problems of high computing cost, difficulty in mining the timing characteristics of video data, and the inability to effectively use the sparse foreground prior of video data timing continuity, etc. Achieve the effect of improving detection speed and detection accuracy

Active Publication Date: 2021-03-02
XIDIAN UNIV
View PDF5 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, there are new challenges in applying image detection techniques directly to video detection tasks
First, applying the image object detection network directly to each frame in the video for detection will bring h...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Video target detection method and device based on sparse foreground prior, and storage medium
  • Video target detection method and device based on sparse foreground prior, and storage medium
  • Video target detection method and device based on sparse foreground prior, and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0047] The following will clearly and completely describe the technical solutions in the embodiments of the present invention with reference to the accompanying drawings in the embodiments of the present invention. Obviously, the described embodiments are some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0048] The present invention provides a video target detection method based on sparse foreground prior. First, the foreground extraction method based on orthogonal subspace learning is used to obtain the motion sparse foreground prior image of each frame of the video; and then the ResNet feature extraction network and feature The pyramid structure extracts the multi-scale semantic enhancement features of the video frame and its sparse foreground im...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a video target detection method based on sparse foreground prior, a storage medium and equipment. A foreground extraction method based on orthogonal subspace learning is adopted to calculate and obtain a sparse foreground prior graph corresponding to each frame in a video; utilizing the ResNet feature extraction network and the feature pyramid structure to obtain semantic enhancement feature maps of the video frame and the sparse foreground image thereof; cascading the semantic enhancement feature map of the sparse foreground prior map with the semantic enhancement feature map of the current frame, and performing convolution fusion operation to obtain foreground prior fusion features of the current frame; mapping on each pixel of the foreground prior fusion featuremap to generate a candidate anchor box; and inputting the foreground prior fusion features and all anchor boxes into the trained classification and regression sub-network to obtain the category and position coordinates of the target object. The sparse foreground prior of the video data is fully mined, and the target detection accuracy is improved.

Description

technical field [0001] The invention belongs to the technical field of computer vision, and in particular relates to a video target detection method, storage medium and equipment based on sparse foreground prior. Background technique [0002] Computer vision is an important field of artificial intelligence that trains computers to learn and understand real-world vision. With the help of pictures and videos and deep learning models, it is possible to accurately classify and identify the targets of interest, and then make further judgments and processing. Computer vision is generally divided into main tasks such as image recognition, object detection, and instance segmentation. Among them, the classification task generally gives the content description of the entire picture, while the detection task pays more attention to specific objects of interest, requiring both recognition and positioning results of the object of interest. Compared with the classification task, detectio...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/00G06K9/62
CPCG06V20/41G06V20/46G06V2201/07G06F18/253G06F18/214Y02T10/40
Inventor 古晶巨小杰马文萍孙新凯刘芳杨淑媛焦李成冯婕
Owner XIDIAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products