Video target detecting method based on cascade regression convolutional neural network

A convolutional neural network and cascade regression technology, applied in the field of image information technology processing, can solve the problems of affecting detection performance, low video target score, and inability to integrate video spatiotemporal information and context information, etc., to achieve improved effect and robustness and the effect of good accuracy and suppression of abnormal conditions

Active Publication Date: 2018-05-22
NANJING UNIV OF INFORMATION SCI & TECH
View PDF5 Cites 31 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

First of all, the region (region) obtained by the RPN network in each frame image is not always reliable, because the sharp appearance change of the target in the video will affect the recall rate of the network
Secondly, the

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Video target detecting method based on cascade regression convolutional neural network
  • Video target detecting method based on cascade regression convolutional neural network
  • Video target detecting method based on cascade regression convolutional neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0037] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0038] Unless the context clearly states otherwise, the number of elements and components in the present invention can exist in a single form or in multiple forms, and the present invention is not limited thereto. Although the steps in the present invention are arranged with labels, they are not used to limit the order of the steps. Unless the order of the steps is clearly stated or the execution of a certain step requires other steps as a basis, the relative order of the steps can be adjusted. It can be understood that the term "and / or" used herein refers to and covers any and all possible combina...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a video target detecting method based on a cascade regression convolutional neural network. The video target detecting method comprises the following steps of 1, inputting a video image sequence, and performing CNN characteristic extraction on all image frames of the whole video sequence through the convolutional neural network; 2, classifying the last convolutional characteristic layer of the CNN characteristic by means of an RPN network for obtaining a suggested area, performing cascade classification and regression on the suggested area through a multiscale convolutional characteristic, and obtaining a static picture detecting result of each image frame; 3, using the results with confidences which are larger than 0.6 in the detecting results that are obtained in the step 2 as initial tracking values, tracking the target on the conv5-3 convolutional characteristic of the CNN characteristic through related filtering for obtaining a time sequence suggested area,performing cascade classification and regression on the time sequence suggested area, and obtaining a time sequence detecting result; and 4, suppressing abnormal values in the static picture detectingresults and the time sequence detecting results through a co-occurrence matrix, thereby obtaining a final detecting result.

Description

technical field [0001] The invention belongs to the field of image information technology processing, and in particular relates to a video target detection method based on a cascade regression convolutional neural network. Background technique [0002] Object recognition is a method of automatically locating objects in images. It is a basic problem in the field of computer vision and has applications in many fields, such as monitoring, human-computer interaction, and medical assistance. Early methods can effectively detect single-category objects in images, such as faces and pedestrians, through sliding windows or cascaded classifiers, but cannot detect multi-category objects. [0003] In recent years, thanks to the development of convolutional neural networks, multi-category object detection technology has been significantly improved. Among them, the region-based convolutional neural network consists of region proposal and region classification, and R-CNN transforms target...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/00G06K9/62G06N3/04
CPCG06V20/48G06V20/46G06N3/045G06F18/241
Inventor 刘青山帅惠袁晓彤
Owner NANJING UNIV OF INFORMATION SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products