Non-rigid multiscale object detection method based on convolutional neural network

A convolutional neural network and object detection technology, applied in the field of detection of non-rigid multi-scale objects, can solve the problems of low universality and robustness of detection methods, and achieve a simple framework, improved performance, and high portability Effect

Pending Publication Date: 2018-03-20
INST OF OPTICS & ELECTRONICS - CHINESE ACAD OF SCI
View PDF5 Cites 35 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The technical problem to be solved by the present invention is: in the real-time detection of targets on mobile devices and small-volume devices, due to the non-rigid multi-scale target shapes and angles in different scenes are changeable, the universality and robustness of traditional detection methods are not high

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Non-rigid multiscale object detection method based on convolutional neural network
  • Non-rigid multiscale object detection method based on convolutional neural network
  • Non-rigid multiscale object detection method based on convolutional neural network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0023] The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.

[0024] Such as figure 1 As shown, the present invention is a non-rigid multi-scale object detection method based on convolutional neural network. The specific steps of the method are as follows:

[0025] Step 1. Annotated image;

[0026] figure 1 represents the entire training process. Input the labeled training data (image to be labeled) into the neural network, and obtain the output result through forward propagation. Compare the output result with the label to get the error (measured by the loss function, the goal of training is to reduce the loss function), update the parameters through the error gradient backpropagation of the neural network, and so on, so that the loss function decreases and converges.

[0027] In the training process, in addition to using the traditional translation, multi-scale, and image contrast adjustment metho...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a non-rigid multiscale object detection method based on a convolutional neural network, which comprises the steps of firstly designing a deep network, performing deep network learning and training for target detection by infrared pedestrian data sets with certain representativeness cvc-09, cvc-14 and self marked small target pedestrian data, wherein the infrared pedestriandata sets have small targets with the pixel being about 20*30 and big target pedestrians at a close range; then designing different detection models based on a hardware platform, and evaluating the detection models. The designed network applies 7*7 grid partition and applies six candidate boxes in allusion to each grid, target area nomination is accomplished, and pedestrian target detection is achieved through classification recognition and location regression analysis. The neural network is applied in the design, and the feature extraction ability of the detection model is increased. The real-time performance of detection is realized based on the idea of grid partition, the number of the candidate boxes is increased by applying data sets with different target scales and multiscale training, and finally the small target detecting ability is enhanced.

Description

technical field [0001] The invention relates to the technical field of real-time object detection, in particular to a non-rigid multi-scale object detection method based on a convolutional neural network. Background technique [0002] Such as image 3 As shown, the neuron is a mathematical model that imitates the biological neural structure. Different inputs (such as different pixels) are summed by weighting (such as the convolution operation in the convolutional neural network), and the nonlinear function ( activation function) to obtain the output of the neural node. Such as Figure 4 As shown, many neural nodes can form a neural layer, and many neural layers can build a neural network. The neural network exhibits a strong fitting ability (learning ability) to the training data due to the input through multi-layer nonlinear transformation (nesting of nonlinear functions), thus showing good predictive ability. [0003] For neural network models, there are two phases, tra...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/00G06N3/08
CPCG06N3/084G06V40/10
Inventor 饶江浩徐智勇张建林
Owner INST OF OPTICS & ELECTRONICS - CHINESE ACAD OF SCI
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products