Non-rigid multiscale object detection method based on convolutional neural network

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A convolutional neural network and object detection technology, applied in the field of detection of non-rigid multi-scale objects, can solve the problems of low universality and robustness of detection methods, and achieve a simple framework, improved performance, and high portability Effect

Pending Publication Date: 2018-03-20

INST OF OPTICS & ELECTRONICS - CHINESE ACAD OF SCI

View PDF5 Cites 35 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0005] The technical problem to be solved by the present invention is: in the real-time detection of targets on mobile devices and small-volume devices, due to the non-rigid multi-scale target shapes and angles in different scenes are changeable, the universality and robustness of traditional detection methods are not high

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0023] The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.

[0024] Such as figure 1 As shown, the present invention is a non-rigid multi-scale object detection method based on convolutional neural network. The specific steps of the method are as follows:

[0025] Step 1. Annotated image;

[0026] figure 1 represents the entire training process. Input the labeled training data (image to be labeled) into the neural network, and obtain the output result through forward propagation. Compare the output result with the label to get the error (measured by the loss function, the goal of training is to reduce the loss function), update the parameters through the error gradient backpropagation of the neural network, and so on, so that the loss function decreases and converges.

[0027] In the training process, in addition to using the traditional translation, multi-scale, and image contrast adjustment metho...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a non-rigid multiscale object detection method based on a convolutional neural network, which comprises the steps of firstly designing a deep network, performing deep network learning and training for target detection by infrared pedestrian data sets with certain representativeness cvc-09, cvc-14 and self marked small target pedestrian data, wherein the infrared pedestriandata sets have small targets with the pixel being about 20*30 and big target pedestrians at a close range; then designing different detection models based on a hardware platform, and evaluating the detection models. The designed network applies 7*7 grid partition and applies six candidate boxes in allusion to each grid, target area nomination is accomplished, and pedestrian target detection is achieved through classification recognition and location regression analysis. The neural network is applied in the design, and the feature extraction ability of the detection model is increased. The real-time performance of detection is realized based on the idea of grid partition, the number of the candidate boxes is increased by applying data sets with different target scales and multiscale training, and finally the small target detecting ability is enhanced.

Description

technical field [0001] The invention relates to the technical field of real-time object detection, in particular to a non-rigid multi-scale object detection method based on a convolutional neural network. Background technique [0002] Such as image 3 As shown, the neuron is a mathematical model that imitates the biological neural structure. Different inputs (such as different pixels) are summed by weighting (such as the convolution operation in the convolutional neural network), and the nonlinear function ( activation function) to obtain the output of the neural node. Such as Figure 4 As shown, many neural nodes can form a neural layer, and many neural layers can build a neural network. The neural network exhibits a strong fitting ability (learning ability) to the training data due to the input through multi-layer nonlinear transformation (nesting of nonlinear functions), thus showing good predictive ability. [0003] For neural network models, there are two phases, tra...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & Authority Applications(China)

IPC IPC(8): G06K9/00G06N3/08

CPCG06N3/084G06V40/10

Inventor 饶江浩徐智勇张建林

Owner INST OF OPTICS & ELECTRONICS - CHINESE ACAD OF SCI

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Non-rigid multiscale object detection method based on convolutional neural network

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology