Pedestrian detection method based on feature pyramid network

A feature pyramid and pedestrian detection technology, applied in the fields of image processing and pattern recognition, can solve problems such as poor generalization performance, difficult to apply real-time industrial application scenarios, and loss of precision, and achieve a wide range of applications

Inactive Publication Date: 2018-11-16
上海贵和软件技术有限公司
View PDF7 Cites 31 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Among them, MTCNN (face detection) adopts small neural network classification and sliding window for fast detection, which achieves good detection effect and speed, but the generalization performance is poor, and it needs to be retrained for specific scenes.
YOLO and SSD use a deep convolutional network to realize the detection process by classifying and biasing the anchor points of each feature map at one time. Such methods are faster in speed and have better generalization performance, but in terms of accuracy loss
The performance of the two-stage Faster RCNN algorithm is the best compared to other types, but due to the intervention of the fully connected layer, the amount of calculation is greatly increased, and it is difficult to apply to industrial application scenarios that require real-time performance.
[0006] On the one hand, the deepening of network complexity will improve the effect of the network, and at the same time increase the amount of calculation

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Pedestrian detection method based on feature pyramid network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0024] see figure 1 , the pedestrian detection method based on the feature pyramid network, including the following steps:

[0025] S1. Establish a FOSFPN network;

[0026] S2. Read the video frame and perform preprocessing;

[0027] S3. Propagate the frame image forward to obtain the detection frame;

[0028] S4. Perform non-maximum value suppression on the detection results to filter redundant detection frames;

[0029] S5. Standardize the detection frame beyond the boundary to obtain the final detection result.

[0030] The present invention designs a new network structure: the Fast One Stage Feature Pyramid Network [Fast One Stage Feature Pyramid Network (FOSFPN)], using a lightweight deep neural network (MobileNet) as the basic network structure to construct a feature pyramid, And the classification loss function and regression loss function are respectively added on the feature maps of different scales for network training.

[0031] The feature pyramid structure com...

Embodiment 2

[0038] The difference from the above embodiment is that the preprocessing of reading the video frame is to subtract the average value of the image from the image sent to the FOSFPN network and normalize to (0,1).

[0039] The reading of the video frame described in this embodiment is to perform mean value processing on the image. Subtract the corresponding mean value for each channel, and then divide by 255 to get the normalized input data;

[0040] The forward propagation of the frame image is to send the frame image into the trained FOSFPN network for forward propagation operation, and the output of each layer of pyramid network is used to calculate the classification results and regression results respectively.

[0041] The classification result represents the probability that the frame image anchor target is a pedestrian or the background; the regression result represents the deviation of the prediction result relative to the frame image anchor point; the anchor point whos...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a pedestrian detection method based on a feature pyramid network. The method comprises the following steps of S1, establishing an FOSFPN; S2, reading video frames and performing preprocessing; S3, forwards transmitting frame images, and obtaining detection boxes; S4, performing non-maximum suppression on a detection result to filter repeated and redundant detection boxes; and S5, standardizing the detection boxes exceeding a boundary, thereby obtaining a final detection result. According to the pedestrian detection method based on the feature pyramid network, provided by the invention, the calculation amount can be reduced; pedestrian identification is taken as an application background; and the performance loss caused by network simplification is reduced.

Description

technical field [0001] The invention belongs to the field of image processing and pattern recognition, and relates to a pedestrian detection method based on a feature pyramid network. Background technique [0002] Feature pyramids are a fundamental building block in multi-scale object detection systems. In recent years, deep learning object detection has deliberately avoided pyramid feature representation, because feature pyramids are expensive in terms of computation and memory. Therefore, the author uses the inherent multi-scale and multi-level pyramid structure of the deep convolutional neural network to construct a feature pyramid network. Using a top-down side connection, a high-level semantic feature map is constructed at all scales. This structure is called Feature Pyramid Network (FPN). It has improved significantly in feature extraction. Using FPN on Faster R-CNN, on the COCO dataset, the result exceeds all current single-model detection methods, and can run up to...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/00G06K9/46G06K9/62G06N3/04G06N3/08
CPCG06N3/08G06V40/103G06V20/41G06V10/462G06N3/045G06F18/2414
Inventor 王慧燕
Owner 上海贵和软件技术有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products