Unlock instant, AI-driven research and patent intelligence for your innovation.

Target detection method and system based on multi-scale feature map reconstruction and knowledge distillation

A multi-scale feature, target detection technology, applied in the field of computer vision target detection, can solve the problems of optimization and high time complexity of YOLOv3

Active Publication Date: 2020-09-04
NANJING UNIV OF POSTS & TELECOMM
View PDF2 Cites 24 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Low-resolution, semantically strong features are upsampled and combined with high-resolution, semantically weak features to build a feature pyramid that shares rich semantics at all levels, but it still has a lot of room for improvement, For example, [Shu Liu, Lu Qi, Haifang Qin, Jianping Shi, and Jiaya Jia. Path aggregation network for instance segmentation. In CVPR, 2018] manually designed the fusion structure and enhanced feature fusion to improve the detection accuracy a lot, but these algorithms It has not been optimized in combination with YOLOv3 and the actual scene, and there is still a lot of room for improvement in feature map reconstruction
[0005] For the model compression method of target detection, many previous works have been proposed to compress large CNNs or directly learn more effective CNN models for fast reasoning, such as literature [E.L.Denton, W.Zaremba, J.Bruna, Y.LeCun, and R.Fergus.Exploiting linear structure within convolutional networks efficient evaluation.In NIPS,2014.] Applied low-rank approximation, literature [S.Han,J.Pool,J.Tran,and W.Dally.Learning both weights and connections for Efficient neural network.In NIPS, pages 1135–1143,2015.] used weight pruning, etc., but most of these techniques require specially designed software / hardware accelerators to accelerate execution, model compression for target detection on embedded devices There are relatively few methods, and the current compression algorithm compresses YOLOv3 with high time complexity, which cannot well complete the target detection tasks in embedded device application scenarios (such as pedestrian and vehicle target detection in intelligent transportation)

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Target detection method and system based on multi-scale feature map reconstruction and knowledge distillation
  • Target detection method and system based on multi-scale feature map reconstruction and knowledge distillation
  • Target detection method and system based on multi-scale feature map reconstruction and knowledge distillation

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0067] The technical scheme of the present invention is described in detail below in conjunction with accompanying drawing:

[0068] Such as figure 1 As shown, a target detection method based on multi-scale feature map reconstruction and knowledge distillation disclosed in the embodiment of the present invention, taking pedestrian and vehicle detection as an example, uses the target detection algorithm YOLOv3 [Redmon J, Farh adiA.Yolov3:An incremental improvement[J].arXiv preprint arXiv:1804.027 67,2018] extracts features from the CityStreet urban street view dataset provided by City University of Hong Kong, generates multi-scale feature maps, and then compresses the feature maps along the spatial dimension to compress each A two-dimensional feature channel is compressed into a real number with a global receptive field, and its output dimension matches the number of input feature channels. Weights are generated for each feature channel by modeling, and then the weights are wei...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a target detection method and system based on multi-scale feature map reconstruction and knowledge distillation. The method comprises the steps that firstly, a backbone networkDarkne-53 is utilized to extract features, and deep features are subjected to up-sampling and shallow feature tensor splicing to generate a multi-scale feature map; then, a feature re-calibration strategy is adopted to automatically obtain the weight of each channel in the feature map, useful features are improved according to the weights, useless features are inhibited, and semantic informationof top-layer features and detail information of bottom-layer features are fused through a residual error module; the gamma coefficients of the batch normalization layer in the backbone network are introduced into a pruning target function for training, and a channel where the gamma coefficients lower than a threshold are located is removed from the model according to a pruning threshold; and finally, the trained YOLOv3 reference model is taken as a teacher network, and the pruned model is taken as a student network to perform knowledge distillation. The method and system improve the detectionprecision of objects with different sizes in a large range, reduce the calculation amount of the model, and improve the detection speed of the model.

Description

technical field [0001] The invention provides a target detection method and system based on multi-scale feature map reconstruction and knowledge distillation, belonging to the technical field of target detection in computer vision. Background technique [0002] Image target recognition is a research topic involving computer vision, pattern recognition and artificial intelligence. With the rapid development of hardware technology, embedded smart devices based on deep learning platforms are becoming more and more mature, and more and more detection algorithms are embedded in to smart devices, but the traditional detection method has a large difference in the detection accuracy of targets of different sizes within a certain range, cannot accurately identify targets, and cannot meet daily needs, and the traditional detection algorithm has too many model parameters, and the required computing power is relatively large. Therefore, it is necessary to propose a technology that can n...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06K9/62G06N3/04G06N3/08
CPCG06N3/084G06V10/751G06V2201/07G06N3/044G06N3/045G06F18/241G06F18/253G06F18/214
Inventor 刘天亮平安戴修斌邹玉龙
Owner NANJING UNIV OF POSTS & TELECOMM
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More