Model compression method and device, target detection equipment and storage medium

A target detection and compression method technology, applied in the field of deep learning, can solve the problems of poor convolution layer effect, loss of model accuracy, poor versatility, etc., to avoid rough cropping problems, improve input-output ratio, and ensure detection accuracy Effect

Pending Publication Date: 2020-06-23
ZICT TECH CO LTD
View PDF3 Cites 14 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in order to maximize the input-output ratio of the project, a GPU (image processor) server is required to support real-time detection of targets by multiple cameras, and the existing YOLO v3 detection speed (about 16.5ms) is difficult to support real-time detection of multiple cameras
[0003] At present, the following methods are generally used to reduce the scale of the YOLO v3 model and expand its practicability: 1. The low-rank decomposition method can compress the model size by 3 times, but the operation speed has not been significantly improved, and the effect on the convolutional layer is not good; 2. Weight Quantization (Weight Quantization) method, although grouping and shared weight methods can be used to compress the number of parameters required for storage, but there is no compression on the Run-time memory (running memory); 3. Binarization weight 4. Weight pruning / sparsifying (Weight Pruning / Sparsifying) method, based on kernel sparsification for model compression, adding regular term induction to the update of weights, so that most of the weights are It is 0, but after pruning, general-purpose runtime libraries (such as conv2d) cannot be called directly, and a new computing interface needs to be designed separately, which requires dedicated hardware or code library support, and poor versatility

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Model compression method and device, target detection equipment and storage medium
  • Model compression method and device, target detection equipment and storage medium
  • Model compression method and device, target detection equipment and storage medium

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0042] Such as figure 1 As shown, according to an embodiment of the first aspect of the present invention, a compression method of a target detection model is proposed, the method comprising:

[0043] Step 102, acquiring a sample image, and constructing a standard detection model;

[0044] Step 104, setting the regular term of the scaling coefficient of the batch normalization layer in the standard detection model;

[0045] Step 106, perform sparse training on the standard detection model according to the regularization item;

[0046] Step 108, pruning the standard detection model according to the scaling factor;

[0047] Step 110, fine-tuning the pruned standard detection model to obtain a target detection model.

[0048] In this embodiment, the standard detection model is constructed according to the sample image, and the original scaling coefficient of the batch normalization (BN, Batch Normalization) layer in the standard detection model is used to measure the importanc...

Embodiment 2

[0055] Such as figure 2 As shown, according to an embodiment of the present invention, a compression method of a target detection model is proposed, the method comprising:

[0056] Step 202, acquire a sample image, and construct a standard detection model;

[0057] Step 204, setting the regular term of the scaling coefficient of the batch normalization layer in the standard detection model;

[0058] Step 206, perform sparse training on the standard detection model according to the regularization item;

[0059] Step 208, sort all the scaling factors according to the preset rules, and generate the serial number of each scaling factor starting from 1;

[0060] Step 210, according to the comparison result of the sequence number and the preset sequence number, delete the channel corresponding to the scaling factor;

[0061] Step 212, fine-tuning the pruned standard detection model to obtain a target detection model.

[0062] In this embodiment, after sparse training, all scali...

Embodiment 3

[0064] Such as image 3 As shown, according to an embodiment of the present invention, a compression method of a target detection model is proposed, the method comprising:

[0065] Step 302, acquiring a sample image, and constructing a standard detection model;

[0066] Step 304, setting the regular term of the scaling coefficient of the batch normalization layer in the standard detection model;

[0067] Step 306, perform sparse training on the standard detection model according to the regularization item;

[0068] Step 308, pruning the standard detection model according to the scaling factor;

[0069] Step 310, acquiring benchmark data of the standard detection model and test data of the target detection model;

[0070] Step 312, calculating the difference between benchmark data and test data;

[0071] Step 314, whether the difference is greater than the reference threshold, if so, enter step 304, if not, enter step 316;

[0072] Step 316, storing the target detection mo...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a model compression method and device, target detection equipment and a storage medium. The compression method comprises the following steps: acquiring a sample image, and constructing a standard detection model; setting a regular term of a scaling coefficient of a batch standardization layer in the standard detection model; performing sparse training on the standard detection model according to the regular term; pruning the standard detection model according to the scaling coefficient; and performing fine adjustment processing on the pruned standard detection model to obtain a target detection model. According to the compression method, the detection precision of the target detection model can be guaranteed, the detection speed of the target detection model is greatly increased, the input-output ratio of actual engineering is greatly increased, the compression method can be directly operated on a mature framework, support of a special algorithm library is not needed, and the universality is improved.

Description

technical field [0001] The present invention relates to the technical field of deep learning, in particular to a compression method of a target detection model, a compression device of a target detection model, a target detection device and a computer-readable storage medium. Background technique [0002] The deep convolutional network mainly uses end-to-end automatic learning to capture the characteristics of objects through strategies such as weight sharing and local connection, so that the network has stronger analytical capabilities and greatly improves the accuracy of target detection compared with traditional algorithms. Among them, the most representative one-stage (single-stage) target detection model YOLO (You Only Look One) iterates three versions, and YOLOv3 achieves a harmonious unity of speed and accuracy. However, in order to maximize the input-output ratio of the engineering project, a GPU (image processor) server is required to support real-time detection of ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/08G06N3/04
CPCG06N3/082G06N3/045
Inventor 邢军华欧阳一村贺涛曾志辉许文龙罗英群吕令广
Owner ZICT TECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products