Training method of crowd counting network

A technology of crowd counting and training method, applied in the field of crowd counting network training, can solve the problems of increasing crowd counting, gradient explosion, parameter increase, etc., and achieve the effect of increasing network depth

Pending Publication Date: 2022-03-08
SHANGHAI INST OF TECH
View PDF4 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, in actual situations, due to the variable shooting angles, the sizes of the heads in the pictures are not uniform, and there are serious occlusions and uneven distribution of crowds in high-density scenes, which will increase the difficulty of crowd counting and crowd positioning tasks.
The emergence of convolutional neural networks provides a better way to achieve these two tasks. Usually we want the depth of the network to be as deep as possible to better map the relationship between input and output, but as the depth of the network increases, The amount of parameters will increase, making network training difficult, and even causing gradient explosion or gradient disappearance

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Training method of crowd counting network
  • Training method of crowd counting network
  • Training method of crowd counting network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0054] In order to make the above objects, features and advantages of the present invention more comprehensible, the present invention will be further described in detail below in conjunction with the accompanying drawings and specific embodiments.

[0055] Facing the current problems of low network depth and multiple scale changes, the purpose of the present invention is to design a method that can extract multi-scale information and increase network depth.

[0056] The present invention provides a training method for a crowd counting network, including: S1. When performing crowd counting, the network framework is shown in figure 1 , including the following steps:

[0057] The front end of the encoder of the S1-1 network uses the first ten layers of VGG16_bn to input the sample picture to the front end of the encoder to extract the feature information of the picture.

[0058] S1-2 The feature map extracted by the front end is sent to the back end of the encoder, which uses f...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a training method of a crowd counting network, and the method provided by the invention effectively extracts multi-scale information through a plurality of groups of pyramid convolution kernels with different voidage rates, and solves the problem that the sizes of heads are not uniform. By adding batch normalization to each layer of output, the problem of difficulty in training caused by increase of the depth of the network is solved, meanwhile, the depth of the network is further improved through a residual structure under the condition that the parameter quantity is not increased, and high robustness is achieved.

Description

technical field [0001] The invention relates to a training method for a crowd counting network. Background technique [0002] Crowd counting and crowd localization is an important task in current computer vision. However, in actual situations, due to the variable shooting angles, the sizes of the heads in the pictures are not uniform, and there are serious occlusions and uneven distribution of crowds in high-density scenes, which will increase the difficulty of crowd counting and crowd positioning tasks. The emergence of convolutional neural networks provides a better way to achieve these two tasks. Usually we want the depth of the network to be as deep as possible to better map the relationship between input and output, but as the depth of the network increases, The amount of parameters will increase, making network training difficult, and even causing gradient explosion or gradient disappearance. Contents of the invention [0003] The object of the present invention is...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/04G06N3/08G06V40/10
CPCG06N3/08G06N3/045
Inventor 赵怀林梁兰军周方波
Owner SHANGHAI INST OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products