Dense crowd counting method of multi-scale convolutional neural network

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
A technology of convolutional neural network and dense crowd, applied in the field of dense crowd counting of multi-scale convolutional neural network, can solve the problems of gradient disappearance, poor feature extraction, over-fitting of deep convolutional neural network, etc. Effects of denaturation, guaranteed scale invariance, and low hardware requirements

Pending Publication Date: 2020-11-17

BEIJING NORMAL UNIV ZHUHAI

View PDF0 Cites 0 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

Looking at all network structures, it is not difficult to find that deep convolutional neural networks and multi-column convolutional neural networks are the most used. Although deep convolutional neural networks and multi-column convolutional neural networks perform well in dense crowd scenes, but The deep convolutional neural network is easy to cause overfitting, and because the depth of the network structure is too deep, there may be a problem of gradient disappearance, which also increases the difficulty of training the network model. It is difficult to train multi-column convolutional neural networks. First, each column needs to be pre-trained, and then the whole is trained, and the network structure with shallower depth and fewer columns performs poorly in feature extraction, and is easily affected by the multi-scale problem of the image.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0052] Embodiment 1: to the inspection of mainstream data set Shanghai Tech Part_A data set;

[0053] Shanghai Tech dataset: The Shanghai Tech dataset consists of 1198 pictures, of which 482 pictures belong to Shanghai Tech Part_A, and 716 pictures belong to Shanghai Tech Part_B. Part_A and Part_B are two independent parts, such as Figure 4 shown;

[0054] Among them, Shanghai Tech Part_A has a relatively dense crowd, and includes indoor and outdoor crowd pictures, including color pictures and black and white pictures;

[0055] The training set and test set of Part_A are 300 pictures and 182 pictures respectively. Among these pictures, the number of people is 3139, and the number of people is 33, such as Figure 5 As shown, in the training set of Part_A, there are 230 pictures with the largest number of people under 700 people, and only 1 picture with the least number of people with more than 2800 people and less than 3500 people.

[0056] In Part_A, the network model of th...

Embodiment 2

[0057] Example 2: Inspection of the mainstream data set Shanghai Tech Part_B data set

[0058] Shanghai Tech dataset: The Shanghai Tech dataset consists of 1198 pictures, of which 482 pictures belong to Shanghai Tech Part_A, and 716 pictures belong to Shanghai Tech Part_B. Part_A and Part_B are two independent parts, such as Figure 4 shown;

[0059] The Shanghai Tech dataset Part_B has 716 pictures, all of which were taken from above on the streets of Shanghai, including black and white and color pictures. The training set and test set of Part_B are 400 pictures and 316 pictures respectively. Such as Image 6 As shown, in the training set of Part_B, there are 213 pictures with the largest number of people under 100; the least pictures with more than 400 people and less than 500 people are only 7 pictures; in the test set of Part_B, the pictures with the number of people less than 100 The most, there are 169 pictures; the pictures with more than 500 people and less than 600...

Embodiment 3

[0061] Example 3: Inspection of the mainstream data set UCF_CC_50 data set

[0062] The UCF_CC_50 data set has only 50 black and white pictures in total, and there is no training set and test set, such as Figure 4 shown.

[0063] The number of people in these pictures has a large span, the largest number is 4543 people, and the smallest number is 94 people. The distribution of the number of people in the data set pictures is as follows Figure 7 As shown, there are 25 pictures with the number of people less than 1000 the most; pictures with more than 3000 people and less than 4000 people are the least, only 2 pictures.

[0064] On the UCF_CC_50 dataset, the network model of the SPMsCNN algorithm takes about 1.34 seconds to run a picture. Such as Figure 9 As shown, the SPMsCNN algorithm achieved the best experimental results on the UCF_CC_50 dataset. Compared with the ADR (ADeeply-Recursive, ADR) algorithm, the MAE and MSE of the SPMsCNN algorithm were reduced by 36.8 and ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention discloses a dense crowd counting method of a multi-scale convolutional neural network, a network structure adopting an SPMsCNN algorithm adopts a multi-scale convolutional neural networkalgorithm based on accumulation pooling, a multi-scale module in the algorithm is provided with a plurality of filters with different sizes, more character feature information is extracted, and the information is reserved through feature splicing. Meanwhile, accumulation pooling is introduced, a traditional pooling layer is improved under the condition that parameters and hyper-parameters are notintroduced, the characteristic that the maximum pooling can guarantee the scale invariance of a network model is utilized, the pooling layers with small filters are overlaid, the scale invariance isguaranteed in a larger range, fine-grained information is reserved, and meanwhile, the pooling performance of the network model is improved. The number of network structure layers of the SPMsCNN algorithm is not large, and the phenomenon of gradient disappearance or gradient explosion cannot be caused.

Description

【Technical field】 [0001] The invention relates to big data statistics and collection algorithm technology, in particular to a multi-scale convolutional neural network dense crowd counting method. 【Background technique】 [0002] The methods of counting the number of dense crowds can be roughly divided into three categories: detection-based methods, regression-based methods, and density estimation-based methods. The detection method, as the name implies, is to detect people. As a detection standard, except The overall characteristics of the person, as well as the characteristics of the unique parts of the person, such as the head, shoulders, ears, etc., in order to effectively reduce the impact of the occlusion problem on the experiment, many detection-based methods are based on the detection of human body parts; The detection-based method first needs to preprocess the picture, then extract the foreground of the picture, and then use the detector to judge which is the person i...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06K9/00G06K9/46G06N3/04G06N3/08

CPCG06N3/08G06V20/53G06V10/464G06N3/045Y02T10/40

Inventor 杨戈

Owner BEIJING NORMAL UNIV ZHUHAI

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Dense crowd counting method of multi-scale convolutional neural network

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology