Crowd counting and positioning method based on an encoder-decoder structure
A crowd counting and positioning method in the field of computer vision. It addresses problems of existing approaches, such as counting procedures that are less simple than density-map methods, performance degradation, and weak positioning performance. It achieves simple counting, improved robustness, and excellent positioning performance.
Examples
Embodiment 1
[0058] A label map generation method, comprising the following steps:
[0059] Step S1, build a data set: first collect image data of crowds in different environments from real scenes, then mark the data;
[0060] Step S2, generate a label map: the label map is generated from the marked data as follows:
[0061]
[0062]
[0063]
[0064] Here, B is the set of coordinates of the marked points, and (x', y') are the pixel coordinates of a marked point in the label map, where x' is its abscissa and y' is its ordinate. (x, y) are the pixel coordinates of any point in the image, where x is the abscissa and y is the ordinate. P(x, y) is the distance from the point (x, y) to the nearest marked point, and I(x, y) is the corresponding...
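As a minimal sketch of the distance term defined above, the following pure-Python snippet computes P(x, y) for every pixel as the distance to the nearest marked point in B. The use of Euclidean distance, the function name `nearest_point_distance`, and the toy coordinates are assumptions for illustration only. The mapping from P(x, y) to the label value I(x, y) is deliberately left out, since it is not specified in the text here.

```python
import math


def nearest_point_distance(height, width, points):
    """Return a height x width grid of distances to the nearest marked point.

    points: iterable of (x', y') pixel coordinates (the set B in the text).
    Euclidean distance is an assumption; the text only says "distance".
    """
    points = list(points)
    if not points:
        raise ValueError("at least one marked point is required")
    grid = []
    for y in range(height):
        row = []
        for x in range(width):
            # P(x, y): minimum over all marked points of the pixel distance.
            row.append(min(math.hypot(x - px, y - py) for px, py in points))
        grid.append(row)
    return grid


# Toy example: a 4 x 6 image with two marked head positions (hypothetical data).
B = [(1, 1), (4, 2)]
P = nearest_point_distance(4, 6, B)
```

This brute-force loop costs one distance evaluation per pixel per marked point, which is fine for illustration. For full-size images, a Euclidean distance transform such as `scipy.ndimage.distance_transform_edt` would likely be a better fit.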
Embodiment 2
[0068] This embodiment addresses crowd counting and positioning: an algorithm outputs the number of people in an image and their positions.
[0069] The counting experiments use the public datasets SHHA, SHHB and UCF_CC_50. SHHA contains 300 training images and 182 test images; SHHB contains 400 training images and 316 test images; UCF_CC_50 contains 50 images.
[0070] First, the label generation method proposed by the present invention is used to convert the annotations of the above datasets into label maps for training and testing.
[0071] Second, the network model is built. The overall structure of the algorithm is shown in Figure 1. The encoding part consists of a 7×7 convolution, a max pooling layer, Res-1, Res-2, and Res-3. Except that the 7×7 convolution uses a stride of 1, the structure is the same as that of ResNet50. Taking a 3×256×256 input image as an example, after 7×7 convolutio...
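To make the encoder description more concrete, the sketch below traces feature-map shapes through the stages named above for a 3×256×256 input. It rests on assumptions not stated in the text: the padding values and channel widths follow the common ResNet50 layout (64, 256, 512, 1024), and Res-1, Res-2 and Res-3 are taken to be the first three residual stages, with the latter two halving the resolution. The helper names `conv_out` and `encoder_shapes` are hypothetical.

```python
def conv_out(size, kernel, stride, padding):
    """Spatial output size of a convolution or pooling layer (floor mode)."""
    return (size + 2 * padding - kernel) // stride + 1


def encoder_shapes(channels=3, size=256):
    """Trace (channels, height, width) through the encoder stages.

    Assumed, not taken from the text: ResNet50-style padding and channel
    widths, and Res-1/2/3 being the first three residual stages.
    """
    shapes = [("input", (channels, size, size))]

    # 7x7 convolution with stride 1 (the one stated difference from ResNet50).
    size = conv_out(size, kernel=7, stride=1, padding=3)
    shapes.append(("conv7x7", (64, size, size)))

    # 3x3 max pooling with stride 2, as in standard ResNet50.
    size = conv_out(size, kernel=3, stride=2, padding=1)
    shapes.append(("maxpool", (64, size, size)))

    # Each residual stage changes resolution through a 3x3 conv with padding 1.
    for name, out_ch, stride in (("Res-1", 256, 1), ("Res-2", 512, 2), ("Res-3", 1024, 2)):
        size = conv_out(size, kernel=3, stride=stride, padding=1)
        shapes.append((name, (out_ch, size, size)))
    return shapes


for stage, shape in encoder_shapes():
    print(stage, shape)
```

Under these assumptions, the stride-1 first convolution keeps the 256×256 resolution, so the encoder output is 1024×32×32 rather than the 1024×16×16 an unmodified ResNet50 would give at the same stage.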