Multi-target detection method and device and application thereof

A detection method and multi-target technology, which is applied in the field of devices and multi-target detection methods, can solve problems such as single feature, deprivation of the ability of different visual modes of convolution kernel, and limitation of convolution receptive field, so as to achieve the purpose and advantages of being concise and easy to understand Effect

Pending Publication Date: 2022-03-22
CITY CLOUD TECH HANGZHOU CO LTD
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The advantages of spatial invariance include: variable parameters such as parameter sharing and translation, and its shortcomings are also obvious: the extracted features are relatively single, and the parameters of the convolution kernel cannot be flexibly adjusted according to the input target size; while the channel-specific convolution There is redundancy in the product kernel in the channel dimension
That is to s

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-target detection method and device and application thereof
  • Multi-target detection method and device and application thereof
  • Multi-target detection method and device and application thereof

Examples

Experimental program
Comparison scheme
Effect test

Example Embodiment

[0061] Example 1

[0062] This embodiment provides a multi-target detection method, which aims to effectively extract finer-grained multi-scale spatial information, establish longer-distance channel dependencies, learn richer multi-scale feature representations, and adaptively The feature calibration is performed again on the multi-scale channel attention vector. .

[0063] specific reference figure 1 ,like figure 1 As shown, the method includes steps S1-S2:

[0064] Step S1: acquiring an image to be detected;

[0065] Step S2: Input the image to be detected into a multi-target detection model to obtain a detection result; wherein, the multi-target detection model includes a backbone network, a neck module and a prediction head connected in sequence; the backbone network includes a CBM module and a multi-target detection model. Each of the backbone layers includes a CSP module, and the convolutional layers of the residual units in the CSP module are replaced by segmentati...

Example Embodiment

[0081] Embodiment 2

[0082] This embodiment applies the method of Embodiment 1 to smart city management, such as managing road garbage, out-of-store operation, mobile operation, illegal parking, etc., and provides a city management detection method to capture real-time information through surveillance video, use Multi-target detection methods efficiently identify and manage target events that need to be managed.

[0083] In this embodiment, the violations that often occur in urban management, such as road garbage, out-of-store operations, mobile vendors, illegal parking, low-lying water on the road, random piles of materials, random parking of non-motor vehicles, drying along the street, green damage, violations Set multiple types of events such as billboards, road damage, and illegally posting small advertisements as management targets. That is, if the first detection result obtained by the first image to be processed is input into the multi-target detection model, including...

Example Embodiment

[0100] Embodiment 3

[0101] Based on the same concept as the first embodiment, this embodiment also provides a multi-target detection device for implementing the multi-target detection method described in the first embodiment. For details, refer to Figure 9 , Figure 9 is a structural block diagram of a multi-target detection device according to an embodiment of the present application, such as Figure 9 As shown, the device includes:

[0102] a first acquisition module, used for acquiring an image to be detected;

[0103] The first detection module is used for inputting the to-be-detected image into a multi-target detection model to obtain detection results; wherein the multi-target detection model includes a backbone network, a neck module and a prediction head that are connected in sequence; the backbone network includes A CBM module and a plurality of backbone layers, each of which includes a CSP module, and the convolutional layer of the residual unit in the CSP modu...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a multi-target detection method and device and application thereof. The method comprises the steps of obtaining a to-be-detected image; inputting the to-be-detected image into a multi-target detection model to obtain a detection result; wherein the multi-target detection model comprises a backbone network, a neck module and a prediction head which are connected in sequence; each trunk layer is used for obtaining a second feature map with multi-scale feature information and after channel attention weighting; the neck module is used for performing feature aggregation on the second feature maps output by different trunk layers to obtain an aggregated feature map; the prediction head is used for performing multi-target detection according to the aggregated feature map. The method not only can extract multi-scale feature information, but also can flexibly adjust the parameters of the convolution kernel according to different sizes of the input to-be-detected image, more effectively extract finer-grained multi-scale space information, establish a longer-distance channel dependency relationship, and improve the detection accuracy. And adaptively carrying out feature calibration on the multi-scale channel attention vector again.

Description

technical field [0001] The present application relates to the field of target detection, in particular to a multi-target detection method, device and application thereof. Background technique [0002] In recent years, object detection algorithms have made great breakthroughs. At present, the mainstream target detection algorithms are divided into two categories according to the algorithm stage: the first category is the two-stage target detection algorithm, such as the R-CNN algorithm R-CNN in the Region Proposal, Fast R-CNN, Faster R-CNN, etc., They need to generate the target candidate frame by the algorithm, that is, the target position in the image, and then classify and regress the target candidate frame; the second category is single-stage target detection algorithms, such as Yolo and SSD algorithms, which only Use a convolutional neural network CNN to directly predict the categories and locations of different objects. The first type of method is more accurate, but t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06V20/52G06V20/40G06V10/44G06V10/764G06V10/766G06V10/80G06V10/82G06N3/04G06N3/08G06K9/62
CPCG06N3/082G06N3/047G06N3/048G06N3/045G06F18/2415G06F18/253
Inventor 郁强张香伟毛云青金仁杰
Owner CITY CLOUD TECH HANGZHOU CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products