Supercharge Your Innovation With Domain-Expert AI Agents!

Target detection method based on selectable expansion convolution kernel size

A target detection and convolution kernel technology, which is applied to instruments, character and pattern recognition, computer components, etc., can solve problems such as increasing computational complexity and difficulty in achieving real-time

Pending Publication Date: 2022-02-11
SICHUAN UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The two-stage method divides the detection problem into two processes. First, region proposals are generated, and then candidate regions are classified and bounding box regression is performed. The beginning of this type of algorithm is the R-CNN algorithm proposed in 2016, but due to the two-stage A large number of borders will be generated, which greatly increases the computational complexity, so it is difficult to achieve real-time; while the single-stage detector adopts the idea based on regression, abandons the region proposal stage, and predicts the category probability and position coordinates of the object through the anchor point (Anchor) , the final detection result can be obtained through end-to-end learning, in which the abandonment of the region proposal stage can greatly reduce the computational complexity, so it can achieve real-time when the input resolution is appropriate. The representative algorithms are: YOLOv3, DSSD, RfineDet wait

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Target detection method based on selectable expansion convolution kernel size
  • Target detection method based on selectable expansion convolution kernel size
  • Target detection method based on selectable expansion convolution kernel size

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0013] The present invention will be further described below in conjunction with accompanying drawing:

[0014] The method of constructing an optional dilated convolution module is as follows:

[0015] A series of feature layers, pooling layers and activation layers are obtained through the Darknet-53 network, and P in Darknet-53 5 , P 4 The feature layers are sequentially introduced into SDCM, and sent to the corresponding feature layers in the upsampling stage for weighted fusion.

[0016] The network consists of a convolution function (conv), an activation function (leakyrelu) and an average pooling function (avgpool). Given an input F, F obtains an output F' after selecting an expansion coefficient formula, and F' is first obtained through a channel attention mechanism f C , and then the spatial attention mechanism gets the final output F".

[0017] F'=w×conv 1(F)+(1-w)×conv 2(F) (1)

[0018] In the formula, the convolution coefficient of the conv1 function is 1, the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a target detection method based on a selectable expansion convolution kernel size, and relates to the field of computer vision and artificial intelligence. The method includes: firstly, extracting features through a convolutional neural network, and performing feature fusion through a feature pyramid; then, introducing a selectable expansion coefficient convolution module into a corresponding layer of the feature spectrum obtained through fusion, and obtaining better features through the module; and finally, carrying out multi-classification and bounding box regression on the fused feature layer, and continuously training an iteration model to obtain a target detection result after multi-scale fusion. According to the method, the precision is effectively improved, the real-time performance can be kept under an input picture with a certain size, and the method can be applied to places such as machine vision, face recognition, automatic driving, intelligent video and medical detection.

Description

technical field [0001] The invention relates to a target detection method based on a selectable dilated convolution kernel size, which belongs to the technical field of computer vision and intelligent information. Background technique [0002] Object detection is fundamental to computer vision tasks and to many applications in artificial intelligence. For target detection, its definition is as follows: Given an input RGB image, target detection completes two tasks: detection and recognition, that is, knowing what category the object belongs to and finding out where the object is located in the picture. Wherein, the category may be common species in nature, such as human, poultry, vehicle, etc., and a bounding box is used for positioning. Object detection has a wide range of applications in face recognition, automatic driving, human-computer interaction, content-based image retrieval, intelligent video surveillance, etc. [0003] Existing detectors are mainly divided into t...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06V10/80G06V10/774
CPCG06F18/253G06F18/254G06F18/214
Inventor 何小海熊书琪吴晓红陈洪刚卿粼波滕奇志
Owner SICHUAN UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More