Small target detection method based on multi-scale images and weighted fusion loss

A small target detection and weighted fusion technology, applied in character and pattern recognition, instruments, biological neural network models, etc., can solve the problems of resolution uncertainty, inability to take care of targets, and single input image scale.

Active Publication Date: 2020-07-28
SOUTH CHINA UNIV OF TECH
View PDF3 Cites 33 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] The difficulty of small target detection lies in the scale. Inputting the network model with a single size to extract feature information often cannot take care of targets of various scales.
Although the existing Mask RCN

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Small target detection method based on multi-scale images and weighted fusion loss
  • Small target detection method based on multi-scale images and weighted fusion loss
  • Small target detection method based on multi-scale images and weighted fusion loss

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0080] The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.

[0081] A small target detection method based on multi-scale images and weighted fusion loss, based on the improved Mask RCNN model, including:

[0082] S1. Build an improved Mask RCNN model.

[0083] In a preferred embodiment, the improved Mask RCNN model includes a backbone network part, a candidate window generation part and a classification layer part, which are built using the keras platform, including: a residual backbone network, a feature pyramid network layer, a region proposal layer, an interest Box alignment layer, classifier layer, loss function calculation layer, test layer. Compared with the original Mask RCNN, the improvements include:

[0084] ①. The alignment of the region of interest is no longer uniformly aligned, but the different feature layers are aligned separately. After the alignment, the input loss function is not dir...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention belongs to the field of image and video processing, and relates to a small target detection method based on multi-scale images and weighted fusion loss, and the method comprises the steps: extracting a plurality of groups of feature vectors from a plurality of different-scale images based on an improved Mask RCNN model, carrying out the fusion of the plurality of groups of feature vectors, and constructing a feature pyramid; generating a candidate detection box based on the feature pyramid and screening to obtain a suggested detection box; correspondingly returning the suggesteddetection boxes to the feature pyramid to generate feature maps of the suggested detection boxes, and performing aligned interception on the feature maps; inputting the aligned suggested detection boxes into a classifier layer to obtain category confidence coefficients and position offsets of the suggested detection boxes; in the test stage, screening a certain suggested detection box according tothe category confidence score of the suggested detection box, and performing non-maximum suppression; in the training stage, weighting the loss function calculated by detecting the small target feature layer and fusing with the loss function of detecting the large target layer and the middle target layer, thus the sensitivity of the model to the small target object is enhanced.

Description

technical field [0001] The invention belongs to the field of image and video processing, and relates to a small target detection method based on multi-scale images and weighted fusion loss. Background technique [0002] With the development of machine learning and deep learning, relying on the powerful learning ability of convolutional neural network, the fields of pattern recognition and computer vision have received unprecedented attention and enthusiasm. In the era of machine automation and artificial intelligence, the role of cameras is increasingly equal to that of human eyes. The development of computer vision is particularly important, and has attracted extensive attention from industry and academia. Among them, target detection has achieved remarkable results in the field of computer vision and is constantly improving. However, most of the target objects in pictures and videos appear in extremely tiny forms. Usually there are a lot of objects in a frame of pictures...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G06K9/32G06K9/34G06K9/62G06N3/04
CPCG06V10/25G06V10/267G06V2201/07G06N3/045G06F18/24G06F18/253Y02T10/40
Inventor 林坤阳罗家祥
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products