Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A multi-scale target detection method fusing context information

A target detection and context technology, applied in the fields of deep learning and computer vision, can solve the problems of large time scale and inability to fuse context information, and achieve strong processing capabilities, simple and easy fusion methods, and improved accuracy

Active Publication Date: 2019-05-28
NANJING UNIV OF POSTS & TELECOMM
View PDF3 Cites 29 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Aiming at the problem that the time scale of the target detection technology in the prior art is large and the context information cannot be integrated into the final use features of the target detection, the present invention proposes a multi-scale target detection method that fuses the context information; this method can fuse the context information In the final classification feature, the cost of multi-scale feature fusion can be reduced at the same time, which can not only improve the detection accuracy of small-scale targets, but also accurately detect targets under complex backgrounds such as target occlusion. The specific technical solutions are as follows:

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A multi-scale target detection method fusing context information
  • A multi-scale target detection method fusing context information
  • A multi-scale target detection method fusing context information

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] In order to enable those skilled in the art to better understand the solutions of the present invention, the technical solutions in the embodiments of the present invention will be clearly and completely described below in conjunction with the drawings in the embodiments of the present invention.

[0033] In the embodiment of the present invention, a multi-scale target detection method that fuses context information is provided. The method utilizes a deep residual convolutional neural network to extract features from an input image, and saves the last three in the deep residual convolutional neural network. The convolutional features output by the layer are extracted through the last layer of the deep residual convolutional neural network combined with the RPN network to obtain the candidate frame set of the foreground of the input image, and the final candidate frame set is obtained by screening through the improved non-maximum value suppression method. And use the LSTM...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a multi-scale target detection method fusing context information, and the method comprises the steps of extracting the characteristics of an input image through employing a deep residual convolutional neural network, and obtaining a candidate box set which corresponds to the input image and is used for target detection through employing an RPN network and an improved non-maximum suppression method; for each candidate box, extracting to obtain convolutional features output by the deep residual convolutional neural network, and extracting the convolutional features outputted by the last convolutional layer of the deep residual convolutional neural network in four directions of upper, lower, left and right twice by adopting an LSTM method to obtain context feature information; performing regularization and splicing operation on the context information and the convolutional features to obtain multi-scale features fused with the context information; converting the multi-scale features into high-dimensional feature vectors by using a full connection layer, and carrying out target classification and border position detection by using a classification layer and a regression layer. The method has the advantages of being high in precision, good in robustness and strong in adaptability for target detection.

Description

technical field [0001] The invention belongs to the technical field of deep learning and computer vision, and in particular relates to a multi-scale target detection method for fusing context information. Background technique [0002] Object detection is an important branch in the field of computer vision. Target detection is widely used. For example, in image recognition, face detection, and artificial intelligence models, target detection technology is used to quickly and accurately identify targets from pictures. The traditional target detection technology uses the sliding window method to intercept image fragments of the same scale, and then extracts features from these fragments, then classifies and regresses image features, and finally obtains them through the non-maximum suppression method (Non-Maximum Suppression, NMS). The position coordinates of the rectangle. This traditional target detection method has relatively low accuracy due to manual feature extraction. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
Inventor 宫婧许必宵孙知信
Owner NANJING UNIV OF POSTS & TELECOMM
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products