Deep neural network plug-in based on attention mechanism and image recognition method

A deep neural network and attention technology, applied in the field of deep neural network plug-ins and image recognition, can solve problems such as CNN recognition errors, recognition target interference, etc., and achieve the effect of improving recognition ability

Pending Publication Date: 2021-03-30
JINAN UNIVERSITY
View PDF0 Cites 2 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0003] 1. We know that the objects in the image usually only occupy a part of the entire space, or even a small part, and in many cases, there are a large number of background pixels in the image, and many of these pixels in the image have nothing to do with the recognition target, even Interfere with the recognition target; however, for CNN, all pixels in the image are equally weighted;
[0004] 2. During the recognition process, CNN can only propagate forward once
Therefore, if the perturbation in the adversarial example is effective in this process, it is likely to cause the CNN to misidentify

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deep neural network plug-in based on attention mechanism and image recognition method
  • Deep neural network plug-in based on attention mechanism and image recognition method
  • Deep neural network plug-in based on attention mechanism and image recognition method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0041] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0042] An attention mechanism-based deep neural network plug-in described in an embodiment of the present invention is composed of two layers of LSTMs of the same size and a CNN with a multi-layer structure. in,

[0043] One layer of LSTM is used to memorize contextual information and generate a mask image with salient features, and the other layer of LSTM is used to realize the function of "glance" and generate classification confidence, where "glance" means that it is equivalent to human The process of identifying objects with the eyes. For example, when seeing a person, you may not see clearly at...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a deep neural network plug-in based on an attention mechanism and an image recognition method. The plug-in is composed of two layers of LSTMs with the same size and a CNN of amulti-layer structure, wherein one layer of LSTM is used for memorizing context information and generating a mask image with remarkable characteristics, and the other layer of LSTM is used for realizing a glance function and generating classification confidence; the CNN of the multi-layer structure is used for down-sampling, extracting image features and transmitting context information to the LSTM unit. By using the CNN of the multi-layer structure and applying the attention mechanism to guide the CNN of the multi-layer structure to focus the key features of the target object in the image and remove the secondary features and the background pixels, the recognition capability is high, the target object is gradually recognized through multiple propagation, the main features of the object are gradually remembered, and the secondary features of the object are forgotten; and through multiple backward feedback, an error correction function is achieved.

Description

technical field [0001] The invention belongs to the technical field of artificial intelligence and image recognition, and in particular relates to an attention mechanism-based deep neural network plug-in and an image recognition method. Background technique [0002] At present, the deep neural network used for image recognition is mainly a convolutional neural network (CNN); however, the applicant found that the existing convolutional neural network (CNN) has the following defects in the image recognition process: [0003] 1. We know that the objects in the image usually only occupy a part of the entire space, or even a small part, and in many cases, there are a large number of background pixels in the image, and many of these pixels in the image have nothing to do with the recognition target, even Interfere with the recognition target; however, for CNN, all pixels in the image are equally weighted; [0004] 2. During the recognition process, CNN can only propagate forward ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/04G06N3/08G06K9/46G06K9/62
CPCG06N3/049G06N3/084G06V10/462G06N3/048G06N3/045G06F18/24
Inventor 李海良刘敏郭焕庄师强张明
Owner JINAN UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products