Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Template reconstruction method based on self-attention mechanism

An attention and mechanism technology, applied in neural learning methods, computer components, character and pattern recognition, etc., can solve problems such as hindering the transfer of reconstruction methods, inability to eradicate influence, and poor interpretability of convolutional neural networks.

Active Publication Date: 2020-07-07
CHENGDU UNIV OF INFORMATION TECH
View PDF5 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] 1. In the deep self-encoding network, the low-dimensional features extracted by the encoding network generally cannot contain all the detailed features that are conducive to reconstruction, so the reconstructed image often presents varying degrees of blur, which leads to false positives in detection
Although methods such as Generative Adversarial Networks (GAN) have been proposed in recent years to alleviate the blurring problem, their principles are based on statistical information from other training images rather than the image to be reconstructed itself, so it cannot completely solve the problem of false positives
[0007] 2. For the decoding network, the operation of transposed convolution (transposed-conv / deconv) is usually used for feature upsampling. Since transposed convolution is a pathological problem, checkerboard artifacts often occur in the final reconstruction result. , thus affecting the detection results
Although methods such as subpixel convolution (subpixel conv) have been proposed to alleviate this phenomenon, they still cannot eradicate its influence.
[0008] 3. Due to the poor interpretability of the convolutional neural network itself, it is difficult to feed back from the observation of the reconstruction effect to the design of the deep autoencoder network. The selection of hyperparameters such as low-dimensional feature dimensions and the number of convolution kernels is usually Based on empirical settings, it is difficult to achieve the optimal balance in effect and efficiency, and it also hinders the transfer of reconstruction methods between different scene problems

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Template reconstruction method based on self-attention mechanism
  • Template reconstruction method based on self-attention mechanism

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0029] A template reconstruction method based on the self-attention mechanism. First, the input image is down-sampled layer by layer to obtain the down-sampled image, and the features of the down-sampled image are extracted, and then the features of each level are searched by the self-attention mechanism. Table mapping obtains the residuals corresponding to each level, and finally upsamples the downsampled image level by level, and fuses the residual information of each level to generate an upsampled image corresponding to each level, and finally generates an upsampled image as the reconstructed template image.

Embodiment 2

[0031] The present invention is on the basis of above-mentioned embodiment 1, as figure 1 As shown, taking a network that has been downsampled four times as an example, the template reconstruction network presents a U-shaped structure as a whole: the left wing starts from the input image and downsamples step by step, while the right wing starts from the smallest downsampled image and successively downsamples Level upsampling; in the process of downsampling on the left wing, the downward thin arrow indicates simple image downsampling calculations, such as nearest neighbor calculation, bilinear calculation, bicubic interpolation calculation, etc.; while in the process of upsampling on the right wing, the direction is upward The thin arrow of represents image upsampling calculation; such as figure 1 As shown, the input image is down-sampled for the first time to generate a 1 / 2 down-sampled image, and the 1 / 2 down-sampled image is down-sampled for the second time to generate a 1 / 4...

Embodiment 3

[0041] This embodiment provides a template reconstruction and detection device based on the self-attention mechanism, based on the above method, such as figure 2 As shown, the device includes a training module and a detection module connected to each other; the training module includes a sequentially connected training data set and a reconstruction model module; the detection module includes a sequentially connected sensor, an input image module, and a differential detection module; Described detection module also comprises the reconstruction template module that is connected with input image module, sealing up detection module; Described training module, detection module are connected by reconstruction model module, reconstruction template module;

[0042] The training module is an external server of the device, and is used to learn and obtain a reconstruction model through a series of training sets. The detection module is the main part of the device. The thin arrow in the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention provides a template reconstruction method based on a self-attention mechanism. The method comprises the steps of firstly, performing layer-by-layer down-sampling on an input image to obtain a down-sampled image; performing feature extraction on the down-sampled image; then respectively performing table look-up mapping on features of each layer by adopting a self-attention mechanism to obtain a residual error corresponding to each layer, finally, performing layer-by-layer up-sampling on the downsampled image, and meanwhile, fusing residual error information of each layer to generate an up-sampled image corresponding to each layer, thereby finally generating an up-sampled image, namely a reconstructed template image.

Description

technical field [0001] The invention belongs to the field of machine vision detection, and in particular relates to a template reconstruction method based on a self-attention mechanism. Background technique [0002] The defect detection method based on template comparison is widely used in the field of machine vision detection because of its advantages of fast speed and high precision. In the defect detection process based on template comparison, in order to improve the detection rate and reduce the false detection rate, it is often necessary to perform template reconstruction preprocessing operations, so as to eliminate the individual differences of the objects to be detected. Generally speaking, template reconstruction can be divided into rigid transformation methods and non-rigid transformation methods. Non-rigid transformation methods have received more and more attention in recent years because of their wider applicable scenarios. [0003] The non-rigid transformation ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G06N3/04G06N3/08G06K9/62
CPCG06N3/08G06V10/751G06N3/045
Inventor 贾可郭昕袁超何尚杰刘海龙
Owner CHENGDU UNIV OF INFORMATION TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products