Template reconstruction method based on self-attention mechanism

What is Al technical title?
Al technical title is built by PatSnap Al team. It summarizes the technical point description of the patent document.
An attention and mechanism technology, applied in neural learning methods, computer components, character and pattern recognition, etc., can solve problems such as hindering the transfer of reconstruction methods, inability to eradicate influence, and poor interpretability of convolutional neural networks.

Active Publication Date: 2020-07-07

CHENGDU UNIV OF INFORMATION TECH

View PDF5 Cites 1 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

[0006] 1. In the deep self-encoding network, the low-dimensional features extracted by the encoding network generally cannot contain all the detailed features that are conducive to reconstruction, so the reconstructed image often presents varying degrees of blur, which leads to false positives in detection

Although methods such as Generative Adversarial Networks (GAN) have been proposed in recent years to alleviate the blurring problem, their principles are based on statistical information from other training images rather than the image to be reconstructed itself, so it cannot completely solve the problem of false positives

[0007] 2. For the decoding network, the operation of transposed convolution (transposed-conv / deconv) is usually used for feature upsampling. Since transposed convolution is a pathological problem, checkerboard artifacts often occur in the final reconstruction result. , thus affecting the detection results

Although methods such as subpixel convolution (subpixel conv) have been proposed to alleviate this phenomenon, they still cannot eradicate its influence.

[0008] 3. Due to the poor interpretability of the convolutional neural network itself, it is difficult to feed back from the observation of the reconstruction effect to the design of the deep autoencoder network. The selection of hyperparameters such as low-dimensional feature dimensions and the number of convolution kernels is usually Based on empirical settings, it is difficult to achieve the optimal balance in effect and efficiency, and it also hinders the transfer of reconstruction methods between different scene problems

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment 1

[0029] A template reconstruction method based on the self-attention mechanism. First, the input image is down-sampled layer by layer to obtain the down-sampled image, and the features of the down-sampled image are extracted, and then the features of each level are searched by the self-attention mechanism. Table mapping obtains the residuals corresponding to each level, and finally upsamples the downsampled image level by level, and fuses the residual information of each level to generate an upsampled image corresponding to each level, and finally generates an upsampled image as the reconstructed template image.

Embodiment 2

[0031] The present invention is on the basis of above-mentioned embodiment 1, as figure 1 As shown, taking a network that has been downsampled four times as an example, the template reconstruction network presents a U-shaped structure as a whole: the left wing starts from the input image and downsamples step by step, while the right wing starts from the smallest downsampled image and successively downsamples Level upsampling; in the process of downsampling on the left wing, the downward thin arrow indicates simple image downsampling calculations, such as nearest neighbor calculation, bilinear calculation, bicubic interpolation calculation, etc.; while in the process of upsampling on the right wing, the direction is upward The thin arrow of represents image upsampling calculation; such as figure 1 As shown, the input image is down-sampled for the first time to generate a 1 / 2 down-sampled image, and the 1 / 2 down-sampled image is down-sampled for the second time to generate a 1 / 4...

Embodiment 3

[0041] This embodiment provides a template reconstruction and detection device based on the self-attention mechanism, based on the above method, such as figure 2 As shown, the device includes a training module and a detection module connected to each other; the training module includes a sequentially connected training data set and a reconstruction model module; the detection module includes a sequentially connected sensor, an input image module, and a differential detection module; Described detection module also comprises the reconstruction template module that is connected with input image module, sealing up detection module; Described training module, detection module are connected by reconstruction model module, reconstruction template module;

[0042] The training module is an external server of the device, and is used to learn and obtain a reconstruction model through a series of training sets. The detection module is the main part of the device. The thin arrow in the ...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a template reconstruction method based on a self-attention mechanism. The method comprises the steps of firstly, performing layer-by-layer down-sampling on an input image to obtain a down-sampled image; performing feature extraction on the down-sampled image; then respectively performing table look-up mapping on features of each layer by adopting a self-attention mechanism to obtain a residual error corresponding to each layer, finally, performing layer-by-layer up-sampling on the downsampled image, and meanwhile, fusing residual error information of each layer to generate an up-sampled image corresponding to each layer, thereby finally generating an up-sampled image, namely a reconstructed template image.

Description

technical field [0001] The invention belongs to the field of machine vision detection, and in particular relates to a template reconstruction method based on a self-attention mechanism. Background technique [0002] The defect detection method based on template comparison is widely used in the field of machine vision detection because of its advantages of fast speed and high precision. In the defect detection process based on template comparison, in order to improve the detection rate and reduce the false detection rate, it is often necessary to perform template reconstruction preprocessing operations, so as to eliminate the individual differences of the objects to be detected. Generally speaking, template reconstruction can be divided into rigid transformation methods and non-rigid transformation methods. Non-rigid transformation methods have received more and more attention in recent years because of their wider applicable scenarios. [0003] The non-rigid transformation ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

IPC IPC(8): G06N3/04G06N3/08G06K9/62

CPCG06N3/08G06V10/751G06N3/045

Inventor 贾可郭昕袁超何尚杰刘海龙

Owner CHENGDU UNIV OF INFORMATION TECH

Features

R&D
Intellectual Property
Life Sciences
Materials
Tech Scout

Why Patsnap Eureka

Unparalleled Data Quality
Higher Quality Content
60% Fewer Hallucinations

Social media

Patsnap Eureka Blog

Learn More

Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.

Template reconstruction method based on self-attention mechanism

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment 1

Embodiment 2

Embodiment 3

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology