Multi-modal medical image multi-organ positioning method based on one-to-one target query Transform

What is AI technical title?
AI technical title is built by Patsnap AI team. It summarizes the technical point description of the patent document.
A multi-modal image and medical image technology, applied in the field of medical image processing, can solve the problems of high training time requirements, reduced network depth, complex network structure, etc., to simplify the network structure, reduce redundant calculations, and reduce calculation burden Effect

Pending Publication Date: 2022-04-15

DALIAN UNIV OF TECH

View PDF0 Cites 6 Cited by

Summary
Abstract
Description
Claims
Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology

Problems solved by technology

However, the Transformer network structure is complex and requires a lot of training time; and it needs a large amount of training data to perform well, but it is a time-consuming and laborious task for doctors to manually mark a large number of 3D medical images; the existing 3D Transformer is only through Reduce network depth to maintain a balance between accuracy and learnability

Furthermore, none of the current Transformer models use information from multimodal medical images to improve the accuracy of organ localization

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Image

Smart Image Click on the blue labels to locate them in the text.

Viewing Examples

Smart Image

Examples

Experimental program

Comparison scheme

Effect test

Embodiment Construction

[0063] The multi-modal medical image multi-organ localization method based on one-to-one target query Transformer proposed by the present invention is as follows: figure 1 As shown, the purpose of the present invention is to determine if figure 2 The three-dimensional bounding boxes of the multiple organs of the human body shown will be further described below in conjunction with specific embodiments of the present invention.

[0064] S1. Two-dimensional projection and image fusion of three-dimensional multi-modal images

[0065] S11. Data preprocessing

[0066] First, select the finest voxel size in multiple modality images to resample all modalities; then center crop the resampled images so that the spatial resolution and image size in the dataset are consistent.

[0067] S12. Two-dimensional projection and grayscale normalization of multimodal images

[0068] In order to reduce the amount of calculation and reduce the difficulty of positioning, each modality is project...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

PUM

Login to View More

Abstract

The invention provides a multi-modal medical image multi-organ positioning method based on a one-to-one target query Transform, and belongs to the technical field of medical image processing. According to the method, the correlation between positions and sizes of organs is simulated by utilizing a conditional Gaussian model and a self-attention mechanism of Transform. The one-to-one target query architecture compulsorily executes a unique target query on each target organ, and the query sequence is the predicted category, so that classification is not needed, the network structure is simplified, redundant calculation is reduced, and the learning convergence speed is higher. According to the method, a 3D multi-modal image is projected to two orthogonal 2D planes before organ detection is executed, then complementary information from the multi-modal image is combined through a multi-modal fusion method, and finally, an obtained 2D bounding box is subjected to back projection to obtain a 3D bounding box, so that the calculation burden is reduced, and a more stable organ positioning result is obtained.

Description

technical field [0001] The invention belongs to the technical field of medical image processing, and specifically relates to a multi-modal medical image multi-organ positioning method based on a one-to-one target query Transformer, which mainly utilizes the geometric correlation between organs and the information complementarity of multi-modal medical images. Efficient and accurate automatic organ positioning. Background technique [0002] Multimodal medical images are common imaging modes in clinical medicine, such as Positron Emission Computed Tomography / Computed Tomography (PET / CT), T1, T2, and proton in MRI scans. Density maps, multiple spectral images in spectral CT, etc. Taking PET / CT images as an example, PET images are functional images that reflect the distribution of radioactive tracers in the body, and can be used for benign and malignant tumor diagnosis and tissue metabolism quantification; CT images reflect the absorption of X-rays by human organs and tissues ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine

Login to View More

Application Information

Patent Timeline

Login to View More

Patent Type & AuthorityApplications(China)

IPC IPC(8): G06V10/77G06V10/40G06V10/82G06T3/00G06T5/50G06N3/04G06N3/08

CPCG06T5/50G06N3/08G06T2207/10088G06T2207/10104G06T2207/20081G06T2207/20084G06T2207/20221G06N3/045G06F18/2135G06T3/06

Inventor王洪凯刘林琳

OwnerDALIAN UNIV OF TECH

Multi-modal medical image multi-organ positioning method based on one-to-one target query Transform

AI Technical Summary This helps you quickly interpret patents by identifying the three key elements: Problems solved by technologyMethod usedBenefits of technology

Problems solved by technology

Method used

Image

Examples

Embodiment Construction

PUM

Abstract

Description

Claims

Application Information

AI Technical Summary
This helps you quickly interpret patents by identifying the three key elements:
Problems solved by technology
Method used
Benefits of technology