Multi-modal medical image multi-organ positioning method based on one-to-one target query Transform

A multi-modal image and medical image technology, applied in the field of medical image processing, can solve the problems of high training time requirements, reduced network depth, complex network structure, etc., to simplify the network structure, reduce redundant calculations, and reduce calculation burden Effect

Pending Publication Date: 2022-04-15
DALIAN UNIV OF TECH
View PDF0 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the Transformer network structure is complex and requires a lot of training time; and it needs a large amount of training data to perform well, but it is a time-consuming and laborious task for doctors to manually mark a large number of 3D medical images; the existing 3D Transformer is only through Reduce network depth to maintain a balance between accuracy and learnability
Furthermore, none of the current Transformer models use information from multimodal medical images to improve the accuracy of organ localization

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Multi-modal medical image multi-organ positioning method based on one-to-one target query Transform
  • Multi-modal medical image multi-organ positioning method based on one-to-one target query Transform
  • Multi-modal medical image multi-organ positioning method based on one-to-one target query Transform

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0063] The multi-modal medical image multi-organ localization method based on one-to-one target query Transformer proposed by the present invention is as follows: figure 1 As shown, the purpose of the present invention is to determine if figure 2 The three-dimensional bounding boxes of the multiple organs of the human body shown will be further described below in conjunction with specific embodiments of the present invention.

[0064] S1. Two-dimensional projection and image fusion of three-dimensional multi-modal images

[0065] S11. Data preprocessing

[0066] First, select the finest voxel size in multiple modality images to resample all modalities; then center crop the resampled images so that the spatial resolution and image size in the dataset are consistent.

[0067] S12. Two-dimensional projection and grayscale normalization of multimodal images

[0068] In order to reduce the amount of calculation and reduce the difficulty of positioning, each modality is project...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention provides a multi-modal medical image multi-organ positioning method based on a one-to-one target query Transform, and belongs to the technical field of medical image processing. According to the method, the correlation between positions and sizes of organs is simulated by utilizing a conditional Gaussian model and a self-attention mechanism of Transform. The one-to-one target query architecture compulsorily executes a unique target query on each target organ, and the query sequence is the predicted category, so that classification is not needed, the network structure is simplified, redundant calculation is reduced, and the learning convergence speed is higher. According to the method, a 3D multi-modal image is projected to two orthogonal 2D planes before organ detection is executed, then complementary information from the multi-modal image is combined through a multi-modal fusion method, and finally, an obtained 2D bounding box is subjected to back projection to obtain a 3D bounding box, so that the calculation burden is reduced, and a more stable organ positioning result is obtained.

Description

technical field [0001] The invention belongs to the technical field of medical image processing, and specifically relates to a multi-modal medical image multi-organ positioning method based on a one-to-one target query Transformer, which mainly utilizes the geometric correlation between organs and the information complementarity of multi-modal medical images. Efficient and accurate automatic organ positioning. Background technique [0002] Multimodal medical images are common imaging modes in clinical medicine, such as Positron Emission Computed Tomography / Computed Tomography (PET / CT), T1, T2, and proton in MRI scans. Density maps, multiple spectral images in spectral CT, etc. Taking PET / CT images as an example, PET images are functional images that reflect the distribution of radioactive tracers in the body, and can be used for benign and malignant tumor diagnosis and tissue metabolism quantification; CT images reflect the absorption of X-rays by human organs and tissues ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06V10/77G06V10/40G06V10/82G06T3/00G06T5/50G06N3/04G06N3/08
CPCG06T3/0031G06T5/50G06N3/08G06T2207/10088G06T2207/10104G06T2207/20081G06T2207/20084G06T2207/20221G06N3/045G06F18/2135
Inventor 王洪凯刘林琳
Owner DALIAN UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products