Pulmonary nodule image classification method when uncertain data is contained in data set

A technology for determining data and data concentration, applied in the field of computer vision, can solve problems such as difficult training data, inconsistent labels, and heavy workload

Active Publication Date: 2019-09-10
沈阳铭然科技有限公司
View PDF7 Cites 34 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The traditional diagnosis of pulmonary nodules mainly relies on the observation of doctors and experts, so there are some disadvantages: the diagnostic results are subjective; the workload is heavy and time-consuming; human vision is limited; doctors in remote areas cannot be popularized
Because of the professionalism and particularity of medical image labeling, as well as moral and legal restrictions, it is difficult to obtain a large amount of training data
In general public datasets include the following problems: (i) the dataset is small in size and the class distribution is unbalanced; (ii) it contains noise and uncertain labels
Taking the LIDC-IDRI dataset as an example, lung CT images are marked by multiple experts, each expert marks a part of the data, and each data is also marked by multiple experts, although the workload is reduced and relatively good results can be given. Labeling results, but different experts have different levels, which will produce a lot of inconsistent labels; secondly, many of the data are judged as uncertain types of data and unlabeled data. The traditional processing method is to directly convert uncertain types of data into Discard, but this approach will lose a lot of valuable information

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Pulmonary nodule image classification method when uncertain data is contained in data set
  • Pulmonary nodule image classification method when uncertain data is contained in data set
  • Pulmonary nodule image classification method when uncertain data is contained in data set

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0044] The present invention will be further described below in conjunction with the accompanying drawings and specific embodiments.

[0045] Such as figure 1 As shown, it is a flow chart of the pulmonary nodule image classification method when the data set of the present invention contains uncertain data. The lung nodule image classification method when a data set of the present invention contains uncertain data is characterized in that it comprises the following steps:

[0046] Step 1: organize the data set: collect N CT images of lung nodules to form an image set I={I 1 , I 2 ,...,I n ,...,I N}, the lung nodules are divided into three types: benign, malignant and indeterminate; the expert method is used to classify each lung nodule CT image, and the nth lung nodule CT image I is obtained n The category is e n ; Each piece of pulmonary nodule CT image is preprocessed to obtain a pulmonary nodule CT image data set D; the preprocessing includes converting the pulmonary n...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the technical field of computer vision, and provides a pulmonary nodule image classification method when uncertain data is contained in a data set. The method comprises the following steps: firstly, collecting a pulmonary nodule CT image set, determining the category of the image through a majority voting principle by utilizing an expert voting method, and preprocessing toobtain a pulmonary nodule CT image data set; then, based on a knowledge distillation method, constructing a pulmonary nodule image classification model comprising a teacher model and a student model;next, obtaining a determined tag data set, training a teacher model on the determined tag data set, and calculating a soft tag on the pulmonary nodule CT image data set; then, training a student model on the data set combining the hard label and the soft label; and finally, inputting the preprocessed CT image to be classified into the trained lung nodule image classification model to obtain the category of the lung nodule image classification model. According to the method, the uncertain label data in the data set can be effectively utilized, the accuracy and efficiency of pulmonary nodule diagnosis are improved, and the usability and robustness are high.

Description

technical field [0001] The invention relates to the technical field of computer vision, in particular to a pulmonary nodule image classification method when uncertain data is contained in a data set. Background technique [0002] Lung cancer is one of the deadliest diseases in the world, accounting for approximately 26% of all cancer cases in 2017. Despite recent advances in diagnosis and treatment, the five-year cure rate for lung cancer is only 18%. It is worth noting that this rate can be greatly increased if patients are diagnosed early and properly treated. Low-dose computed tomography (CT) has been widely used in lung cancer detection. Compared with other imaging techniques, CT can display low-contrast nodules, which has more advantages. Low-dose CT scans can reduce lung cancer deaths by 20 percent, according to the National Lung Screening report. The traditional diagnosis of pulmonary nodules mainly relies on the observation of doctors and experts, so there are som...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06T7/00G06K9/62
CPCG06T7/0012G06T2207/10081G06T2207/30064G06V2201/03G06F18/214
Inventor 毛克明王新琦常辉东李佳明李翰鹏
Owner 沈阳铭然科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products