Breast cancer data classification method based on single category

A data classification and breast cancer technology, applied in the field of pattern recognition, can solve the problems of difficulty in collecting negative samples of breast cancer data and high collection costs, and achieve the effects of reducing sample labeling costs, accurate classification, and stable results

Active Publication Date: 2019-09-06
NANJING UNIV OF SCI & TECH
View PDF4 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] Although the above methods can obtain better diagnostic and classification results, breast cancer data in real life have problems such as difficulty in collecting negative samples and expensive collection costs. The above methods are difficult to solve such problems and all include observation labels that do not depend on data strong assumption

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Breast cancer data classification method based on single category
  • Breast cancer data classification method based on single category
  • Breast cancer data classification method based on single category

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0046] In this embodiment, the Breast Cancer data set is used as an example of breast cancer data, and the data set is obtained by breast fine needle aspiration. In breast cancer data, 11 cytological characteristics of breast aspiration aspiration differ between benign and malignant samples. The characteristic attributes of breast cancer data include mass thickness, cell size uniformity, cell shape uniformity, and single epithelial cell size. , bare core, etc. In order to verify the robustness of the present invention, this embodiment sets three situations where the positive unlabeled rate is 20%, 40% and 60% respectively when constructing the positive sample set and the unlabeled sample set, namely: set the malignant to positive class sample(s i = 1), 20%, 40% and 60% of malignant samples and all benign samples are taken out to form an unlabeled sample set (s i =0). image 3 , Figure 4 and Figure 5 The curves of the classification accuracy of the training set and the t...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a breast cancer data classification method based on a single category. According to the breast cancer data classification method, breast cancer data classification is effectively carried out only by utilizing single-class sample data and unlabeled sample data; a probability graph model is adopted to describe the relation among the data characteristics, the observation tagsand the real labels, and a corresponding joint probability distribution model is given, and then an EM algorithm is used for solving the model, and then test data are classified. The breast cancer data set used in the breast cancer data classification method is only composed of a single-class sample and an unlabeled sample, so that the sample labeling cost is reduced, and the problem that negative-class sample data are lacked in breast cancer diagnosis is solved.

Description

technical field [0001] The invention belongs to pattern recognition technology, in particular to a single-category-based breast cancer data classification method. Background technique [0002] Existing breast cancer data classification methods are often based on supervised learning methods, which cannot fully meet the reality of breast cancer diagnosis. More and more scholars at home and abroad are paying attention to the research and application of intelligent diagnosis of breast cancer, and have done a lot of research in this area. The main methods in recent papers are: [0003] Olvi L.Mangasarian and others took the lead in using linear programming for breast cancer diagnosis in 1995; K.Polat et al. used the least squares support vector machine to maximize the classification interval to classify breast cancer data in 2007; Ayman M.Eldeib et al. People in 2016 employing deep belief networks for breast cancer data classification. [0004] In terms of patents, the Chinese...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06K9/62
CPCG06V2201/03G06F18/2415
Inventor 史红宫辰魏旸杨健
Owner NANJING UNIV OF SCI & TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products