Breast cancer lymph node metastasis prediction method and prediction system based on gene spectrum

A technology of lymph node metastasis and prediction method, which is applied in the field of breast cancer lymph node metastasis prediction method and prediction system, which can solve problems such as overtreatment, achieve the effect of improving accuracy and avoiding overtreatment

Pending Publication Date: 2020-04-28
SHANDONG UNIV
View PDF0 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At the same time, without early and accurate indicators of

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Breast cancer lymph node metastasis prediction method and prediction system based on gene spectrum
  • Breast cancer lymph node metastasis prediction method and prediction system based on gene spectrum
  • Breast cancer lymph node metastasis prediction method and prediction system based on gene spectrum

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0044] A method for predicting lymph node metastasis of breast cancer based on gene profile. According to an embodiment of the present invention, a method for predicting lymph node metastasis of breast cancer based on the combination of gene expression profile and machine learning method, using gene expression profile and clinical data, finally determines and predicts breast cancer A model of lymph node metastasis.

[0045] figure 1 It is a flowchart of a method for predicting breast cancer lymph node metastasis based on gene expression profile according to an embodiment of the present invention.

[0046] like figure 1 As shown, (1) Enter the GEO platform to obtain RNA data and clinical data, enter the GEO (GeneExpression Omnibus) platform, select the GSE17705 data set, and download the file with the suffix .txt.gz, which contains the RNA data of 298 breast cancer samples And clinical data, from which four types of available information are extracted: gene name, normalized_r...

Embodiment 2

[0067] A prediction system for lymph node metastasis of breast cancer based on gene spectrum, including a data preprocessing module, a feature processing module, and a training verification module, the data preprocessing module is used to obtain a sample data set from the GEO platform, and pre-process the sample data Processing, the sample data set includes RNA data and clinical data, the preprocessing includes sample classification, data conversion, data standardization; the feature processing module is used to select differential genes in the data processed by the data preprocessing module, and A machine learning method is used to select gene features; the training and verification module includes at least two prediction models, and the training and verification module is used to input the difference gene as a feature into the prediction model with the highest prediction accuracy obtained through training.

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to a breast cancer lymph node metastasis prediction method and prediction system based on a gene spectrum, and belongs to the technical field of data model prediction, and the method comprises the following steps: (1) entering a GEO platform, selecting a data set, and obtaining samples including RNA data and clinical data; (2) performing data preprocessing: dividing the obtained samples into lymph node metastasis samples and lymph node metastasis-free samples; (3) performing feature selection: selecting differential genes with differences in the lymph node metastasis samples and the lymph node metastasis-free samples, and selecting gene features by using a machine learning method; (4) performing prediction: training prediction models through at least two methods, testing the accuracy of different prediction models, and selecting the model with the highest prediction accuracy obtained by taking the differential gene obtained in the step (3) as feature input. According to the method, the TCGA database is utilized, the feature selection method with high accuracy is designed, the prediction accuracy is further improved, and the prediction accuracy can reach 97%.

Description

technical field [0001] The invention relates to a breast cancer lymph node metastasis prediction method and prediction system based on gene expression profiles and machine learning methods, belonging to the technical field of data model prediction. Background technique [0002] According to the "Global Cancer Statistics 2018" by the National Cancer Research Center, breast cancer ranks first in the incidence and mortality of cancer among women worldwide. The main reason for the high mortality rate of cancer is the metastasis of cancer cells, so accurate determination of metastasis indicators in the early stage can effectively increase the survival rate of patients. [0003] In recent years, sequencing technology has developed rapidly and has been widely used in scientific research. The full name of GEO database is GENEEXPRESSION OMNIBUS, which is a gene expression database created and maintained by NCBI, the National Center for Biotechnology Information in the United States. ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G16B25/10G16B50/00
CPCG16B25/10G16B50/00Y02A90/10
Inventor 张海霞李云鹤袁东风
Owner SHANDONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products