Deep learning method for predicting prognosis risk of cancer patient based on multi-omics data

A technology of omics data and deep learning, applied in machine learning, medical informatics, informatics, etc., can solve problems such as inability to solve target data sets and low accuracy

Pending Publication Date: 2021-05-18
SUN YAT SEN UNIV
View PDF0 Cites 4 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0005] The present invention provides a deep learning method for predicting the prognosis risk of cancer patients based on multi-omics data in order to overcome the above-mentioned defects that the accuracy rate of prognosis risk prediction is not high and the target data set cannot be solved.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Deep learning method for predicting prognosis risk of cancer patient based on multi-omics data
  • Deep learning method for predicting prognosis risk of cancer patient based on multi-omics data
  • Deep learning method for predicting prognosis risk of cancer patient based on multi-omics data

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0044] Such as figure 1 As shown, a deep learning method for predicting the prognosis risk of cancer patients based on multi-omics data is used to predict the prognosis risk of cancer patients, including the following steps:

[0045] S1: Obtain clinical data Y and corresponding multi-omics expression data X of target cancer patients from existing public data sets (such as TCGA, GEO);

[0046] In a specific embodiment, 14 TCGA data sets (BRCA, CESC, COAD, ESCA, HNSC, KIRC, LGG, LIHC, LUAD, LUSC, MESO, PAAD, SRAC and SKCM) were used for pre-training, and bladder cancer ( BLCA) data as the target cancer.

[0047] Among them, multi-omics data includes mRNA expression, miRNA expression, DNA methylation information and copy number variation information of bladder cancer patients. mRNA data is RNA sequencing data generated by UNC Illumina HiSeq_RNASeq V2. miRNA is the miRNA sequencing data obtained by BCGSC Illumina HiSeq miRNASeq. DNA methylation data was generated by USCHumanMe...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a deep learning method for predicting the prognosis risk of a cancer patient based on multi-omics data, which is used for predicting the prognosis risk of the cancer patient and comprises the following steps: S1, acquiring clinical data Y of a target cancer patient and corresponding multi-omics expression data X from an existing public data set; S2, constructing a deep neural network; S3, updating the weight theta of the cancer multi-omics data Xp and the clinical information Yp of the patient of the existing common data set through the constructed deep neural network to obtain a pre-training network Np based on the common data set; S4, training the network Np again until the training frequency epoch reaches the operation upper limit, thereby obtaining a risk prediction network Nf; and S5, selecting the first n gene features of the Importance coefficient of the target cancer patient by using an XGboost algorithm, and improving the risk prediction network Nf to obtain a final risk prediction model. According to the method, the robustness of the prediction model is improved, and the prognosis risk of the cancer patient is predicted more accurately by using multi-omics data.

Description

technical field [0001] The present invention relates to the technical field of cancer patient survival analysis, and more specifically, relates to a deep learning method for predicting the prognosis risk of cancer patients based on multi-omics data. Background technique [0002] In recent years, the high incidence of cancer has promoted the development of medical auxiliary technology. Prognostic risk analysis is a key medical auxiliary technology, which can assist in the selection of different treatment options according to the potential risk of prognosis of different patients. [0003] Most of the methods for predicting cancer prognosis are realized by analyzing the expression data of a single group, such as using gene mRNA expression data, methylation data, or miRNA data, etc. Moreover, there are strong complementary effects and interactions between molecules at different levels, so the results of single-omics data analysis often can only provide one-sided information. In...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G16H50/20G16H50/80G06N20/00
CPCG16H50/20G16H50/80G06N20/00
Inventor 杨跃东柴华张仲岳周翔
Owner SUN YAT SEN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products