Method for identifying cancer driver gene by using consensus prediction result

A technology for driving genes and predicting results, applied in genomics, instrumentation, proteomics, etc., can solve the problems that the integrated results need to be improved and will not give clone growth advantages

Pending Publication Date: 2021-11-05
上海基绪康生物科技有限公司
View PDF0 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0002] Next-generation sequencing technologies have identified millions of somatic mutations in human cancer cells, however, one of the major challenges in interpreting cancer genomes is how to efficiently distinguish driver mutations from guest mutations, under specific microenvironmental conditions in vivo, Driver mutations are causally linked to oncogenes and are actively selected along the cancer developmental lineage, whereas guest mutations do not confer a clonal growth advantage and are therefore irrelevant to tumor development. To address this issue, a variety of unique-based Hypotheses and Strategies to Identify Driver Gene Approaches
[0003] Several studies have been reported to benchmark these methods using consensus cancer driver genes derived from individual models, and Collin et al. proposed an evaluation framework based on the inclusion of precision, agreement, and mean log-fold change (MLFC ) to benchmark several existing models, Matan et al. also benchmarked available methods by using measures such as precision and recall, Edward et al. Dividing driver gene calling methods into four subtypes, Denis et al. provided the most comprehensive benchmark of 21 driver gene prediction methods and proposed a Borda-based integration method ConsensusDriver, but in existing methods for identifying driver genes The reliability of the integrated results needs to be improved, so a method for identifying cancer driver genes using consensus prediction results is proposed to solve the above problems

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for identifying cancer driver gene by using consensus prediction result
  • Method for identifying cancer driver gene by using consensus prediction result
  • Method for identifying cancer driver gene by using consensus prediction result

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0043] The technical solutions in the embodiments of the present invention will be clearly and completely described below with reference to the embodiments of the present invention. Obviously, the described embodiments are only a part of the embodiments of the present invention, rather than all the embodiments. Based on the embodiments of the present invention, all other embodiments obtained by those of ordinary skill in the art without creative efforts shall fall within the protection scope of the present invention.

[0044] The present invention designs a web server-based consensus cancer driver gene calling program platform to derive consensus mutation calling results, using six state-of-the-art technologies and complementary prediction strategies, and provides an efficient integrated strategy to achieve robust Consensus results were derived using rank order aggregation" (RRA) and statistical model-based intersection visualization, and consensus mutation calling results were...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for identifying a cancer driver gene by using a consensus prediction result. The method comprises the following steps: S1, receiving a mutation annotation format (MAF) file as input; S2, processing all the preprocessed input mutation data so as to respectively obtain a candidate driver gene list of each strategy; S3, on the basis of each differential driving gene list, obtaining a common driving gene list by using a rank integration method RobustRankAggreg; S4, evaluating the performance of the result by using Top-N-Precision and Top-N-nDCG, and carrying out KEGG pathway and gene ontology analysis on the common driver gene; S5, obtaining a consensus driving gene list by using an RAA algorithm; S6, employing the SuperExactTest and the Circos for organizing a visualization result. The method has certain superiority in driver gene prediction, although height difference exists between different driver gene identification strategies, not only can the most reliable driver gene be identified through cross analysis of results of each independent strategy, but also potential novel driver genes with undefined features can be found.

Description

technical field [0001] The invention relates to the technical field of cancer driver gene identification, in particular to a method for identifying cancer driver genes by using consensus prediction results. Background technique [0002] Next-generation sequencing technologies have identified millions of somatic mutations in human cancer cells, however, one of the major challenges in interpreting cancer genomes is how to efficiently distinguish driver mutations from guest mutations, under specific microenvironmental conditions in vivo, Driver mutations are causally related to oncogenes and are actively selected along the cancer developmental lineage, whereas guest mutations do not confer clonal growth advantages and are therefore not associated with tumor development. To address this issue, a variety of models based on unique Hypotheses and strategies to identify driver genes. [0003] Several studies have been reported to benchmark these approaches using consensus cancer dr...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G16B20/50G16B45/00G16B40/00
CPCG16B20/50G16B45/00G16B40/00
Inventor 韦嘉叶翔赟吴金波
Owner 上海基绪康生物科技有限公司
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products