Mutually-exclusively constrained graph Laplacian approach to heterogeneous cancer driver gene identification

A technology of driving genes and identification methods, applied in genomics, proteomics, medical automated diagnosis, etc., can solve problems affecting the effective identification of driving genes, and achieve the effect of improving the performance of gene identification

Active Publication Date: 2021-10-22
NORTHWESTERN POLYTECHNICAL UNIV
View PDF8 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

The invention can solve the problem of differential estimation of parameters of cancer samples and effective identification of driver genes in local samples affected by the interaction, and realizes the identification of driver genes that mutate in local samples from the gene variation data of heterogeneous cancer samples

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Mutually-exclusively constrained graph Laplacian approach to heterogeneous cancer driver gene identification
  • Mutually-exclusively constrained graph Laplacian approach to heterogeneous cancer driver gene identification
  • Mutually-exclusively constrained graph Laplacian approach to heterogeneous cancer driver gene identification

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0027] The present invention will be further described below in conjunction with the accompanying drawings and embodiments, and the present invention includes but not limited to the following embodiments.

[0028] Such as figure 1 As shown, the present invention provides a method for identifying heterogeneous cancer driver genes with mutually exclusive constraint graph Laplacian, and its specific implementation process is as follows:

[0029] Step 1: Obtain cancer gene variation data through databases such as The Cancer Genome Atlas (TCGA) and International Cancer Genome Consortium (ICGC). Data on gene interaction networks were collected from the STRING interaction database. The ID of the gene name was unified through the gene annotation database Database for Annotation, Visualization and Integrated Discovery (DAVID), so as to eliminate the phenomenon of homonym of genes in different sources of data.

[0030] Step 2: Use a matrix model to describe the heterogeneity of cancer...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The present invention provides a method for identifying heterogeneous cancer driver genes based on a mutually exclusive constrained graph Laplacian. First, obtain cancer genome variation data and gene interaction network; then, use a matrix model to describe the heterogeneity of cancer, and use mutually exclusive constraint matrix decomposition to estimate the sample parameters of heterogeneity cancer; Then, construct the mutual exclusion constraint matrix decomposition optimization function regularized by the joint association and interaction network, and correct the parameters of the driving genes affected by the interaction in the local samples through iterative solution; finally, use the outlier test method to identify the driving genes . The invention can solve the problem of differential estimation of parameters of cancer samples and effective identification of driver genes in local samples affected by interaction, and realizes the identification of driver genes mutated in local samples from gene variation data of heterogeneous cancer samples.

Description

technical field [0001] The invention belongs to the technical fields of bioinformatics and genome data mining, and in particular relates to a method for identifying heterogeneous cancer driver genes based on mutually exclusive constraint graph Laplacian. Background technique [0002] Cancer is a high-incidence malignant disease, mainly caused by mutations in driver genes. However, in the cancer genome, there are a large number of concomitant mutations that have nothing to do with carcinogenesis, causing serious confusion in the identification of driver genes. Because driver gene mutations are more likely to occur in multiple samples at the same time than concomitant mutations, existing studies mainly use the gene variation data of cancer samples, regard driver genes as high-frequency mutation genes in multiple samples, and examine gene Statistical Significance of Mutation Rates, Finding Driver Genes with Significant High-Frequency Variations in Multiple Samples. For exampl...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G16B20/50G16H50/20
CPCG16H50/20G16B20/50
Inventor 习佳宁黄庆华
Owner NORTHWESTERN POLYTECHNICAL UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products