Check patentability & draft patents in minutes with Patsnap Eureka AI!

Cancer driven recognition based on functional regions of protein sequence

A protein sequence and functional region technology, applied in the field of data mining, can solve problems that have not been fully considered

Pending Publication Date: 2020-02-11
HUNAN UNIV
View PDF3 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] In summary, existing methods do not fully consider the role of a large number of intermediate-sized functional elements in identifying cancer drivers, and rarely estimate the impact of amino acid sequence changes on protein function from comparing individual amino acids and calculate their functional impact scores This direction identifies cancer driver regions

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Cancer driven recognition based on functional regions of protein sequence
  • Cancer driven recognition based on functional regions of protein sequence
  • Cancer driven recognition based on functional regions of protein sequence

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0042] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail in combination with experiments below. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0043]The hardware environment is mainly a PC host. Among them, the CPU of the PC host is Intel(R) Core(TM) i7-6700, 3.40GHz, the memory is 32GB RAM, and the 64-bit operating system. The software uses Windows 7 as the platform and is implemented in R language in the RStudio environment. The RStudio version is 1.1.142 and the R language version is 3.5.0.

[0044] The data used are two relatively complete cancer data sets downloaded from TCGA, breast cancer (Breast Invasive Carcinoma, BRCA) and glioblastoma multiforme (Glioblastoma Multiforme, GBM). There are many, and the data that can be used to verify the results are more co...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

Identifying cancer drivers is a key challenge in explaining the mechanisms of cancer development and achieving precision medicine. Many ways are provided to identify cancer drivers based on a single mutation site or an entire gene, but a lot of medium-sized functional elements are ignored. The fact that mutations which occur in different regions of a protein sequence have different effects on cancer progression is hypothesized. The invention develops a new function-driven region (frDriver) recognition method based on Bayesian probability and multiple linear regression models to identify protein regions which can regulate gene expression levels and have high functional impact potential. By combining gene expression data and somatic mutation data with functional impact scores (SIFT, PROVEAN)as prior knowledge, we identified cancer-driving regions with the most accurate prediction of gene expression levels. We evaluated the performance of frDriver on TCGA's BRCA and GBM datasets. The results indicate that frDriver identifies known cancer drivers and outperforms the other three state-of-the-art methods (eDriver, ActiveDriver and OncodriveCLUST).

Description

technical field [0001] The invention relates to data mining in bioinformatics, in particular to mining of cancer bioinformatics data. Specifically, it relates to a method for identifying novel functionally driven cancer region correlations based on protein sequence functional regions through Bayesian probability and multiple linear regression models. Background technique [0002] Identification of cancer drivers is a key challenge in explaining cancer mechanisms and enabling precision medicine. There are many approaches to identify cancer drivers based on single mutation sites or entire genes. But they omit a large number of medium-sized functional elements. [0003] Mutations can affect cellular regulatory processes, signaling processes, and more. Due to the complexity of the mutations, different functional effects resulted. Compared with germline cells, somatic cells acquire mutations faster, tens to hundreds of times faster than primary cells. However, only a small s...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Applications(China)
IPC IPC(8): G16B30/10G16B40/00G16H70/60
CPCG16B30/10G16B40/00G16H70/60
Inventor 卢新国袁玥王新宇丁莉高妍
Owner HUNAN UNIV
Features
  • R&D
  • Intellectual Property
  • Life Sciences
  • Materials
  • Tech Scout
Why Patsnap Eureka
  • Unparalleled Data Quality
  • Higher Quality Content
  • 60% Fewer Hallucinations
Social media
Patsnap Eureka Blog
Learn More