Differential methylation site identification method

An identification method based on methylation technology, applied in genomics, special data processing applications, instruments, and related fields, which achieves a stable model and better performance

Active Publication Date: 2017-10-13
UNIV OF ELECTRONICS SCI & TECH OF CHINA

AI Technical Summary

Problems solved by technology

However, traditional statistical methods can only roughly locate sites with statistically significant differences. They therefore report many differentially methylated sites, not all of which are useful for cancer diagnosis.



Examples


Embodiment

[0036] For the convenience of description, the relevant technical terms appearing in the specific implementation are explained first:

[0037] fastDMA (Fast Differential Methylation Analysis): a rapid differential methylation analysis method;

[0038] ChAMP (The Chip Analysis Methylation Pipeline): a methylation chip analysis pipeline;

[0039] TCGA (The Cancer Genome Atlas): the Cancer Genome Atlas;

[0040] SWAN (Subset-quantile Within Array Normalization): a subset-quantile normalization method;

[0041] ComBat (empirical Bayes methods): an empirical Bayes method;

[0042] ESCA (esophageal carcinoma): esophageal cancer.

[0043] Figure 1 is a flowchart of the differential methylation site identification method of the present invention.

[0044] In this example, as shown in Figure 1, the differential methylation site identification method of the present invention comprises the following steps:

[0045] S1. Randomly obtain N groups of 450K methylation microarray data samples of a cancer from the...


Abstract

The invention discloses a differential methylation site identification method. Treating the task as a classification problem, it converts the identification of differential methylation sites into a feature selection search for the sites that contribute substantially to classification; those sites are taken to be the differential methylation sites. Specifically, the method first preprocesses 450K methylation chip data acquired from a public database: the data are standardized to eliminate within-block errors, batch effects are removed to eliminate inter-group errors, and sites with small variance are discarded. Second, a random forest model is constructed to obtain each site's contribution value to classification. Finally, a site is determined to be a differential methylation site if its contribution value is greater than 0. The differential methylation sites obtained in this way have better class-discrimination performance and can provide more accurate results for cancer diagnosis.
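The stages named in the abstract (variance filtering, a forest-based contribution score, and a threshold of 0) can be sketched in plain Python. This is a simplified illustration under stated assumptions, not the patented implementation: normalization and batch-effect removal are assumed to have been done already, the forest is built from single-split trees (decision stumps) rather than full trees, the "contribution value" is assumed to be the mean Gini impurity decrease, and the function names, the variance cutoff `min_var`, the tree count, and the toy data are all hypothetical.

```python
import random
from statistics import pvariance

def drop_low_variance(rows, min_var):
    """Return indices of sites (columns) whose variance across samples exceeds min_var."""
    return [j for j in range(len(rows[0]))
            if pvariance([r[j] for r in rows]) > min_var]

def gini(labels):
    """Gini impurity of a list of 0/1 class labels."""
    if not labels:
        return 0.0
    p = sum(labels) / len(labels)
    return 2.0 * p * (1.0 - p)

def best_stump(rows, labels, features):
    """Return (feature, impurity_decrease) of the best single split, or (None, 0.0)."""
    parent, n = gini(labels), len(labels)
    best_f, best_gain = None, 0.0
    for f in features:
        values = sorted(set(r[f] for r in rows))
        for lo, hi in zip(values, values[1:]):
            t = (lo + hi) / 2  # candidate threshold between adjacent values
            left = [y for r, y in zip(rows, labels) if r[f] <= t]
            right = [y for r, y in zip(rows, labels) if r[f] > t]
            gain = parent - (len(left) * gini(left) + len(right) * gini(right)) / n
            if gain > best_gain:
                best_f, best_gain = f, gain
    return best_f, best_gain

def forest_importance(rows, labels, n_trees=200, seed=0):
    """Mean impurity decrease per site over bootstrapped, feature-subsampled stumps."""
    rng = random.Random(seed)
    n, p = len(rows), len(rows[0])
    k = max(1, int(p ** 0.5))  # features tried per tree (assumed sqrt rule)
    importance = [0.0] * p
    for _ in range(n_trees):
        idx = [rng.randrange(n) for _ in range(n)]  # bootstrap sample of rows
        feats = rng.sample(range(p), k)             # random feature subset
        f, gain = best_stump([rows[i] for i in idx], [labels[i] for i in idx], feats)
        if f is not None:
            importance[f] += gain / n_trees
    return importance

def identify_dms(rows, labels, min_var=1e-4, n_trees=200, seed=0):
    """Return original indices of sites whose contribution score is greater than 0."""
    kept = drop_low_variance(rows, min_var)
    if not kept:
        return []
    reduced = [[r[j] for j in kept] for r in rows]
    scores = forest_importance(reduced, labels, n_trees, seed)
    return [site for site, score in zip(kept, scores) if score > 0]

# Toy data (made up): rows are samples, columns are sites, values are methylation levels.
rows = [
    [0.81, 0.5, 0.40], [0.78, 0.5, 0.42], [0.85, 0.5, 0.39], [0.80, 0.5, 0.41],  # tumor
    [0.20, 0.5, 0.41], [0.22, 0.5, 0.40], [0.18, 0.5, 0.42], [0.21, 0.5, 0.39],  # normal
]
labels = [1, 1, 1, 1, 0, 0, 0, 0]
dms = identify_dms(rows, labels)
print(dms)
```

In this toy run, site 0 separates the two classes and is selected, while the constant site 1 is removed by the variance filter before the forest is built. A threshold of 0 is permissive, so a noisy site such as site 2 may also receive a small positive score on so few samples.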

Description

Technical field

[0001] The invention belongs to the technical field of DNA methylation recognition, and more specifically relates to a differential methylation site identification method.

Background

[0002] As the most typical epigenetic phenomenon in the human genome, DNA methylation plays an important role in many key physiological activities. Methylation status is closely related to the occurrence of various diseases, especially cancer. However, not all methylation sites are associated with cancer; only certain specific sites are. These specific methylation sites are called differential methylation sites in this document.

[0003] At present, differential methylation site identification algorithms usually rely on statistical methods: fastDMA uses analysis of variance, and ChAMP uses linear regression combined with t-tests. However, using traditional statistical methods can only roughly find s...


Application Information

Patent Type & Authority: Applications (China)
IPC: IPC(8): G06F19/00
CPC: G16B20/00, G16B40/00
Inventor: 凡时财, 宋应, 邹见效, 何建, 徐红兵
Owner UNIV OF ELECTRONICS SCI & TECH OF CHINA