Method for utilizing machine learning to predict complex disease susceptibility locus

A machine learning and disease technology, applied in the fields of instrumentation, informatics, genomics, etc., to achieve the effect of improving heritability

Inactive Publication Date: 2017-11-10
XI AN JIAOTONG UNIV
View PDF1 Cites 18 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

At present, there are multiple databases revealing genomic epigenetic information, but hundreds of millions of genetic markers and multi-dimensional element information have brought great challenges to the prediction of genetic loci

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Method for utilizing machine learning to predict complex disease susceptibility locus

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] The content of the present invention will be described in further detail below in conjunction with the accompanying drawings.

[0031] Example: Taking the complex disease type II diabetes as an example, the method of the present invention is used to predict the susceptibility loci of type II diabetes, which will be described in detail below.

[0032] Such as figure 1 As shown, the present invention provides a screening method for predicting complex disease susceptibility loci based on the characteristics of genome epigenetic regulatory elements by using machine learning, including the following steps P1-P3.

[0033] P1: Collect known type 2 diabetes susceptibility loci as positive sets for machine learning models and perform annotation of epigenetic regulatory elements.

[0034] It specifically includes: collecting 65 known susceptibility SNPs for type 2 diabetes from relevant literature in the public database GWAS catalog, PheGenI and Pubmed, as a positive set. After...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for utilizing machine learning to predict a complex disease susceptibility locus. The method comprises the following steps of 1, collecting a known complex disease susceptibility locus as a positive set of a machine learning model, predicting a locus irrelevant to a complex disease according to the positive set as a negative set, and annotating an epigenetic regulation element; 2, utilizing machine learning to establish a complex disease epigenetic model; 3, predicting all loci in a whole-genome range according to the established model to obtain a final prediction result as a potential susceptibility locus of the complex disease. According to the method for utilizing machine learning to predict the complex disease susceptibility locus, epigenetic information and genome DNA information are combined, epigenetic element features are extracted through machine learning, the susceptibility locus of the complex disease is further predicted in a whole-genome range, heritability explained by the found susceptibility locus can be obviously improved, and a potential target is provided for subsequent medicine design and disease detection.

Description

technical field [0001] The invention relates to the technical field of prediction of complex disease susceptibility loci, in particular to a screening method for predicting complex disease susceptibility loci using machine learning. Background technique [0002] In recent years, genome-wide association analysis has become the hottest and most effective research method for revealing complex disease susceptibility loci (Single nucleotide polymorphism, SNP). Using this method, more than 2,000 papers have been published in international high-level journals, and nearly 10,000 complex disease susceptibility loci have been successfully identified. Although the results of genome-wide association analysis are quite fruitful, it is far from what scientists expected - to find most of the disease susceptibility loci. For a specific complex disease, the reported disease susceptibility loci accumulatively explain less than 15% of the genetic variation of the disease, and there are still ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F19/24G06F19/18G06F19/12
CPCG16B5/00G16B20/00G16B40/00
Inventor 董珊珊杨铁林姚石陈一霄郭燕张钰洁
Owner XI AN JIAOTONG UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products