Eureka AIR delivers breakthrough ideas for toughest innovation challenges, trusted by R&D personnel around the world.

Epistasis locus mining method based on genetic tabu and Bayesian network

A Bayesian network and locus technology, applied in the field of biological information, can solve the problems of inability to accurately and efficiently detect SNP loci and their combinations, low efficiency, and computational difficulties, and achieve the effect of assisting gene function mining.

Active Publication Date: 2019-03-08
HUAZHONG AGRI UNIV
View PDF5 Cites 8 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, the current epistasis detection methods still have problems such as computational difficulties, high algorithm complexity, low efficiency, and high false positive rate, resulting in the inability to accurately and efficiently detect SNP sites and their combinations associated with diseases

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Epistasis locus mining method based on genetic tabu and Bayesian network
  • Epistasis locus mining method based on genetic tabu and Bayesian network
  • Epistasis locus mining method based on genetic tabu and Bayesian network

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0050] In order to make the object, technical solution and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the specific embodiments described here are only used to explain the present invention, not to limit the present invention.

[0051] 1. Use data in the form of 0, 1, and 2 to represent genotype data. For example, the data of SNP genotype AT is represented as follows: AA is represented by 0, TT is represented by 2, and AT / TA is represented by 1. figure 2 11 SNPs are shown (SNP A ~SNP K ) corresponding to the genotype data of the 4 samples, the last column Class represents the phenotypic traits, where Class=1 represents the case (disease), and Class=0 represents the control (control). In order to improve the calculation efficiency of subsequent conditional mutual information between nodes and Bayesian network scoring, th...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses an epistasis locus mining method based on a genetic tabu and Bayesian network, and the method comprises the following steps: 1, converting genotype data into Boolean data in binary representation; 2, using the logic and the operation to quickly calculate the mutual information between arbitrary SNP locus pairs and phenotype, extracting a top-N node pair, and constructing aninitial network map comprising the SNP loci; 3, generating new individuals based on an initial network individual through randomly adding edges, deleting edges and reversing the edges until the number of network individuals reaches the size of the population; 4, evolving a Bayesian network structure through three operations of the genetic algorithm and a scoring mechanism of the Bayesian network,finding an optimal solution of the network structure, and quickly and accurately obtaining the epistasis locus affecting the phenotypic traits. The method can help biological researchers to obtain epistatic gene loci affecting specific phenotypic traits, thereby assisting in gene function mining, and providing reference for genetic basis analysis of complex quantitative traits of different species.

Description

technical field [0001] The invention relates to the technical field of biological information, in particular to a method for mining epistasis sites based on genetic taboos and Bayesian networks. Background technique [0002] With the continuous improvement and improvement of people's living standards and medical environment, those diseases (such as infectious diseases, malnutrition, etc.) major diseases. Mendelian genetic disease is a single-gene disease, and its genetic process follows the law of Mendelian inheritance. At present, researchers have identified related genetic genes by positional cloning, and basically clarified their genetic methods. Complex diseases account for more than 80% of human diseases and have caused great harm to human health. Common chronic diseases such as asthma, cancer, diabetes, hypertension, Alzheimer's disease, rheumatoid arthritis, schizophrenia, heart disease, cardiovascular disease, obesity, and tumors are collectively referred to as com...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G16B40/00G06N3/12
CPCG06N3/126
Inventor 刘建晓果杨钟芷漫杨晨胡江峰蒋雅玲梁子珍高辉
Owner HUAZHONG AGRI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Eureka Blog
Learn More
PatSnap group products