Gene module analysis method

An analysis method and gene technology, applied in the field of data analysis, can solve the problems of gene module deviation, lack of high dimension, and inability to obtain disease module structure.

Active Publication Date: 2019-07-26
ANHUI UNIVERSITY
View PDF2 Cites 7 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Because there are still a lot of errors, missing, and high-dimensional defects in the existing data, it is not good to use the previous clustering algorithm to mine the disease module structure.
[0004] The second type of research method is to ...

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Gene module analysis method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0038] refer to figure 1 , a kind of gene module analysis method that the present invention proposes, comprises:

[0039] S1, input gene-phenotype double-layer network, gene function similarity network and disease gene set s known to be related to disease phenotype x 0 ;

[0040] In this step, the genotype double-layer network is specifically: construct a genotype double-layer network according to the gene network A, the phenotype similarity network B, and the gene-phenotype relationship network C, and the adjacency matrix of the double-layer network can be expressed as where C T is the transpose matrix of C.

[0041] In a specific scheme, collecting D 1 l in a database 1 For protein interaction, which contains N proteins, the protein network can be abstracted as a protein point set V 1 and interacting edge set E 1 Composition of graph G 1 =(V 1 ,E 1 ), the number of nodes is denoted as N=|V 1 |, the number of sides is recorded as l 1 =|E 1 |,E 1 Each edge in ha...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a gene module analysis method. The gene module analysis method comprises the following steps: inputting a gene phenotype double-layer network, a gene function similarity network and a known disease gene set s0 related to a disease phenotype; increasing the connection relationship between genes and phenotypes in the gene phenotype double-layer network; taking genes which arein edge connection with disease genes in s0 and are not in s0 in the gene phenotypic double-layer network as candidate genes; calculating and selecting a candidate gene with the maximum sum of semantic similarity, topological similarity and phenotypic relevance, and adding the candidate gene into s0; and when the expanded candidate gene set does not significantly enrich the GO ontology function annotation and the biological pathway gene related to the disease phenotype at the same time and express the differential gene in the disease phenotype sample and the normal sample any more, recordingthe current algebra as m, and outputting m-1 as candidate genes expanded in the first generation s0 and known disease genes connected with the edges of the expanded candidate genes.

Description

technical field [0001] The invention relates to the technical field of data analysis, in particular to a gene module analysis method. Background technique [0002] At present, the detection methods of disease modules in human interaction networks can be roughly divided into three categories: [0003] The first type of research method is to detect disease modules based on gene expression data. Gene expression profile data is a common and effective data resource for the mining and research of disease modules. Because there are still a lot of errors, missing, and high-dimensional defects in the existing data, it is not possible to obtain good results only by using the previous clustering algorithm to mine the disease module structure. [0004] The second type of research method is to detect disease modules based on a single network, but the one-sidedness and errors of single network data will have a great impact on the mining of gene modules, resulting in a large deviation bet...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
IPC IPC(8): G16B20/00G16B25/00G16B40/00G16H50/70
CPCG16B20/00G16B25/00G16B40/00G16H50/70Y02A90/10
Inventor 苏延森祝火乐张磊
Owner ANHUI UNIVERSITY
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products