Method for mining whole genome key genes by differentiation and association rules

A method combining differential analysis with key gene mining, applied in the field of data processing. It addresses problems such as insufficient mining of the information in the data, failure to provide a reliable basis for biological experiments, and failure to comprehensively consider the interactions among samples, with the effect of improving identification efficiency and reducing data dimensionality.

Active Publication Date: 2018-05-15
XIDIAN UNIV

AI Technical Summary

Problems solved by technology

[0003] In summary, the problems with the existing technology are: methods based on gene-gene interaction do not comprehensively consider the interactions between samples and genes, the information contained in the data is not fully exploited, and the results cannot provide a reliable basis for subsequent biological experiments.



Examples


Embodiment Construction

[0022] In order to make the objectives, technical solutions and advantages of the present invention clearer, the present invention will be further described in detail below in conjunction with embodiments. It should be understood that the specific embodiments described herein are only used to explain the present invention, but not to limit the present invention.

[0023] The invention combines the differential analysis method with the association rule mining algorithm so that their strengths complement each other, thereby better identifying the genes whose expression is significantly prominent in the data set and improving the ability to identify key genes related to complex diseases.

[0024] The application principle of the present invention will be described in detail below with reference to the accompanying drawings.

[0025] As shown in Figure 1, the method for mining key genes of the whole genome by combining differential analysis and association rules provided by the embodiment of ...


Abstract

The invention belongs to the technical field of data processing and discloses a method for mining key genes of the whole genome by combining differential analysis and association rules. The method comprises the following steps: first, multiple probes are collapsed using the correspondence between gene names and probe names, and the processed genes are then preliminarily screened with a differential analysis method to obtain the genes with significant differential expression; next, all samples of the differential genes are divided into an up-regulation transaction set and a down-regulation transaction set according to threshold values, association rules are mined separately from the two transaction sets, and the genes corresponding to the top-ranked rules are taken as the key genes; finally, the effectiveness of these genes is verified by biological function analysis. The method comprehensively considers both the associations between samples and the associations between genes, can process whole genome expression data, and finds the key genes with outstanding expression. It is of significance for disease risk prediction, research on the pathogenesis of complex diseases, biopharmaceutical technologies and the like.
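The steps in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the function names, the use of the mean to collapse probes, the fixed expression thresholds, and the restriction to rules between gene pairs (rather than a full Apriori-style search) are all assumptions made for illustration, and the differential screening step is omitted here.

```python
from collections import defaultdict
from itertools import combinations
from statistics import mean


def collapse_probes(probe_values, probe_to_gene):
    """Average all probes that map to the same gene (assumed collapse rule)."""
    grouped = defaultdict(list)
    for probe, values in probe_values.items():
        gene = probe_to_gene.get(probe)
        if gene is not None:  # skip probes without a gene annotation
            grouped[gene].append(values)
    return {
        gene: [mean(column) for column in zip(*rows)]
        for gene, rows in grouped.items()
    }


def build_transactions(gene_values, up_threshold, down_threshold):
    """Per sample, collect the up-regulated and the down-regulated genes."""
    n_samples = len(next(iter(gene_values.values())))
    up = [set() for _ in range(n_samples)]
    down = [set() for _ in range(n_samples)]
    for gene, values in gene_values.items():
        for i, value in enumerate(values):
            if value >= up_threshold:
                up[i].add(gene)
            elif value <= down_threshold:
                down[i].add(gene)
    return up, down


def mine_pair_rules(transactions, min_support, min_confidence):
    """Mine rules lhs -> rhs over gene pairs, best confidence first."""
    n = len(transactions)
    item_count = defaultdict(int)
    pair_count = defaultdict(int)
    for items in transactions:
        for gene in items:
            item_count[gene] += 1
        for pair in combinations(sorted(items), 2):
            pair_count[pair] += 1
    rules = []
    for (a, b), count in pair_count.items():
        support = count / n
        if support < min_support:
            continue
        for lhs, rhs in ((a, b), (b, a)):
            confidence = count / item_count[lhs]
            if confidence >= min_confidence:
                rules.append((lhs, rhs, support, confidence))
    # Rank by confidence, then support; names break ties deterministically.
    rules.sort(key=lambda r: (-r[3], -r[2], r[0], r[1]))
    return rules


# Hypothetical toy data: four probes, four samples.
probe_values = {
    "p1": [2.0, 2.4, 0.1, 2.2],
    "p2": [2.2, 2.0, 0.3, 2.0],
    "p3": [1.8, 1.9, 0.2, 0.1],
    "p4": [-1.5, -1.7, -1.6, 0.0],
}
probe_to_gene = {"p1": "G1", "p2": "G1", "p3": "G2", "p4": "G3"}

genes = collapse_probes(probe_values, probe_to_gene)
up, down = build_transactions(genes, up_threshold=1.0, down_threshold=-1.0)
up_rules = mine_pair_rules(up, min_support=0.5, min_confidence=0.6)
down_rules = mine_pair_rules(down, min_support=0.5, min_confidence=0.6)
```

In this toy data the top-ranked up-regulation rule is G2 -> G1, so both genes would be candidates for the key gene list; the down-regulation set contains only a single gene and therefore yields no pair rules.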

Description

Technical field

[0001] The invention belongs to the technical field of data processing, and in particular relates to a method for mining key genes of the whole genome by combining differential analysis and association rules.

Background technique

[0002] With the rapid development of gene sequencing technology, life science research has acquired powerful data generation capabilities, producing biological data in fields such as genomics, transcriptomics, proteomics, and metabolomics. Combining knowledge from statistics, biology and computer science to uncover the biological mysteries contained in these massive data sets has become a hot topic in bioinformatics. Genome-Wide Gene Expression (GWGE) research refers to selecting a number of samples from the normal and case groups of a specific population and comparing, at the whole genome level, the difference in the expression values of each gene between the normal and case groups. If the expression value of a gene is too differ...
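The per-gene comparison between normal and case groups described in the background can be sketched as follows. The excerpt does not say which statistical test is used, so Welch's t statistic is an assumption here, as are the function names and the toy expression values.

```python
from math import sqrt
from statistics import mean, variance


def welch_t(normal, case):
    """Welch's t statistic for one gene (unequal variances allowed)."""
    standard_error = sqrt(
        variance(normal) / len(normal) + variance(case) / len(case)
    )
    return (mean(case) - mean(normal)) / standard_error


def rank_genes(expression, normal_idx, case_idx):
    """Return (gene, t) pairs sorted by |t|, largest difference first."""
    scores = []
    for gene, values in expression.items():
        normal = [values[i] for i in normal_idx]
        case = [values[i] for i in case_idx]
        scores.append((gene, welch_t(normal, case)))
    return sorted(scores, key=lambda item: abs(item[1]), reverse=True)


# Hypothetical data: samples 0-2 are the normal group, 3-5 the case group.
expression = {
    "GENE_A": [1.0, 1.2, 0.9, 3.1, 3.0, 2.8],
    "GENE_B": [2.0, 2.1, 1.9, 2.0, 2.2, 1.8],
}
ranked = rank_genes(expression, normal_idx=[0, 1, 2], case_idx=[3, 4, 5])
```

Here GENE_A, whose case group mean is far above its normal group mean, ranks first, while GENE_B has nearly identical group means and a statistic close to zero. A real analysis would also convert the statistic to a p-value and correct for multiple testing across all genes.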


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06F19/24, G06F19/18
CPC: G16B20/00, G16B40/00
Inventor: 杨利英, 张粉利, 袁细国
Owner: XIDIAN UNIV