US-ELM-based gene chip expression data analysis system and method

A data analysis system and gene chip technology, applied in the field of medical big data mining, can solve the problems of data analysis scale and efficiency constraints, achieve the effect of multiple similarities and improve accuracy

Active Publication Date: 2017-11-21
NORTHEASTERN UNIV
View PDF17 Cites 10 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Many traditional analysis methods have limitations, which greatly restricts the scale and efficiency of data analysis

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • US-ELM-based gene chip expression data analysis system and method
  • US-ELM-based gene chip expression data analysis system and method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0058] Extreme learning machine (extreme learning machine, ELM) is an easy-to-use and effective single-hidden-layer feed-forward neural network SLFNs learning algorithm. In 2004, it was proposed by Associate Professor Huang Guangbin of Nanyang Technological University. Traditional neural network learning algorithms (such as BP algorithm) need to artificially set a large number of network training parameters, and it is easy to generate local optimal solutions. The extreme learning machine only needs to set the number of hidden layer nodes of the network, and does not need to adjust the input weights of the network and the bias of the hidden elements during the algorithm execution process, and produces the only optimal solution, so it has fast learning speed and generalization The advantages of good performance.

[0059] Unsupervised extreme learning machine (unsupervised extreme learning machine, US-ELM), this algorithm maintains the learning ability and computational effectiv...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a US-ELM-based gene chip expression data analysis system and method. The system includes: a gene preprocessing unit, which is used for preprocessing a gene chip to obtain a data format which is suitable for use in experimentation; a differential gene screening unit, which finds differential genes, of which expression obviously changes in different individuals or different tissues, in the gene chip on the basis of a gene expression data matrix to obtain a differential expression gene matrix; a clustering unit, which is used for carrying out clustering analysis on the differential expression gene matrix to obtain a co-expression gene sequence; and an enrichment analysis unit, which is used for carrying out enrichment analysis on the co-expression gene sequence to obtain multiple pathways involved in gene participation, and derive biological function interpretation of the co-expression gene sequence on data. According to the analysis system and method of the invention, the accuracy of data analysis is improved as a whole, the more valid genes of obvious expression differentiation are screened out in a differential gene processing process, and a category derived in a clustering process has more similarities on biological interpretation.

Description

technical field [0001] The invention belongs to the technical field of medical big data mining, and in particular relates to a US-ELM-based gene chip expression data analysis system and method. Background technique [0002] At present, gene chips have become an important research method in clinical research, and the results of data analysis directly affect doctors' diagnosis of diseases. At present, there are many related studies on gene chip data analysis, mainly focusing on finding differentially expressed genes / intersection analysis, data dimensionality reduction, cluster analysis and functional enrichment analysis. However, how to obtain the raw data of the gene chip and convert it into the data form required for various experimental purposes has become a key technical point. [0003] In the existing research on genetic data analysis algorithms, most of the genetic data processed come from public genetic databases, such as GEO databases. Due to the small sample size an...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F19/22G06F19/24G06N3/00
CPCG06N3/006G16B30/00G16B40/00
Inventor 王之琼李艳丽曲璐渲汪新蕾赵亚楠
Owner NORTHEASTERN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products