System and method for clustering gene expression data based on manifold learning

A gene expression and manifold learning technology, applied in the field of gene expression data clustering system, can solve problems such as inability to apply gene expression data, inability to find local information in data, etc.

Inactive Publication Date: 2011-09-14
HOHAI UNIV
View PDF2 Cites 38 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] In the above gene expression data clustering algorithms, the traditional clustering algorithm clusters from a single direction and cannot discover the local information of the data; the double clustering algorithm can effectively mine the local information, but it cannot be applied to higher-dimensional gene expression data; and the manifold learning algorithm is a relatively new technology with a very broad application prospect. In comparison, it can overcome the influence of high-dimensional features of gene expression data

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • System and method for clustering gene expression data based on manifold learning
  • System and method for clustering gene expression data based on manifold learning
  • System and method for clustering gene expression data based on manifold learning

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0046] The structure and working process of the present invention will be described in detail below in conjunction with the accompanying drawings.

[0047] Such as figure 1 As shown, the present invention provides a clustering system of gene expression data based on manifold learning, including a gene expression data acquisition system S and a computer C, and the gene expression data acquisition system S is connected to the computer C through a USB data line 8, The system S comprises a hollow housing 7, the bottom of the housing 7 is provided with a support 6, and a light-transmitting glass slide 5 is placed on the support 6, and the light-transmitting glass slide 5 is used to place the microarray chip 2; and the housing 7 A laser scanner 4 is arranged on the top of the laser scanner 4, and the head of the laser scanner 4 is embedded in the casing 7;

[0048] The working principle of the aforementioned clustering system is: when acquiring gene expression data, the sample to b...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a method for clustering gene expression data based on manifold learning, and the method provided by the invention comprises the following steps: acquiring a gene expression data matrix A through an acquisition system, and preprocessing the gene expression data matrix A by using a local linear smoothing algorithm; introducing the preprocessed data matrix A, and constructing a weighted neighborhood figure G in a three-dimensional space; taking the shortest path between two points as the approximate geodesic distance between two points; calculating a two-dimensional embedded coordinate by using an MDS (minimum discernible signal), and mapping the three-dimensional data matrix A to a two-dimensional visual space; and carrying out clustering on the two-dimensional visual space subjected to mapping by using a k-mean clustering algorithm so as to obtain the clustering result. The clustering method has the characteristics of low calculating cost, capability of eliminating high-order redundancies, suitability for pattern classification tasks, and the like; and by using the method disclosed by the invention, the current states of cells, the effectiveness of medicaments to malignant cells, and the like can be discriminated effectively according to the clustering result. The invention also provides a system for clustering gene expression data based on manifold learning.

Description

technical field [0001] The invention belongs to the field of data mining and processing, and in particular relates to a clustering system and method of gene expression data based on manifold learning. Background technique [0002] The progress of science, especially the rapid development of bioinformatics, has brought us into a new era. As one of the core and frontier fields of life sciences and natural sciences, human beings still know little about the mysteries of gene sequences. Bioinformatics and related content still have a long way to go. High-throughput microarray detection technology is a significant breakthrough in biological experiment technology in recent years. With this technology, the transcription levels of thousands of genes can be analyzed in parallel at the same time. Data from large-scale gene expression experiments. Due to the variety of cells and the temporal and spatial specificity of gene expression, the gene expression data is more complex, the data...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06F19/24
Inventor 孙周宝韩立新
Owner HOHAI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products