Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

Clustering method of gene microarray containing missing values based on joint training

A technology of gene microarray and clustering method, which is applied in the field of clustering of gene microarrays with missing values

Active Publication Date: 2021-04-27
SOUTH CHINA UNIV OF TECH
View PDF8 Cites 1 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In addition, for clustering tasks, how to select the best combination of imputation method and clustering method for gene microarrays with missing values ​​in an unsupervised environment is also a difficult problem

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Clustering method of gene microarray containing missing values based on joint training
  • Clustering method of gene microarray containing missing values based on joint training
  • Clustering method of gene microarray containing missing values based on joint training

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0042] Such as figure 1 As shown, this embodiment discloses a clustering method for gene microarrays containing missing values ​​based on joint training. First, a sample of the gene microarray data set is obtained, and the sample of the gene microarray data set is a data matrix, and the gene microarray data Divided into a training set and a test set, in the present embodiment, the data matrix is ​​composed of 500 row vectors, each row vector of the data matrix corresponds to the expression data sequence of the gene, and 500 represents the number of genes recorded in the data matrix; then, construct A deep neural network based on joint training of gene microarrays containing missing values, wherein the deep neural network includes a Sequence-To-Sequence encoding-decoding network and an adversarial learning module; use the training set to train the deep neural network and determine the deep neural network The learnable parameters in the test set are input into the deep neural ne...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a clustering method for gene microarrays containing missing values based on joint training; the method comprises the following steps: calculating the missing rate of gene microarray data, removing gene points with the missing rate exceeding 10%, and then dividing the gene microarray data into a training set and a test set; constructing a deep neural network, including a Sequence-To-Sequence encoding-decoding network and an adversarial learning module, and training the constructed deep neural network by using the training set to determine parameters of the deep neural network; inputting the test set into a deep neural network to obtain deep feature representation containing missing value gene microarray data in the test set; and finally, applying a K-means clustering algorithm to the deep feature representation to obtain a clustering result of the gene microarray containing the missing values. According to the method, an end-to-end framework can be provided for gene expression data clustering, and the difficulty that a proper filling method and a clustering method need to be selected to be combined in a traditional method is solved.

Description

technical field [0001] The invention relates to the technical field of gene microarrays, in particular to a clustering method for gene microarrays containing missing values ​​based on joint training. Background technique [0002] Clustering methods have always been of great experimental value in gene microarray research. For example, after obtaining the expression sequences of several genes, clustering the expression sequences of genes is a common experimental method. function to help. [0003] In the process of obtaining gene expression sequence data, missing values ​​often appear due to human omissions or uncontrollable experimental errors. The appearance of missing values ​​has greatly hindered the downstream research of gene microarrays. Resetting the experimental environment to obtain a complete gene expression sequence is expensive and impractical. Therefore, how to mine information that is beneficial to analysis from gene expression data with missing values ​​is v...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
IPC IPC(8): G16B40/00G16B25/00G06N3/04G06N3/08G06K9/62
CPCG16B40/00G16B25/00G06N3/08G06N3/045G06F18/23213G06F18/214
Inventor 马千里陈楚鑫
Owner SOUTH CHINA UNIV OF TECH
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products