Depth unsupervised single cell clustering method based on Gaussian mixture graph variational auto-encoder

A Gaussian mixture, autoencoder technology, applied in neural learning methods, instruments, genomics, etc., can solve problems such as lack of single peak judgment, wrong definition, and neglect of interaction relationships

Pending Publication Date: 2022-07-22
NANKAI UNIV
View PDF0 Cites 3 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, each method has problems worth noting: chromVAR can only analyze groups of peaks, and lacks the judgment of individual peaks; while scABC relies heavily on label samples with high sequencing depth, and for data with many missing values, Especially for scATAC-seq data, the Spearman correlation coefficient may be misdefined
[0005] In short, most of the input data of existing single-cell clustering methods are cell gene expression or chromatin openness, ignoring the interaction relationship between these genes or open regions, and only using vector form as model input
Second, because the interaction between genes or open regions is not considered, existing methods can only achieve single-cell clustering and cannot simultaneously predict cell-type-specific gene regulatory networks.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Depth unsupervised single cell clustering method based on Gaussian mixture graph variational auto-encoder
  • Depth unsupervised single cell clustering method based on Gaussian mixture graph variational auto-encoder
  • Depth unsupervised single cell clustering method based on Gaussian mixture graph variational auto-encoder

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0063] The present invention will be further described in detail below with reference to the accompanying drawings and specific embodiments. It should be understood that the specific embodiments described herein are only used to explain the present invention, but not to limit the present invention.

[0064] like figure 1 As shown, the first aspect of the embodiment of the present invention provides a deep unsupervised single-cell clustering method based on Gaussian mixture graph variational autoencoder (VGAE), which is obtained by using a graph variational autoencoder based on Gaussian mixture distribution. A method for gene regulation network A, cell low-dimensional representation Z and cell clustering C, including the following steps:

[0065] Step 1: Input protein-protein interaction relationship PPIs, single-cell gene expression data X;

[0066] Step 2: Initialize trainable gene regulation network A using PPIs, the specific steps are as follows:

[0067] Step 2.1: Numbe...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a depth unsupervised single cell clustering method based on a Gaussian mixture graph variational auto-encoder. The method comprises the following steps: initializing a gene regulatory network A by using a protein-protein interaction relationship PPIs (or regulatory element interaction HiChIP); initializing a cell cluster C of each cell by using a K-means method; enabling the gene regulatory network A and single-cell gene expression data X (or regulatory element opening degree data X) to pass through a graph encoder to obtain a hidden layer; obtaining a cell cluster C, and sampling from a Gaussian mixture model GMM to obtain a cell low-dimensional representation Z; predicting a gene regulatory network A by using a decoder; calculating a loss function, performing back propagation to update A and GCN, and repeating the steps until convergence; and outputting a gene regulatory network A, a cell low-dimensional representation Z and a cell cluster C. Clustering of cells and dimensionality reduction of cell expression are completed in the process of constructing the gene regulatory network A.

Description

technical field [0001] The invention relates to the technical field of single-cell clustering, in particular to a deep unsupervised single-cell clustering method based on Gaussian mixture graph variational autoencoder. Background technique [0002] Single-cell sequencing technology refers to a technology for high-throughput sequencing analysis of the genome, transcriptome, and epigenome at the single-cell level. It plays an important role in tumor, developmental biology, microbiology, neuroscience and other fields, and is becoming the focus of life science research. One of the first tasks to be solved in single-cell sequencing is to identify the cell types contained in the sample, that is, to complete cell clustering based on unsupervised algorithms. Single-cell clustering enables the establishment of a comprehensive reference for all cell types at different developmental stages in an organism or tissue, which, in addition to providing a deeper understanding of basic biolog...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G16B40/30G16B20/00G16B25/10G06K9/62G06N3/04G06N3/08
CPCG16B40/30G16B20/00G16B25/10G06N3/084G06N3/088G06N3/045G06F18/2321G06F18/2155
Inventor 曾婉雯张爽范蕊
Owner NANKAI UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products