Single cell integrated clustering method based on subspace randomization

A clustering method, single-cell technology, applied in the field of data mining in bioinformatics

Inactive Publication Date: 2021-04-30
HUNAN UNIV
View PDF2 Cites 6 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

However, these criteria can only capture the local similarity of cells

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Single cell integrated clustering method based on subspace randomization
  • Single cell integrated clustering method based on subspace randomization
  • Single cell integrated clustering method based on subspace randomization

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0040] The present invention is based on subspace randomization single cell integrated clustering method. Specific embodiments of the present invention are described below. Those skilled in the art should understand that these embodiments are only used to explain the technical principles of the present invention, and are not intended to limit the scope of evidence collection of the present invention.

[0041] Step 1: From the GEO database, download 8 types of single-cell sequencing data, including Buettner (single-cell RNA sequencing of mouse embryonic stem cells), Deng (single-cell sequencing of mouse early development), Kolodziejczyk, Chu (single-cell sequencing of human embryonic stem cells Cell Sequencing), Usoskin (single-cell RNA-sequencing data of mouse sensory neurons), Zeisel (large-scale single-cell RNA-sequencing of mouse somatosensory cortex and hippocampal CA1 region). As the initial input data, in order to eliminate the influence of different dimensions, we norm...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the field of data mining in bioinformatics, in particular to a single cell integrated clustering method based on subspace randomization. The method mainly comprises the following steps: (1) data preprocessing; (2) random subspace sampling is carried out to carry out cell similarity measurement; (3) subspace fusion is performed; and (4) single cell clustering is performed by measuring the overall similarity based on spectral clustering to obtain a final result. Compared with the prior art, the single cell clustering method provided by the invention is used for characterizing the novel cell type and detecting the heterogeneity in the population, and has stronger statistical ability and better stability. The method provided by the invention is feasible and effective, can achieve a good effect in the aspect of identifying the single cell cluster, and has important significance for researching cell type classification and identification of a complex data set.

Description

technical field [0001] The invention relates to the field of data mining in bioinformatics, in particular to a single-cell integrated clustering method based on subspace randomization. Background technique [0002] As the basic structural and functional unit of organisms, single cells store important genetic information. During the process of cell proliferation and differentiation, many factors can lead to the occurrence of cell heterogeneity, such as cell state, cell microenvironment and regulation of intracellular processes. Previously, bulk sequencing technology typically analyzed tens of thousands of cells in total, where the gene expression value was the average score of all cells. As a result, it typically highlights population cell types while masking rarer cell types such as stem cells and cancer cells. Fortunately, single-cell RNA sequencing (scRNA-seq) technology can extract transcriptome information at single-cell resolution, changing the traditional transcripto...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G16B40/30G06K9/62
CPCG16B40/30G06F18/23
Inventor 卢新国高妍李金鑫彭绍亮曾湘祥
Owner HUNAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products