Looking for breakthrough ideas for innovation challenges? Try Patsnap Eureka!

A clustering recommendation method for single-cell transcriptome sequencing data based on two-dimensional distribution structure determination

A transcriptome sequencing and two-dimensional distribution technology, applied in the field of bioinformatics, can solve problems such as differences in clustering results, dependence on the accuracy of similarity matrix, etc., and achieve the effect of improving the accuracy of clustering

Active Publication Date: 2022-04-15
CENT SOUTH UNIV
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

Therefore, spectral clustering can deal with more complex data distribution structures, such as fuzzy boundary problems, but the disadvantage of the method is that it relies heavily on the accuracy of the similarity matrix
[0005] Because the two clustering methods are based on different theories and strategies, there may be differences in the clustering results on data with different distribution structures

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A clustering recommendation method for single-cell transcriptome sequencing data based on two-dimensional distribution structure determination
  • A clustering recommendation method for single-cell transcriptome sequencing data based on two-dimensional distribution structure determination
  • A clustering recommendation method for single-cell transcriptome sequencing data based on two-dimensional distribution structure determination

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0032] The following is a detailed description of the embodiments of the present invention. This embodiment is carried out based on the technical solution of the present invention, and provides detailed implementation methods and specific operation processes to further explain the technical solution of the present invention.

[0033] This embodiment provides a single-cell transcriptome sequencing data clustering recommendation method based on two-dimensional distribution structure determination, including the following steps:

[0034] Step 1, obtain the single-cell transcriptome sequencing data of N cells, and obtain the gene expression matrix X=[x 1 ,x 2 ,...,x N ],x i =[x i1 ,x i2 ,...,x im ], i=1,2,...,N, m represents the number of genes in the cell, x i1 ,x i2 ,...,x im Indicates the expression levels of cell i in m genes respectively; delete the genes whose expression level is 0 in the gene expression matrix X to complete the filtering, and then standardize the fi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

PUM

No PUM Login to View More

Abstract

The invention discloses a single-cell transcriptome sequencing data clustering recommendation method based on two-dimensional distribution structure determination, comprising: obtaining a gene expression matrix obtained from single-cell transcriptome sequencing data of multiple cells, after filtering and standardization, Construct a two-dimensional feature matrix and perform linear normalization; calculate the Euclidean distance between cells according to the normalized two-dimensional feature matrix, so as to establish the minimum spanning tree of cells; cut the minimum spanning tree of cells by adaptive threshold, and use The balance of the clusters formed after cutting is used to determine the two-dimensional distribution structure of the data; for data with fuzzy inter-cluster boundaries and continuous two-dimensional distribution structure, a hierarchical clustering algorithm is recommended and applied, while for data with obvious inter-cluster boundaries and block For data with two-dimensional distribution structure, recommend and apply spectral clustering algorithm. The present invention can recommend a method that is more suitable for the two-dimensional distribution structure of single-cell transcriptome sequencing data in hierarchical clustering and spectral clustering as a downstream clustering analysis method, thereby improving clustering accuracy.

Description

technical field [0001] The invention relates to the field of bioinformatics, and relates to a single-cell transcriptome sequencing data clustering recommendation method based on two-dimensional distribution structure determination. Background technique [0002] In the field of cell biology, single-cell analysis is the study of genomics, transcriptomics, proteomics, and metabolomics at the single-cell level. It provides an ultrasensitive tool to elucidate specific molecular mechanisms and pathways and reveal the nature of cellular heterogeneity. With the development of technology and the decline of cost, transcriptome sequencing (scRNA-seq) technology applied to single-cell whole genome is rapidly becoming the choice in many fields such as biology and biomedical research. Studying genome-wide gene expression at single-cell resolution overcomes the inherent limitations of traditional RNA-sequencing, and single-cell transcriptome sequencing enables researchers to more rigorous...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to View More

Application Information

Patent Timeline
no application Login to View More
Patent Type & Authority Patents(China)
IPC IPC(8): G16B40/00G16B35/00G16B30/00G06K9/62
CPCG16B40/00G16B30/00G16B35/00G06F18/231G06F18/2323
Inventor 李敏田宇郑瑞清
Owner CENT SOUTH UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Patsnap Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Patsnap Eureka Blog
Learn More
PatSnap group products