A fully automated clustering method for flow cytometry based on density and nonparametric clustering

A flow cytometry, non-parametric technology, applied in the field of medical data processing and flow cytometry data analysis, can solve the problems of artificial clustering error, unfavorable new cell population discovery and mining, poor clustering ability, etc., to eliminate noise. Interference and non-specific signals, saving automatic grouping time, and fast dimensionality reduction

Active Publication Date: 2022-07-15
ZHEJIANG BOZHEN BIOTECH CO LTD
View PDF0 Cites 0 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In this algorithm, the PCA dimensionality reduction speed is fast, and it is suitable for the situation where the positive group and the negative group are very clearly grouped. Once the positive and negative groups are not sufficiently separated, the groups after dimensionality reduction will overlap, resulting in difficulty in clustering; K-means clustering The class accuracy is poor, and it is only suitable for accurate clustering to distinguish cell populations that are distributed in a circular manner after dimensionality reduction, and the cell populations that are irregularly distributed after dimensionality reduction have extremely poor clustering ability, and the K-means algorithm needs to specify cell populations in advance Quantity, which is an obstacle to automatic clustering, will introduce artificial clustering errors, which is not conducive to the automatic process and the discovery and mining of new cell populations
There are also methods that do not involve dimensionality reduction algorithms, and directly use the neural network model to perform grouping and cell property judgment in multidimensional space, which will cause a large loss in accuracy and performance.

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • A fully automated clustering method for flow cytometry based on density and nonparametric clustering
  • A fully automated clustering method for flow cytometry based on density and nonparametric clustering
  • A fully automated clustering method for flow cytometry based on density and nonparametric clustering

Examples

Experimental program
Comparison scheme
Effect test

Embodiment 1

[0043] Bone marrow sample from a leukopenic patient, 10-color scheme, according to the present invention figure 1 As shown in the method, obtain the flow FCS or LMD file, read the data, and organize the data of each fluorescence channel combined with FCS (forward scattered light) and SSC (side scattered light) into tabular data, each row represents a cell, and each column Represents the fluorescence signal or physical parameter value of the corresponding channel of the cell, and the TIME column represents the time point when the cell was acquired. The UMAP algorithm is used to quickly reduce the dimension; the actual measurement of 50,000 12-dimensional flow data, the UMAP dimension reduction takes an average of 35.45 seconds. , t-SNE dimensionality reduction takes an average of 173.98 seconds.

[0044] According to the cell population density distribution after dimensionality reduction, the DBSCAN algorithm is used for clustering, and the clustering diagram is as follows fi...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a flow cytometry automatic clustering method based on density and non-parameter clustering, which belongs to the technical field of medical data processing and flow cytometry data analysis, and takes into account the characteristics of different algorithms and flow cytometry data analysis. The process does not require the user to specify the number of cell groups in the whole process, which is conducive to the automatic process and the discovery and mining of new cell groups; the dimensionality reduction speed is fast, and the UMAP dimensionality reduction speed is 2-10 times faster than the t-SNE dimensionality reduction, which greatly saves the time for automatic clustering ; DBSCAN combined with FlowPeaks algorithm can accurately distinguish cell populations of any shape, and can effectively eliminate noise interference and non-specific signals.

Description

technical field [0001] The invention relates to the technical field of medical data processing and flow cytometry data analysis, in particular to a flow cytometry automatic clustering method based on density and nonparametric clustering. Background technique [0002] Existing cell clustering methods are based on principal component analysis (PCA) dimensionality reduction and K-means clustering. In this algorithm, PCA has a fast dimensionality reduction speed, which is suitable for the case where the positive and negative groups are clearly grouped. Once the positive and negative groups are not sufficiently separated, the groups after dimensionality reduction will overlap, resulting in difficulty in clustering; K-means clustering The class accuracy is poor, and it is only suitable for accurate clustering to distinguish the cell groups with a circular distribution after dimensionality reduction. The clustering ability of the irregularly distributed cell groups after dimensiona...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Patents(China)
IPC IPC(8): G01N15/14
CPCG01N15/1429
Inventor 倪万茂林鹏程迟妍妍倪万根陈乐芝陈鹏贵陈慧项艺超
Owner ZHEJIANG BOZHEN BIOTECH CO LTD
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products