Single-cell RNA sequence deletion value filling method based on generative adversarial network
A missing value and single-cell technology, applied in the field of bioinformatics, can solve the problem of low accuracy of missing data filling, and achieve the effect of improving accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
specific Embodiment approach 1
[0042] Specific implementation mode 1. Combination figure 1 This embodiment will be described. A method for filling missing values of single-cell RNA sequences based on generating an adversarial network described in this embodiment, the method specifically includes the following steps:
[0043] Step 1. Construct a training set based on real RNA sequence data;
[0044] Step 2, constructing a generation confrontation network, the generation confrontation network includes a generator and a discriminator, wherein the generator is an autoencoder composed of an encoding module and a decoding module;
[0045] Use the training set to train the constructed generative confrontation network;
[0046] Step 3: After TPM normalization is performed on the RNA sequence data to be filled, the TPM normalized result is preprocessed (ie, gene selection and logarithmic transformation), and the trained generative confrontation network is generated according to the preprocessing result. RNA-seq...
specific Embodiment approach 2
[0047] Specific implementation mode two: the difference between this implementation mode and specific implementation mode one is that the specific process of the step one is:
[0048] Step 11, obtaining RNA sequence data from the data set Usoskin as a training set;
[0049] The data set Usoskin was published and shared by literature 1 (Usoskin, D. et al. Unbiased classification of sensory neuron types by large-scale single-cell rnasequencing. Nat. neuroscience 18, 145 (2015)).
[0050] Step 12. Each piece of RNA sequence data obtained is based on the set missing parameters to generate RNA sequence data with missing values, and label the generated RNA sequence data with missing values;
[0051]The label represents whether each gene locus is a missing value. If it is a missing value, the autoencoder learns towards the target data, so that the prediction result of the missing data is as close as possible to the target data, so as to obtain the trained autoencoder parameters. ; ...
specific Embodiment approach 3
[0056] Embodiment 3: This embodiment differs from Embodiment 1 or Embodiment 2 in that the pretreatment methods are gene selection and logarithmic transformation.
[0057] Other steps and parameters are the same as those in Embodiment 1 or Embodiment 2.
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


