Method for judging saturation of sequencing data, computer readable medium and application
A technology of sequencing data and data, applied in the field of sequencing, can solve problems such as inability to clearly reflect the saturation of sequencing data, uneven data volume, etc., and achieve the effect of accurate judgment and guaranteed accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0068] Using the 16S sequencing data of 316 cases, using the Illumina sequencing platform, using the PE250 sequencing strategy, using 97% sequence similarity as the threshold, clustering to generate a Cluster, and calculating the S value to draw its distribution map, with the mean value plus 1.282 times the variance As the threshold, define the index Saturated to measure data saturation. The interval range is 0 to 0.44, that is, the Saturated value is less than 0.44, the sequencing data is saturated, and greater than or equal to 0.44, the sequencing data is not saturated. The result is as follows figure 2 shown.
[0069] In addition, 22 cases of 16S sequencing data were used, using the Illumina sequencing platform, using the PE250 sequencing strategy, with 97% sequence similarity as the threshold, clustering to generate Clusters, and drawing the dilution curve change curve to calculate the Saturated value, according to figure 2 The obtained reference value of 0.44 judges whe...
Embodiment 2
[0072] Using 261 Arabidopsis transcriptome sequencing data, using the Illumina sequencing platform, using the SE50 sequencing strategy, using 95% sequence similarity as the threshold, clustering to generate a Cluster, and calculating the Saturated value to draw its distribution map, adding the mean value The upper 1.282 times the variance is the threshold, and the range of Saturated, an indicator for measuring data saturation, is defined as 0 to 0.48, that is, the Saturated value is less than 0.48, the sequencing data is saturated, and greater than or equal to 0.48, the sequencing data is not saturated, and the result is as follows image 3 shown.
[0073] In addition, the transcriptome sequencing data of 10 cases were used, using the Illumina sequencing platform, using the SE50 sequencing strategy, using 95% sequence similarity as the threshold, clustering to generate Clusters, and drawing the dilution curve change curve to calculate the Saturated value, according to image 3...
PUM
Login to View More Abstract
Description
Claims
Application Information
Login to View More 


