Quality control method based on pacbio full-length transcriptome sequencing data
A technology of transcriptome sequencing and quality control method, which is applied in the field of quality control based on PacBio full-length transcriptome sequencing data, which can solve the impact of accuracy, chimeric sequences cannot be correctly identified, primer sequences are not filtered, and chimeric sequences cannot be determined. and other problems to achieve the effect of improving the accuracy
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment 1
[0030] The sequencing data in this example includes 23G of PacBio full-length transcriptome sequencing data of pine pine, and Illumina sequencing data of 3 biological replicates of pine pine samples, and the data volume of each replicate is not less than 6G.
[0031] Analyze the data according to the quality control method of the present invention, filter possible chimeric sequences, and obtain the final transcriptome. The specific method is:
[0032] (1) Using the IsoSeq analysis process to obtain high-quality and low-quality consistent full-length sequences from the original PacBio full-length transcriptome sequencing data;
[0033] (2) Based on the Illumina sequencing data, use proovread to correct the low-quality consistent full-length sequences, and filter the sequences whose average accuracy of the corrected sequences is less than 0.99;
[0034] (3) Merge high-quality and low-quality consistent full-length sequences that meet the conditions after correction, and count t...
Embodiment 2
[0039] The sequencing data of this example includes 21.88G of PacBio full-length transcriptome sequencing data of 1 lemon mixed sample, and the Illumina sequencing data of 3 separate samples (3 biological replicates for each sample) in the mixed sample. Less than 6G.
[0040] Analyze the data according to the quality control method of the present invention, filter possible chimeric sequences, and obtain the final transcriptome. The specific method is:
[0041] (1) Using the IsoSeq analysis process to obtain high-quality and low-quality consistent full-length sequences from the original PacBio full-length transcriptome sequencing data;
[0042] (2) Use proovread to correct low-quality consistent full-length sequences based on Illumina sequencing data, and filter sequences with an average accuracy of less than 0.99 after correction;
[0043] (3) Merge high-quality and low-quality consistent full-length sequences that meet the conditions after correction, and count the lengths ...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com