A Parallel Compression Method for Gene Sequencing Data Quality Score
A data quality and quality score technology, applied in bioinformatics, instrumentation, biostatistics, etc., can solve problems such as lack of practicability and low processing speed, and achieve the effect of enhancing practicability, improving processing speed, and strong applicability
- Summary
- Abstract
- Description
- Claims
- Application Information
AI Technical Summary
Problems solved by technology
Method used
Image
Examples
Embodiment Construction
[0030] The present invention will be further described below in conjunction with specific examples.
[0031] Such as figure 1 As shown, the method for parallel compression of gene sequencing data quality scores provided in this embodiment includes the following steps:
[0032] 1) In the input FASTQ file, each 4 lines represent a piece of gene sequencing information, such as figure 2 shown. The fourth line of the four lines is a quality score, which is equal to the length of the base sequence information in the second line, and each quality score represents the sequencing accuracy of the base data at the same position in the second line. Here only the quality score line is kept for the following compression process.
[0033] 2) For the extracted quality scores, the main thread calculates the score of each row, and the higher the score, the more high-frequency substrings are included. Quality scores are assigned to Category 1 or Category 2 in behavioral units according to t...
PUM
Abstract
Description
Claims
Application Information
- R&D Engineer
- R&D Manager
- IP Professional
- Industry Leading Data Capabilities
- Powerful AI technology
- Patent DNA Extraction
Browse by: Latest US Patents, China's latest patents, Technical Efficacy Thesaurus, Application Domain, Technology Topic, Popular Technical Reports.
© 2024 PatSnap. All rights reserved.Legal|Privacy policy|Modern Slavery Act Transparency Statement|Sitemap|About US| Contact US: help@patsnap.com