Method, system, and medium for compressing and decompressing the CIGAR field of SAM and BAM files
A file compression and decompression technology, applied in the field of bioinformatics, that addresses the large and so far neglected room for optimization in the CIGAR field and achieves a good compression effect, a high compression ratio, and a wide range of application.
Embodiment Construction
[0034] SAM and BAM files store the results produced by analysis software when it aligns short fragment sequences to reference sequences. To describe these alignment results, SAM and BAM files define the CIGAR field. It is the sixth field of a SAM or BAM record; it records the complete alignment information between the short fragment sequence and the reference sequence, written as a series of numbers each followed by an operator. For example, in "100M", 100 is the length of the operator M, and the operator M indicates an alignment match. If the content of the CIGAR field is "100M", the 100 positions of the short fragment sequence starting at position 1 are alignment-matched to the 100 positions of the reference sequence starting at position POS. The POS value of the reference sequence is recorded in the fourth field of BAM, and which reference sequence (or chromosome) it corresponds to is indexed by the name o...
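The number-plus-operator rule described above can be illustrated with a short Python sketch. It is not taken from the patent: the function names `parse_cigar` and `reference_end` are hypothetical, and the operator set `MIDNSHP=X` and the choice of which operators advance along the reference are assumptions based on the public SAM specification rather than on the text above.

```python
import re

# One CIGAR element: a length followed by a single operator letter.
# Assumption: the operator alphabet MIDNSHP=X comes from the SAM specification.
_CIGAR_ELEMENT = re.compile(r"(\d+)([MIDNSHP=X])")


def parse_cigar(cigar):
    """Split a CIGAR string such as "100M" into (length, operator) pairs."""
    if cigar == "*":  # SAM writes "*" when no CIGAR is available
        return []
    pairs = []
    end = 0
    for match in _CIGAR_ELEMENT.finditer(cigar):
        if match.start() != end:  # unparsed characters between elements
            raise ValueError(f"malformed CIGAR string: {cigar!r}")
        pairs.append((int(match.group(1)), match.group(2)))
        end = match.end()
    if end != len(cigar) or not pairs:  # trailing garbage or empty input
        raise ValueError(f"malformed CIGAR string: {cigar!r}")
    return pairs


def reference_end(pos, cigar):
    """Return the last 1-based reference position covered by the alignment.

    Assumption: only M, D, N, = and X advance along the reference.
    """
    ref_len = sum(n for n, op in parse_cigar(cigar) if op in "MDN=X")
    return pos + ref_len - 1


if __name__ == "__main__":
    print(parse_cigar("100M"))
    print(reference_end(1, "100M"))
```

For the "100M" example, `parse_cigar` yields a single pair of length 100 and operator M, and `reference_end` adds that length to POS to find where the alignment stops on the reference.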