Compressed storage and construction method for a bidirectional multi-step de Bruijn graph
A compressed-storage technique for bidirectional multi-step de Bruijn graphs, applied in the field of electrical digital data processing, that reduces single-machine memory consumption and speeds up sequence splicing (assembly).
Examples
Embodiment 1
[0058] An embodiment of the present invention provides a method for the compressed storage and construction of a bidirectional multi-step de Bruijn graph, comprising:
[0059] Compressed storage steps, specifically:
[0060] S11. Read a sequence s;
[0061] S12. Cut the sequence s into multiple fragments t using a sliding window of length k, where 0<k<32 and k is an odd number;
[0062] S13. Encode each fragment t using the nucleic acid coding table {A:00, C:01, G:10, T:11}, and represent the result as a 64-bit integer a;
[0063] S14. Reverse the character string of fragment t, replace each character of the reversed string with its complement using the symmetric complement table {A->T, C->G, G->C, T->A} to obtain the complementary string v, then encode v using the nucleic acid coding table from step S13 and represent the result as a 64-bit integer b;
[0064] S15, taking the maximum number of the integer a and the integer b as the ma...
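Steps S12 to S15 can be sketched in Python as follows. This is a minimal illustration, not the patented implementation: the function names are hypothetical, and packing the first base into the most significant bits is an assumption, since the text does not specify a bit order. Because each base takes 2 bits, the bound k<32 keeps every encoded fragment within 64 bits.

```python
# 2-bit nucleic acid coding table from step S13.
CODE = {"A": 0b00, "C": 0b01, "G": 0b10, "T": 0b11}
# Symmetric complement table from step S14.
COMPLEMENT = {"A": "T", "C": "G", "G": "C", "T": "A"}


def encode(fragment: str) -> int:
    """Pack a fragment into an integer, 2 bits per base.

    Assumption: the first base ends up in the most significant bits.
    """
    value = 0
    for base in fragment:
        value = (value << 2) | CODE[base]
    return value


def reverse_complement(fragment: str) -> str:
    """Reverse the fragment, then complement every character (step S14)."""
    return "".join(COMPLEMENT[base] for base in reversed(fragment))


def marker_values(s: str, k: int):
    """Yield (fragment, marker) pairs for every window of length k in s."""
    if not (0 < k < 32 and k % 2 == 1):
        raise ValueError("k must be odd and satisfy 0 < k < 32")
    for i in range(len(s) - k + 1):
        t = s[i:i + k]                     # S12: sliding window
        a = encode(t)                      # S13: encode the fragment
        b = encode(reverse_complement(t))  # S14: encode its reverse complement
        yield t, max(a, b)                 # S15: keep the larger integer


if __name__ == "__main__":
    for fragment, marker in marker_values("ACGT", 3):
        print(fragment, marker)
```

In this sketch a fragment and its reverse complement always receive the same marker, so both strands of a sequence map to a single stored value.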
Embodiment 2
[0078] Another embodiment of the present invention provides a method for the compressed storage and construction of a bidirectional multi-step de Bruijn graph, comprising:
[0079] Compressed storage steps, specifically:
[0080] S11. Read a sequence s;
[0081] S12. Cut the sequence s into multiple fragments t using a sliding window of length k, where 0<k<32 and k is an odd number;
[0082] S13. Encode each fragment t using the nucleic acid coding table {A:00, C:01, G:10, T:11}, and represent the result as a 64-bit integer a;
[0083] S14. Reverse fragment t, complement the reversed fragment using the symmetric complement table to obtain the complementary fragment v, then encode v using the nucleic acid coding table from step S13 and represent the result as a 64-bit integer b;
[0084] S15. Take the larger of the integers a and b as the marker value of the k-mer formed by fragment t and its complementary fragment v;
[0085] S16. Steps S11-S...
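With the coding table of step S13, the complement of a base is its 2-bit code XORed with 11 (A:00 and T:11, C:01 and G:10), so the integer b of step S14 can also be derived directly from a without building the complementary string. The sketch below is an alternative illustration rather than a procedure stated in the text; the function name is hypothetical, and it assumes the first base of a fragment occupies the most significant bits. It also illustrates one consequence of requiring an odd k: no fragment can equal its own reverse complement, so a and b always differ.

```python
def reverse_complement_bits(a: int, k: int) -> int:
    """Return the 2-bit encoding of the reverse complement of a k-base fragment.

    Assumption: `a` stores the first base in its most significant bits.
    """
    b = 0
    for _ in range(k):
        # Take the last base of a, complement it (XOR with 0b11), append it to b.
        b = (b << 2) | ((a & 0b11) ^ 0b11)
        a >>= 2
    return b


# "ACG" encodes to 0b000110; its reverse complement "CGT" encodes to 0b011011.
assert reverse_complement_bits(0b000110, 3) == 0b011011

# Odd k: no 3-base fragment equals its own reverse complement.
assert all(reverse_complement_bits(a, 3) != a for a in range(4 ** 3))

# Even k: self-complementary fragments exist, for example "ACGT" (0b00011011).
assert reverse_complement_bits(0b00011011, 4) == 0b00011011
```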