DNA storage method based on oligonucleotide sequence code storage

An oligonucleotide sequence coding technology, applied in sequence analysis, bioinformatics, and informatics. It addresses two problems of existing encoding methods: reduced data storage capacity of DNA molecules, and complicated, time-consuming calculation. Its effects are to maximize the amount of information stored and to reduce the cost of oligonucleotide synthesis and sequencing.

Active Publication Date: 2019-05-31
SOUTHEAST UNIV

AI Technical Summary

Problems solved by technology

However, existing algorithms (for example, Nature Biotechnology, 2018, 36(3), 242-248; Science, 2017, 355, 950–954) encode oligonucleotide sequences so that they meet the requirements of oligonucleotide synthesis and sequencing. When used for DNA storage, these encoding methods either reduce the data storage capacity of DNA molecules, or are computationally complicated and time-consuming both when encoding a sequence and when restoring the encoded sequence to the original. The maximum theoretical storage capacity of DNA is 2 bits of a binary file per base: if m quaternary DNA symbols, with 4^m possible values, hold the same information as k binary digits, with 2^k possible values, then 4^m = 2^k and m = k/2.
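The 2-bits-per-base limit can be illustrated with a minimal Python sketch. The mapping table and the function names `bits_to_bases` and `bases_to_bits` are assumptions made for illustration only; the patent text shown here does not specify a particular mapping.

```python
# Hypothetical fixed mapping of bit pairs to bases, chosen only for illustration.
BITS_TO_BASE = {"00": "A", "01": "C", "10": "G", "11": "T"}
BASE_TO_BITS = {base: bits for bits, base in BITS_TO_BASE.items()}


def bits_to_bases(bits: str) -> str:
    """Encode an even-length bit string at 2 bits per base."""
    if len(bits) % 2 != 0:
        raise ValueError("bit string length must be even")
    return "".join(BITS_TO_BASE[bits[i:i + 2]] for i in range(0, len(bits), 2))


def bases_to_bits(bases: str) -> str:
    """Decode a base string back to the original bit string."""
    return "".join(BASE_TO_BITS[base] for base in bases)


bits = "0110110001"             # k = 10 binary digits
bases = bits_to_bases(bits)     # m = k / 2 = 5 bases
assert len(bases) == len(bits) // 2
assert bases_to_bits(bases) == bits
```

A single fixed mapping like this reaches the capacity limit but gives no control over the resulting base sequence, which is the gap the encoding schemes discussed above try to close.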



Examples


Embodiment Construction

[0024] Specific embodiments of the present invention are described in further detail below with reference to the accompanying drawings.

[0025] The present invention provides a DNA storage method based on oligonucleotide sequence encoding, used to encode and store a binary-coded file to be processed. In practice, as shown in Figure 1, the method comprises the following steps.

[0026] Step A. Divide the binary-coded file to be processed, in order, into binary string segments. At least the first through the penultimate segments contain the same number of characters, and the number of characters in every segment is even. Then proceed to step B.

[0027] Step B. Based on the DNA oligonucleotide sequence storage, obtain various co...
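The segmentation in Step A can be sketched in Python as follows. This is a minimal illustration under stated assumptions, not the patent's implementation: the function name `split_into_segments` and the parameter `segment_length` are hypothetical, and the input is assumed to be a string of '0' and '1' characters of even total length (true for any whole number of bytes).

```python
from typing import List


def split_into_segments(bits: str, segment_length: int) -> List[str]:
    """Split a bit string, in order, into even-length segments.

    All segments except possibly the last have exactly `segment_length`
    characters; the last one holds the remainder, which is also even.
    """
    if segment_length <= 0 or segment_length % 2 != 0:
        raise ValueError("segment_length must be a positive even number")
    if len(bits) % 2 != 0:
        raise ValueError("total bit count must be even")
    return [bits[i:i + segment_length] for i in range(0, len(bits), segment_length)]


segments = split_into_segments("1" * 20, 8)
assert [len(s) for s in segments] == [8, 8, 4]
```

Because both the total length and the segment length are even, the remainder in the last segment is even too, so every segment can be mapped onto whole bases at 2 bits per base.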


Abstract

The invention relates to a DNA storage method based on oligonucleotide sequence encoding. The method encodes different binary string segments as oligonucleotide sequences using a variety of base-coding formats, so that for each binary string segment an oligonucleotide sequence can always be found that satisfies the requirements of synthesis and sequencing. This greatly simplifies both encoding binary string segments as oligonucleotides and converting the encoded oligonucleotides back into binary string segments, giving efficient encoding and decoding while making maximal use of the data storage capacity of DNA molecules. In addition, each base in the encoded segments stores 2 bits of binary data, the theoretical maximum, which reduces the number or length of oligonucleotide sequences required, lowers the cost of oligonucleotide synthesis and sequencing, and maximizes the amount of information stored.
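The idea of trying several base-coding formats per segment can be sketched as follows. Everything specific here is an assumption made for illustration: the candidate formats are taken to be the 24 one-to-one mappings of bit pairs to bases, and the acceptance thresholds (homopolymer runs of at most 3 bases, GC content between 40% and 60%) stand in for the synthesis and sequencing requirements, which this text does not quantify. The patent's actual formats and selection rule may differ.

```python
from itertools import permutations
from typing import Optional, Tuple

PAIRS = ("00", "01", "10", "11")
# Assumed candidate formats: all 24 one-to-one mappings of bit pairs to bases.
FORMATS = [dict(zip(PAIRS, perm)) for perm in permutations("ACGT")]


def max_homopolymer(seq: str) -> int:
    """Length of the longest run of identical consecutive bases."""
    longest = run = 0
    previous = ""
    for base in seq:
        run = run + 1 if base == previous else 1
        longest = max(longest, run)
        previous = base
    return longest


def gc_fraction(seq: str) -> float:
    """Fraction of bases that are G or C (0.0 for an empty sequence)."""
    return sum(base in "GC" for base in seq) / len(seq) if seq else 0.0


def is_acceptable(seq: str, max_run: int = 3,
                  gc_low: float = 0.4, gc_high: float = 0.6) -> bool:
    # Thresholds are illustrative assumptions, not values from the patent.
    return max_homopolymer(seq) <= max_run and gc_low <= gc_fraction(seq) <= gc_high


def encode_segment(bits: str) -> Optional[Tuple[int, str]]:
    """Return (format index, sequence) for the first acceptable format, else None."""
    for index, fmt in enumerate(FORMATS):
        seq = "".join(fmt[bits[i:i + 2]] for i in range(0, len(bits), 2))
        if is_acceptable(seq):
            return index, seq
    return None


def decode_segment(index: int, seq: str) -> str:
    """Invert the chosen format to recover the original bit string."""
    inverse = {base: pair for pair, base in FORMATS[index].items()}
    return "".join(inverse[base] for base in seq)


index, seq = encode_segment("01100110")
assert decode_segment(index, seq) == "01100110"
```

In this sketch the format index must be stored alongside the sequence so the decoder knows which mapping to invert. Simple permutations do not rescue every input: a segment such as `"0000000011111111"` yields a run of four identical bases under any one-to-one mapping, so `encode_segment` returns `None` for it.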

Description

Technical field

[0001] The invention relates to a DNA storage method based on oligonucleotide sequence encoding, and belongs to the technical field of information storage.

Background

[0002] In digital storage, existing documents such as text and pictures are converted to binary-coded files and stored on media such as hard disks and magnetic tapes. Humanity generates 2.5 quintillion (10^18) bytes of data every day, and the rate is still accelerating. IBM estimates that 90% of the data in the world was generated in the past two years. As more and more of human life relies on digital storage, this trend will only continue. Current storage devices can no longer meet the demand for such huge amounts of data, so people have begun to turn to new storage media.

[0003] DNA is often referred to as the "blueprint of life" for obvious reasons. Every cell in the human body is encoded by a sequence of bases (A, G, C, and ...


Application Information

Patent Type & Authority: Applications (China)
IPC(8): G16B30/20
CPC: Y02D10/00
Inventors: 肖鹏峰, 费中杰
Owner: SOUTHEAST UNIV