DNA-based Information Storage Method

An information storage and DNA sequence technology, applied in the field of information storage, can solve the problems of high replication cost, unbalanced, poor base mutation rate, etc., and achieve the effect of improving the carrying rate, mass replication, and easy implementation.

Active Publication Date: 2019-03-12
TIANJIN UNIV
View PDF7 Cites 15 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

[0006] The existing DNA information storage technology is limited by DNA synthesis technology, mainly focusing on short sequences (below 200bp), and its storage technology has the following disadvantages: simple encoding (error-correcting code) (mainly Huffman+RS error-correcting code), DNA sequence Low actual loading rate (caused by other invalid sequences such as primers, addresses, and redundancy, the actual loading rate is usually below 60%), high replication cost (PCR amplification requires a large amount of DNA polymerase when a large amount of replication is required, and the cost is high) and poor replication fidelity (high base mutation rate and possible uneven amplification during PCR)

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • DNA-based Information Storage Method
  • DNA-based Information Storage Method
  • DNA-based Information Storage Method

Examples

Experimental program
Comparison scheme
Effect test

Embodiment

[0041] 1. Information coding

[0042] In this embodiment, the file "Excerpts from Tianda Regulations.txt" is selected for encoding, and the excerpts include:

[0043] Statutes of Tianjin University

[0044] preamble

[0045] Tianjin University, formerly known as Beiyang University, was founded on October 2, 1895. It is the earliest institution of higher learning in the history of modern higher education in my country. In September 1951, it was named Tianjin University with the approval of the Central People's Government Administration Council after the adjustment of the national higher education departments. In 1959, it was identified as a national key university by the Central Committee of the Communist Party of China. In 2000, it was identified as a high-level research university under the key construction of the national "985 Project".

[0046] Self-improvement starts with accumulating talents, and accumulating talents must first establish schools. The purpose of the s...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention relates to the technical field of information storage, in particular to a DNA-based information storage method. A long sequence in vivo DNA information storage technique is provide. Themain goal is to construct a coding system with strong error correction mechanism based on LDPC and BCH codes, which can reduce the redundancy of primers and indexes by long sequence coding, and achieve a high actual carrying capacity (above 97%). The main goal is to construct a medium-length DNA sequence (above 1Kbp). Saccharomyces cerevisiae in vivo assembly system is used to assemble and store long sequences and preserve information, and the information can be replicated at low cost, high fidelity and high speed by model organisms such as Saccharomyces cerevisiae, Escherichia coli or Bacillus. At the same time, because of the existence of strong error correction system, the data in the bacterial cell can be reduced perfectly with low coverage (1-5 X) in the second generation and third generation sequencing.

Description

technical field [0001] The invention relates to the technical field of information storage, in particular to a DNA-based information storage method. Background technique [0002] Human society has entered the era of information explosion, and has higher requirements for storage media and storage systems. According to the survey jointly conducted by International Data Corporation and EMC, the total amount of global data is growing rapidly at an annual growth rate of 50%. In 2017, the total amount of global data has reached 16ZB, which is equivalent to the total amount of global data ten years ago. 100 times. According to the survey estimates, by 2020, the total amount of global data will reach 44ZB, which is equivalent to the combined storage capacity of 45,000 existing Amazon data storage centers. [0003] Modern digital computers store, communicate, and manipulate binary data, often represented as 0s and 1s. The storage and transmission of these bits of information are l...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G06N3/12G06F11/10
CPCG06F11/1048G06N3/123
Inventor 元英进韩明哲陈为刚章新晨
Owner TIANJIN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products