Coding and decoding method for integrity check and error correction of DNA sequence

A DNA sequence and integrity verification technology, applied in the field of bioinformatics, can solve problems such as the limitation of error correction ability, destroy the purity of the original sequence, and difficult error correction ability, etc., and achieve the effect of flexible length

Active Publication Date: 2021-05-14
WUHAN UNIV
View PDF6 Cites 5 Cited by
  • Summary
  • Abstract
  • Description
  • Claims
  • Application Information

AI Technical Summary

Problems solved by technology

In order to provide such support, the existing methods usually make certain concessions, such as: having to sacrifice a part of the base sequence or introduce an additional base sequence as the carrier of the integrity check code, resulting in a decrease in information capacity or destroying the original sequence Another example: Most of the improved schemes based on the existing mature error-correcting codes occupy too many bases on the one hand, and on the other hand, the error-correcting ability is limited by factors such as the code distance. Designed as a fixed value, that is, at the beginning of the scheme design, the error correction capability is strictly limited, so it is difficult to expand to continue to explore its error correction capability

Method used

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
View more

Image

Smart Image Click on the blue labels to locate them in the text.
Viewing Examples
Smart Image
  • Coding and decoding method for integrity check and error correction of DNA sequence
  • Coding and decoding method for integrity check and error correction of DNA sequence
  • Coding and decoding method for integrity check and error correction of DNA sequence

Examples

Experimental program
Comparison scheme
Effect test

Embodiment Construction

[0030] In order to facilitate those of ordinary skill in the art to understand and implement the present invention, the present invention will be described in further detail below in conjunction with the accompanying drawings and embodiments. It should be understood that the implementation examples described here are only used to illustrate and explain the present invention, and are not intended to limit this invention.

[0031] Such as figure 1 As shown, the encoding and decoding method of DNA sequence integrity checking and error correction of the present invention comprises the following steps:

[0032] Step 1: Encoding (embedding of integrity information). The DNA coder uses the DNA integrity coding algorithm, uses the key shared by the coder and the decoder and the codon bias table Table_CodonBias, the DNA sequence S to be integrity protected 0 Perform calculations and output the DNA sequence S embedded with integrity check information 1 ,See figure 2 .

[0033] The...

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

PUM

No PUM Login to view more

Abstract

The invention discloses a coding and decoding method for DNA sequence integrity check and error correction. The coding and decoding method comprises a DNA integrity coding algorithm and a DNA integrity decoding algorithm. A coder uses a DNA integrity coding algorithm to code a DNA sequence to be subjected to integrity protection, and uses codon degeneracy to embed integrity verification information without changing corresponding amino acids, and does not introduce additional bases. After the synthesized DNA sequence is subjected to a biochemical process, a sequencing result has the possibility of introducing base insertion, deletion and replacement. A decoder can use a DNA integrity decoding algorithm to perform verification and error correction on a sequencing result. If the sequencing result of the sequence is error-free, the decoding algorithm must feed back that the sequencing result is error-free; if an error exists in the sequence sequencing result, the decoding algorithm judges that the error exists in an extremely high probability and corrects the error, and the recovered sequence capable of passing through the decoding algorithm is a decoding result; and if the error digits exceed the error digits willing to undertake by the decoders, the serious errors are prompted to be unrecoverable.

Description

technical field [0001] The invention belongs to the field of bioinformatics, and in particular relates to a coding and decoding method for DNA sequence integrity check and error correction. Background technique [0002] Bioinformatics is an interdisciplinary subject that uses the methods of applied mathematics, informatics, statistics and computer science to study biological problems. As early as the 1860s, the academic community proposed the concept of DNA-based data storage. After nearly 60 years of development, research related to DNA storage has gradually become an important branch in the field of bioinformatics. [0003] In terms of storage media for DNA storage, there are mainly two categories: in vivo-based information storage and in vitro information storage. The early enlightening research was limited by the level of DNA sequencing and synthesis technology at that time, using in vivo information storage methods, using living cells (such as bacteria, etc.) to carry ...

Claims

the structure of the environmentally friendly knitted fabric provided by the present invention; figure 2 Flow chart of the yarn wrapping machine for environmentally friendly knitted fabrics and storage devices; image 3 Is the parameter map of the yarn covering machine
Login to view more

Application Information

Patent Timeline
no application Login to view more
Patent Type & Authority Applications(China)
IPC IPC(8): G16B20/20G16B30/10
CPCG16B20/20G16B30/10
Inventor 彭蓉王天宇崔竞松齐浩汪鹏程薛慧刘艺扬李嘉伟
Owner WUHAN UNIV
Who we serve
  • R&D Engineer
  • R&D Manager
  • IP Professional
Why Eureka
  • Industry Leading Data Capabilities
  • Powerful AI technology
  • Patent DNA Extraction
Social media
Try Eureka
PatSnap group products