Code generation method, code generating apparatus and computer readable storage medium

A code generation technology, applied to code generation methods, code generating apparatus and computer-readable storage media. It addresses the problems of reduced sequencing accuracy, oligonucleotides that do not contain correct information, and hindered decoding of information, thereby improving reliability.

Status: Inactive
Publication date: 2017-09-12
THOMSON LICENSING SA
Cites: 7
Cited by: 0

AI Technical Summary

Problems solved by technology

[0016] Long self-reverse-complementary fragments may not be readily sequenced, which prevents proper decoding of the information encoded in the strands.
[0017] In addition, testing has shown that nucleotide runs (i.e., cascades or sequences of identical nucleotides) can reduce sequencing accuracy if the run length exceeds a certain length.
[0018] In addition, many sequenced oligonucleotides may not contain correct information, because amplification and sequencing introduce errors at different positions in the oligonucleotide.
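
The first two constraints can be checked mechanically on a candidate nucleotide sequence. The following is a minimal sketch and is not taken from the patent: the function names `max_run_length` and `longest_self_reverse_complement` are illustrative assumptions, and the brute-force search is only practical for short oligonucleotides.

```python
from itertools import groupby

# Watson-Crick base pairing for DNA.
COMPLEMENT = {"A": "T", "T": "A", "C": "G", "G": "C"}


def max_run_length(seq: str) -> int:
    """Return the length of the longest run of identical nucleotides."""
    return max((sum(1 for _ in group) for _, group in groupby(seq)), default=0)


def reverse_complement(seq: str) -> str:
    """Return the reverse complement of a DNA sequence."""
    return "".join(COMPLEMENT[base] for base in reversed(seq))


def longest_self_reverse_complement(seq: str) -> int:
    """Return the length of the longest fragment that equals its own
    reverse complement (0 if there is none). Brute force, O(n^3)."""
    n = len(seq)
    for length in range(n, 0, -1):
        for start in range(n - length + 1):
            fragment = seq[start:start + length]
            if fragment == reverse_complement(fragment):
                return length
    return 0
```

A candidate sequence could then be rejected when either value exceeds a chosen limit; the source does not state specific limits, so any threshold would be an assumption.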

Detailed Description of Embodiments

[0070] For better understanding, the present principle will now be explained in more detail in the following description with reference to the accompanying drawings. It shall be understood that the present principle is not limited to these exemplary embodiments and that specified features may be suitably combined and/or modified without departing from the scope of the present principle as defined in the appended claims.

[0071] Figure 1 schematically shows an embodiment of a codebook generation method 100 for mapping multiple source codewords to multiple target codewords. The term "codeword" refers to a sequence of code symbols, such as binary or quaternary code symbols. A "source codeword" is used to provide pieces of information, such as a binary encoded bit stream, while a "target codeword" is a modulation sequence of code symbols that carries the pieces of information in a format suitable for transcoding, from which synthetic oligonucleotides are generated.
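
To make the terminology concrete, the sketch below turns a binary bit string into quaternary code symbols and then into nucleotides. It is only an illustration: the symbol assignment 0→A, 1→C, 2→G, 3→T and the helper names `bits_to_quaternary` and `quaternary_to_nucleotides` are assumptions, and this direct two-bits-per-symbol conversion is a naive baseline, not the codebook mapping of method 100.

```python
# Assumed (arbitrary) assignment of quaternary symbols 0..3 to nucleotides.
NUCLEOTIDES = "ACGT"


def bits_to_quaternary(bits: str) -> list[int]:
    """Convert a binary string such as '0110' into quaternary symbols,
    taking two bits per symbol."""
    if len(bits) % 2:
        raise ValueError("bit string length must be even")
    return [int(bits[i:i + 2], 2) for i in range(0, len(bits), 2)]


def quaternary_to_nucleotides(symbols: list[int]) -> str:
    """Map quaternary symbols to a nucleotide string."""
    return "".join(NUCLEOTIDES[s] for s in symbols)
```

For instance, `quaternary_to_nucleotides(bits_to_quaternary("00011011"))` yields a four-nucleotide string under the assumed assignment.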

[0072]...

Abstract

A code book is generated for mapping source code words to target code words, which allows source data to be encoded with a reduced probability of incorrect decoding, e.g. for DNA storage. The target code words are grouped (102) into subsets and comprise identifying portions and remaining portions. The identifying portions of target code words belonging to the same subset are identical. A first code symbol set of the source code words is selected (103) for addressing the subsets. For each subset, neighboring subsets are determined (104): the identifying portions of the target code words of a neighboring subset differ from those of the corresponding subset by up to a predetermined number of symbols. Source code words whose first code symbols address the same subset are assigned (105) to that subset such that the number of target code words of that subset whose remaining portions are identical to those in its neighboring subsets corresponds to an optimization criterion.
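
The grouping and neighbor-determination steps can be sketched as follows. This is one hedged reading of the abstract, not the patented procedure: it assumes the identifying portion is a fixed-length prefix (`id_len`), that "differ by up to a predetermined amount of symbols" means Hamming distance, and that the quantity of interest is a simple count. All function names are hypothetical.

```python
from collections import defaultdict
from itertools import combinations


def hamming(a: str, b: str) -> int:
    """Number of positions at which two equal-length strings differ."""
    if len(a) != len(b):
        raise ValueError("strings must have equal length")
    return sum(x != y for x, y in zip(a, b))


def group_by_identifying_portion(target_codewords, id_len):
    """Group target code words into subsets keyed by their identifying
    portion (assumed here to be the first id_len symbols). Each subset
    stores the remaining portions of its code words."""
    subsets = defaultdict(list)
    for codeword in target_codewords:
        subsets[codeword[:id_len]].append(codeword[id_len:])
    return dict(subsets)


def neighboring_subsets(subsets, max_distance):
    """For each subset, find the subsets whose identifying portion
    differs by at most max_distance symbols."""
    neighbors = {key: set() for key in subsets}
    for a, b in combinations(subsets, 2):
        if hamming(a, b) <= max_distance:
            neighbors[a].add(b)
            neighbors[b].add(a)
    return neighbors


def count_shared_remaining_portions(subsets, neighbors, key):
    """Count code words in subset `key` whose remaining portion also
    occurs in at least one neighboring subset."""
    return sum(
        1
        for remaining in subsets[key]
        if any(remaining in subsets[n] for n in neighbors[key])
    )
```

The abstract does not say what the optimization criterion is, so the sketch only computes the count that such a criterion would be applied to.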

Description

Technical field

[0001] A code generation method and apparatus are proposed. In particular, the present disclosure relates to methods and apparatus for mapping source codewords to target codewords (e.g., suitable for encoding information for storage in synthetic nucleic acid strands), and to corresponding computer-readable storage media.

Background art

[0002] Nucleic acids are polymeric macromolecules and consist of sequences of monomers called nucleotides. Each nucleotide consists of a sugar component, a phosphate group, and a nitrogenous base or nucleobase. A nucleic acid molecule in which the sugar component of the nucleotides is deoxyribose is a DNA (deoxyribonucleic acid) molecule, while a nucleic acid molecule in which the sugar component of the nucleotides is ribose is called an RNA (ribonucleic acid) molecule. DNA and RNA are biopolymers that occur in living organisms.

[0003] Nucleic acid molecules are assembled into strings or chains of nucleotides. Nucleic acid molecules can be a...

Application Information

Patent Type & Authority: Applications (China)
IPC(8): G06N3/12, G11C7/10, G11C13/00, H03M5/14
CPC: G06N3/123, G11C7/1006, G11C13/0019, H03M5/145
Inventor: 陈晓明, M. 布拉瓦特, K. 盖德克, I. 许特尔
Owner: THOMSON LICENSING SA