
A storage method for binary representation of gene information

A binary-representation technology, applied in the field of bioinformatics, that addresses problems such as large storage footprint and storage formats that hinder analysis of genetic data.

Active Publication Date: 2018-08-10
MELUX TECH CO LTD
Cites: 7; Cited by: 0

AI Technical Summary

Problems solved by technology

Storage in this traditional format not only takes up a huge amount of space but also hinders further analysis of genetic data (such as artificial-intelligence data mining).



Examples


Embodiment Construction

[0018] The technical solutions in the embodiments of the present invention will be clearly and completely described below. Obviously, the described embodiments are only some of the embodiments of the present invention, but not all of them. Based on the embodiments of the present invention, all other embodiments obtained by persons of ordinary skill in the art without making creative efforts belong to the protection scope of the present invention.

[0019] Referring to Figure 1, the present invention provides the following technical solution:

[0020] DNA (deoxyribonucleic acid) is a molecule with a double-stranded, double-helix structure composed of deoxyribonucleotides (each consisting of deoxyribose, phosphoric acid, and one of four nitrogenous bases). It can encode genetic instructions that guide the development of organisms and the operation of life functions. DNA fragments that carry genetic information are called genes. Deoxynucleotides are the basic structural and func...



Abstract

The invention discloses a storage method for the binary representation of gene information. The method includes the following steps: based on the DNA double-strand structure and base pairing, each matched base pair is represented by a character; the four types of base pairs are assigned numerical values, each represented by a two-bit binary number; a base group composed of three base pairs is represented by an eight-bit (one-byte) binary number made up of a six-bit binary number and a two-bit fixed assignment; a linear mapping is used to map the 64 base groups, numbered 0-63, evenly onto the range 0-255 according to the formula Y=4X, Y=4X+1, Y=4X+2, or Y=4X+3; all values in the range 0 to 255 are converted into eight-bit binary numbers, and the data is stored as a binary byte stream.
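The packing described in the abstract can be illustrated with a minimal Python sketch. Several details here are assumptions, since the abstract does not state them: the specific two-bit value given to each base-pair character (`BASE_CODE`), the order in which the three base pairs fill the six bits, the names `encode` and `decode`, and the choice to reject sequences whose length is not a multiple of three. The `offset` parameter stands for the two-bit fixed assignment, so it selects among Y=4X, Y=4X+1, Y=4X+2, and Y=4X+3.

```python
# Assumed two-bit codes: the abstract does not say which base pair gets which value.
# Each character stands for one matched base pair, named by the base on one strand.
BASE_CODE = {"A": 0b00, "C": 0b01, "G": 0b10, "T": 0b11}
CODE_BASE = {code: base for base, code in BASE_CODE.items()}


def encode(seq: str, offset: int = 0) -> bytes:
    """Pack each group of three base pairs into one byte using Y = 4X + offset."""
    if not 0 <= offset <= 3:
        raise ValueError("offset must be 0, 1, 2 or 3")
    if len(seq) % 3 != 0:
        # Handling of a trailing partial group is not described in the abstract.
        raise ValueError("sequence length must be a multiple of 3")
    out = bytearray()
    for i in range(0, len(seq), 3):
        b0, b1, b2 = (BASE_CODE[ch] for ch in seq[i:i + 3])
        x = (b0 << 4) | (b1 << 2) | b2  # six-bit group value X, 0..63
        out.append(4 * x + offset)      # linear map into 0..255
    return bytes(out)


def decode(data: bytes) -> str:
    """Recover the base-pair characters; dividing by 4 drops the fixed two bits."""
    bases = []
    for y in data:
        x = y // 4
        bases.append(CODE_BASE[(x >> 4) & 0b11])
        bases.append(CODE_BASE[(x >> 2) & 0b11])
        bases.append(CODE_BASE[x & 0b11])
    return "".join(bases)
```

Under these assumptions, `encode("ACG")` yields the single byte 24 (X = 6, Y = 4 × 6), and `decode` inverts `encode` for any of the four offsets because the offset occupies only the two lowest bits.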

Description

Technical field

[0001] The invention relates to the field of bioinformatics, in particular to a processing technology for data storage after gene detection, and specifically to a storage method for binary representation of gene information.

Background

[0002] The maturity and popularization of high-throughput gene sequencing technology has gradually reduced the cost and time of genetic testing, and with the development and commercialization of newer, higher-throughput, faster, and lower-cost sequencing technology, gene sequencing has entered the commercial model of personal genetic testing. However, the data obtained from genetic testing is massive. Sequencing output is generally stored in the SAM (Sequence Alignment Map) / BAM (Binary Alignment Map) format, which can compactly represent the nucleotide sequence. This traditional format storage not only takes up huge storage space, but al...


Application Information

Patent Type & Authority: Patents (China)
IPC(8): G06F19/28, G06F19/26, G06F19/24, G16B45/00
CPC: G16B40/00, G16B45/00, G16B50/00
Inventor: 谢清禄徐宏锴朱军余孟春
Owner: MELUX TECH CO LTD